climateprediction.net home page
Task 15440282

Task 15440282

Name hadcm3n_z9ma_1880_40_008245316_2
Workunit 8400440
Created 20 Nov 2012, 19:20:06 UTC
Sent 20 Nov 2012, 19:20:42 UTC
Report deadline 20 Feb 2013, 2:47:53 UTC
Received 3 Dec 2012, 9:37:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1212089
Run time 12 days 8 hours 28 min 19 sec
CPU time 8 days 22 hours 2 min 49 sec
Validate state Invalid
Credit 6,531.84
Device peak FLOPS 1.69 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.29</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
09:33:07 (27055): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:33:08 (27055): No heartbeat from core client for 30 sec - exiting
22:50:31 (30574): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:50:32 (30574): No heartbeat from core client for 30 sec - exiting
22:50:33 (30574): No heartbeat from core client for 30 sec - exiting
22:50:34 (30574): No heartbeat from core client for 30 sec - exiting
22:50:35 (30574): No heartbeat from core client for 30 sec - exiting
22:50:36 (30574): No heartbeat from core client for 30 sec - exiting
22:50:39 (30574): No heartbeat from core client for 30 sec - exiting
22:50:40 (30574): No heartbeat from core client for 30 sec - exiting
22:50:41 (30574): No heartbeat from core client for 30 sec - exiting
22:50:42 (30574): No heartbeat from core client for 30 sec - exiting
22:50:43 (30574): No heartbeat from core client for 30 sec - exiting
03:34:50 (27926): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:42:07 (24594): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:42:08 (24594): No heartbeat from core client for 30 sec - exiting
07:10:10 (4906): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:26:32 (5210): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:26:33 (5210): No heartbeat from core client for 30 sec - exiting
10:26:34 (5210): No heartbeat from core client for 30 sec - exiting
10:26:35 (5210): No heartbeat from core client for 30 sec - exiting
10:26:36 (5210): No heartbeat from core client for 30 sec - exiting
10:26:37 (5210): No heartbeat from core client for 30 sec - exiting
10:26:38 (5210): No heartbeat from core client for 30 sec - exiting
10:26:39 (5210): No heartbeat from core client for 30 sec - exiting
10:26:40 (5210): No heartbeat from core client for 30 sec - exiting
10:26:41 (5210): No heartbeat from core client for 30 sec - exiting
19:10:03 (6006): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:10:04 (6006): No heartbeat from core client for 30 sec - exiting
19:10:05 (6006): No heartbeat from core client for 30 sec - exiting
19:10:07 (6006): No heartbeat from core client for 30 sec - exiting
19:10:08 (6006): No heartbeat from core client for 30 sec - exiting
19:10:09 (6006): No heartbeat from core client for 30 sec - exiting
19:10:10 (6006): No heartbeat from core client for 30 sec - exiting
19:10:11 (6006): No heartbeat from core client for 30 sec - exiting
19:10:12 (6006): No heartbeat from core client for 30 sec - exiting
06:38:22 (8331): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:38:23 (8331): No heartbeat from core client for 30 sec - exiting
04:30:42 (9985): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:30:43 (9985): No heartbeat from core client for 30 sec - exiting
04:30:44 (9985): No heartbeat from core client for 30 sec - exiting
07:30:21 (5558): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:30:25 (5558): No heartbeat from core client for 30 sec - exiting
07:00:56 (6612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77ca400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77ca430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75dbc3f]
/lib32/libc.so.6(abort+0x175)[0xf75dd505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c6ba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7704400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7704430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7515c3f]
/lib32/libc.so.6(abort+0x175)[0xf7517505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7500ba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7738400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7738430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7549c3f]
/lib32/libc.so.6(abort+0x175)[0xf754b505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7534ba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf7782400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf7782430]
/lib32/libc.so.6(gsignal+0x4f)[0xf7593c3f]
/lib32/libc.so.6(abort+0x175)[0xf7595505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf757eba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf772d400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf772d430]
/lib32/libc.so.6(gsignal+0x4f)[0xf753ec3f]
/lib32/libc.so.6(abort+0x175)[0xf7540505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf7529ba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77c4400]
linux-gate.so.1(__kernel_vsyscall+0x10)[0xf77c4430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75d5c3f]
/lib32/libc.so.6(abort+0x175)[0xf75d7505]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c0ba3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7648, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Dec 2012 17:50:37 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 544,320 739,424 1.3584
02 Dec 2012 04:12:47 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 518,400 704,365 1.3587
01 Dec 2012 14:27:33 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 492,480 669,320 1.3591
01 Dec 2012 00:44:27 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 466,560 633,788 1.3584
30 Nov 2012 11:02:23 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 440,640 598,565 1.3584
29 Nov 2012 21:19:06 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 414,720 563,289 1.3582
29 Nov 2012 08:02:19 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 388,800 527,809 1.3575
28 Nov 2012 18:30:19 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 362,880 492,073 1.3560
28 Nov 2012 05:21:00 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 336,960 456,904 1.3560
27 Nov 2012 14:54:56 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 311,040 421,646 1.3556
27 Nov 2012 01:17:06 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 285,120 386,582 1.3559
26 Nov 2012 11:51:12 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 259,200 351,380 1.3556
25 Nov 2012 22:27:51 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 233,280 316,445 1.3565
25 Nov 2012 08:55:27 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 207,360 281,420 1.3572
24 Nov 2012 19:07:58 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 181,440 246,295 1.3574
24 Nov 2012 05:17:40 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 155,520 211,192 1.3580
23 Nov 2012 15:34:38 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 129,600 175,872 1.3570
23 Nov 2012 01:49:10 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 103,680 140,777 1.3578
22 Nov 2012 12:18:08 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 77,760 105,648 1.3586
21 Nov 2012 22:35:24 1212089 15440282 hadcm3n_z9ma_1880_40_008245316_2 51,840 70,224 1.3546


©2024 climateprediction.net