Task 15775981


Name hadcm3n_n1wp_1920_40_008365879_0
Workunit 8516738
Created 11 May 2013, 3:00:09 UTC
Sent 11 May 2013, 3:04:28 UTC
Report deadline 10 Aug 2013, 10:31:39 UTC
Received 12 May 2013, 6:40:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1282911
Run time 1 days 0 hours 25 min 37 sec
CPU time 22 hours 57 min 46 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 2.00 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
04:09:49 (13259): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:20:44 (13277): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:32 (13370): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:33 (13370): No heartbeat from core client for 30 sec - exiting
04:24:34 (13370): No heartbeat from core client for 30 sec - exiting
04:24:35 (13370): No heartbeat from core client for 30 sec - exiting
04:24:36 (13370): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
04:28:35 (13452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:28:36 (13452): No heartbeat from core client for 30 sec - exiting
04:28:37 (13452): No heartbeat from core client for 30 sec - exiting
04:28:38 (13452): No heartbeat from core client for 30 sec - exiting
04:33:08 (13544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:33:09 (13544): No heartbeat from core client for 30 sec - exiting
04:36:44 (13616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:44:29 (13711): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:44:30 (13711): No heartbeat from core client for 30 sec - exiting
04:44:31 (13711): No heartbeat from core client for 30 sec - exiting
04:44:32 (13711): No heartbeat from core client for 30 sec - exiting
04:44:33 (13711): No heartbeat from core client for 30 sec - exiting
04:44:34 (13711): No heartbeat from core client for 30 sec - exiting
04:44:35 (13711): No heartbeat from core client for 30 sec - exiting
04:44:36 (13711): No heartbeat from core client for 30 sec - exiting
04:44:37 (13711): No heartbeat from core client for 30 sec - exiting
04:44:38 (13711): No heartbeat from core client for 30 sec - exiting
04:44:39 (13711): No heartbeat from core client for 30 sec - exiting
04:44:40 (13711): No heartbeat from core client for 30 sec - exiting
04:44:41 (13711): No heartbeat from core client for 30 sec - exiting
04:44:42 (13711): No heartbeat from core client for 30 sec - exiting
04:44:43 (13711): No heartbeat from core client for 30 sec - exiting
06:47:10 (13795): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:47:11 (13795): No heartbeat from core client for 30 sec - exiting
06:47:12 (13795): No heartbeat from core client for 30 sec - exiting
06:47:13 (13795): No heartbeat from core client for 30 sec - exiting
06:47:14 (13795): No heartbeat from core client for 30 sec - exiting
06:47:15 (13795): No heartbeat from core client for 30 sec - exiting
06:47:16 (13795): No heartbeat from core client for 30 sec - exiting
06:51:08 (13958): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:12:18 (14012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:55:34 (14878): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:35 (15071): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:59:36 (15071): No heartbeat from core client for 30 sec - exiting
05:59:37 (15071): No heartbeat from core client for 30 sec - exiting
06:03:30 (15194): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:03:31 (15194): No heartbeat from core client for 30 sec - exiting
06:03:32 (15194): No heartbeat from core client for 30 sec - exiting
06:07:42 (15305): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77a2400]
[0xf77a2425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75bf1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c2825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75aa4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf770c400]
[0xf770c425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75291df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf752c825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75144d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77c1400]
[0xf77c1425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75de1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75e1825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75c94d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7718400]
[0xf7718425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75351df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7538825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75204d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77db400]
[0xf77db425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75fb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf776b400]
[0xf776b425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75881df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf758b825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75734d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15378, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
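The stderr output above is repetitive, so a short script can make the pattern easier to see. The following is a minimal sketch, not part of the project's tooling: it counts heartbeat timeouts per process ID and the number of SIGABRT reports. The function name `summarize` and the abridged `STDERR_EXCERPT` sample are illustrative assumptions. A real run would read the full text between the `<stderr_txt>` tags.

```python
import re
from collections import Counter

# Abridged sample: lines copied from the stderr output above, with most
# repeats left out. This is an illustrative excerpt, not the full log.
STDERR_EXCERPT = """\
04:09:49 (13259): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:32 (13370): No heartbeat from core client for 30 sec - exiting
04:24:33 (13370): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

# Matches lines such as "04:09:49 (13259): No heartbeat from core client ..."
HEARTBEAT = re.compile(r"^\d{2}:\d{2}:\d{2} \((\d+)\): No heartbeat from core client")


def summarize(text):
    """Return (heartbeat timeouts per PID, number of SIGABRT lines)."""
    per_pid = Counter()
    aborts = 0
    for line in text.splitlines():
        match = HEARTBEAT.match(line)
        if match:
            per_pid[int(match.group(1))] += 1
        elif line.startswith("SIGABRT"):
            aborts += 1
    return per_pid, aborts


per_pid, aborts = summarize(STDERR_EXCERPT)
print(dict(per_pid), aborts)
```

Grouping by PID shows which restarts of the model logged the most heartbeat timeouts before the final run of SIGABRT crashes.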
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
11 May 2013 23:12:31 | 1281428 | 15775981 | hadcm3n_n1wp_1920_40_008365879_0 | 25,920 | 61,917 | 2.3888
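
The Average (sec/TS) figure in the trickle row above appears to be CPU time divided by the number of timesteps. This is an assumption inferred from the column names rather than from project documentation. A quick check with the reported values:

```python
# Values copied from the trickle row above.
timestep = 25_920        # Timestep
cpu_time_sec = 61_917    # CPU Time (sec)

# Assumed definition: average seconds per timestep = CPU time / timesteps.
average = cpu_time_sec / timestep
print(f"{average:.4f}")
```

Rounded to four decimal places, the result agrees with the reported 2.3888.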

