climateprediction.net home page
Task 13657706

Task 13657706

Name hadcm3n_yi12_1900_40_007515698_4
Workunit 7713173
Created 24 Nov 2011, 3:41:04 UTC
Sent 24 Nov 2011, 3:49:27 UTC
Report deadline 23 Feb 2012, 11:16:38 UTC
Received 26 Nov 2011, 1:39:37 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1066269
Run time 1 days 3 hours 59 min 18 sec
CPU time 16 hours 14 min 27 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 2.93 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>6.12.42</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
07:50:58 (4945): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:50:59 (4945): No heartbeat from core client for 30 sec - exiting
07:53:13 (5170): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:53:14 (5170): No heartbeat from core client for 30 sec - exiting
07:54:37 (5203): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:04:41 (5214): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:04:42 (5214): No heartbeat from core client for 30 sec - exiting
08:04:43 (5214): No heartbeat from core client for 30 sec - exiting
08:04:44 (5214): No heartbeat from core client for 30 sec - exiting
08:04:45 (5214): No heartbeat from core client for 30 sec - exiting
08:04:46 (5214): No heartbeat from core client for 30 sec - exiting
08:04:47 (5214): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
10:11:43 (5260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:11:44 (5260): No heartbeat from core client for 30 sec - exiting
10:11:45 (5260): No heartbeat from core client for 30 sec - exiting
10:11:46 (5260): No heartbeat from core client for 30 sec - exiting
10:11:47 (5260): No heartbeat from core client for 30 sec - exiting
10:11:48 (5260): No heartbeat from core client for 30 sec - exiting
10:11:49 (5260): No heartbeat from core client for 30 sec - exiting
10:11:50 (5260): No heartbeat from core client for 30 sec - exiting
10:11:51 (5260): No heartbeat from core client for 30 sec - exiting
10:11:52 (5260): No heartbeat from core client for 30 sec - exiting
10:11:53 (5260): No heartbeat from core client for 30 sec - exiting
10:12:00 (5260): No heartbeat from core client for 30 sec - exiting
10:12:01 (5260): No heartbeat from core client for 30 sec - exiting
10:12:02 (5260): No heartbeat from core client for 30 sec - exiting
10:12:03 (5260): No heartbeat from core client for 30 sec - exiting
10:12:04 (5260): No heartbeat from core client for 30 sec - exiting
10:12:05 (5260): No heartbeat from core client for 30 sec - exiting
12:07:05 (12440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:07:14 (12440): No heartbeat from core client for 30 sec - exiting
12:07:15 (12440): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
14:34:05 (12864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:19:49 (15023): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:19:50 (15023): No heartbeat from core client for 30 sec - exiting
23:19:51 (15023): No heartbeat from core client for 30 sec - exiting
23:19:59 (15023): No heartbeat from core client for 30 sec - exiting
23:20:00 (15023): No heartbeat from core client for 30 sec - exiting
23:20:01 (15023): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
00:17:16 (15543): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:17:17 (15543): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
05:35:37 (17072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:35:38 (17072): No heartbeat from core client for 30 sec - exiting
05:35:39 (17072): No heartbeat from core client for 30 sec - exiting
05:35:40 (17072): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
05:57:09 (17357): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:57:12 (17357): No heartbeat from core client for 30 sec - exiting
05:57:13 (17357): No heartbeat from core client for 30 sec - exiting
05:57:14 (17357): No heartbeat from core client for 30 sec - exiting
05:57:15 (17357): No heartbeat from core client for 30 sec - exiting
05:57:16 (17357): No heartbeat from core client for 30 sec - exiting
05:57:17 (17357): No heartbeat from core client for 30 sec - exiting
05:57:18 (17357): No heartbeat from core client for 30 sec - exiting
05:57:19 (17357): No heartbeat from core client for 30 sec - exiting
05:57:20 (17357): No heartbeat from core client for 30 sec - exiting
05:57:21 (17357): No heartbeat from core client for 30 sec - exiting
05:57:22 (17357): No heartbeat from core client for 30 sec - exiting
05:57:23 (17357): No heartbeat from core client for 30 sec - exiting
05:57:27 (17357): No heartbeat from core client for 30 sec - exiting
05:57:28 (17357): No heartbeat from core client for 30 sec - exiting
05:57:29 (17357): No heartbeat from core client for 30 sec - exiting
05:57:30 (17357): No heartbeat from core client for 30 sec - exiting
05:57:31 (17357): No heartbeat from core client for 30 sec - exiting
05:57:32 (17357): No heartbeat from core client for 30 sec - exiting
05:57:33 (17357): No heartbeat from core client for 30 sec - exiting
06:57:51 (17443): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:57:52 (17443): No heartbeat from core client for 30 sec - exiting
08:23:43 (17670): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:23:45 (17670): No heartbeat from core client for 30 sec - exiting
08:23:46 (17670): No heartbeat from core client for 30 sec - exiting
08:23:47 (17670): No heartbeat from core client for 30 sec - exiting
08:23:48 (17670): No heartbeat from core client for 30 sec - exiting
08:23:49 (17670): No heartbeat from core client for 30 sec - exiting
08:23:50 (17670): No heartbeat from core client for 30 sec - exiting
08:23:56 (17670): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
11:41:01 (18543): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:41:02 (18543): No heartbeat from core client for 30 sec - exiting
11:41:03 (18543): No heartbeat from core client for 30 sec - exiting
11:41:04 (18543): No heartbeat from core client for 30 sec - exiting
11:41:05 (18543): No heartbeat from core client for 30 sec - exiting
11:41:06 (18543): No heartbeat from core client for 30 sec - exiting
11:50:30 (18829): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:00:33 (18882): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:00:34 (18882): No heartbeat from core client for 30 sec - exiting
12:00:35 (18882): No heartbeat from core client for 30 sec - exiting
12:00:36 (18882): No heartbeat from core client for 30 sec - exiting
12:00:37 (18882): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77db400]
[0xf77db430]
/lib32/libc.so.6(gsignal+0x50)[0xf7601a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75ea2cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77d8400]
[0xf77d8430]
/lib32/libc.so.6(gsignal+0x50)[0xf75fea60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75e72cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77a0400]
[0xf77a0430]
/lib32/libc.so.6(gsignal+0x50)[0xf75c6a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75af2cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf774f400]
[0xf774f430]
/lib32/libc.so.6(gsignal+0x50)[0xf7575a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf755e2cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7727400]
[0xf7727430]
/lib32/libc.so.6(gsignal+0x50)[0xf754da60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75362cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (8 frames):
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf770a400]
[0xf770a430]
/lib32/libc.so.6(gsignal+0x50)[0xf7530a60]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xff)[0xf75192cf]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=19590, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Nov 2011 03:59:44 1066269 13657706 hadcm3n_yi12_1900_40_007515698_4 25,920 40,470 1.5613


©2024 cpdn.org