climateprediction.net home page
Task 15619290

Name hadcm3n_zbe9_1880_40_008251930_3
Workunit 8407054
Created 22 Feb 2013, 3:57:58 UTC
Sent 22 Feb 2013, 3:58:01 UTC
Report deadline 24 May 2013, 11:25:12 UTC
Received 4 Mar 2013, 5:08:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1240735
Run time 9 days 22 hours 50 min 22 sec
CPU time 9 days 20 hours 3 min 32 sec
Validate state Invalid
Credit 6,842.88
Device peak FLOPS 3.08 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:49:46 (10016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:35:15 (10732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:45:22 (29083): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:45:37 (29083): No heartbeat from core client for 30 sec - exiting
23:45:38 (29083): No heartbeat from core client for 30 sec - exiting
23:45:39 (29083): No heartbeat from core client for 30 sec - exiting
23:45:40 (29083): No heartbeat from core client for 30 sec - exiting
23:45:41 (29083): No heartbeat from core client for 30 sec - exiting
23:45:42 (29083): No heartbeat from core client for 30 sec - exiting
23:45:43 (29083): No heartbeat from core client for 30 sec - exiting
23:45:44 (29083): No heartbeat from core client for 30 sec - exiting
23:45:45 (29083): No heartbeat from core client for 30 sec - exiting
23:45:46 (29083): No heartbeat from core client for 30 sec - exiting
23:45:47 (29083): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
23:47:14 (31838): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:47:52 (31838): No heartbeat from core client for 30 sec - exiting
23:47:53 (31838): No heartbeat from core client for 30 sec - exiting
23:47:54 (31838): No heartbeat from core client for 30 sec - exiting
23:47:55 (31838): No heartbeat from core client for 30 sec - exiting
23:47:56 (31838): No heartbeat from core client for 30 sec - exiting
23:47:57 (31838): No heartbeat from core client for 30 sec - exiting
23:47:58 (31838): No heartbeat from core client for 30 sec - exiting
23:47:59 (31838): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7792400]
[0xf7792430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a51df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a8825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75904d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=32415, iMonCtr=1
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf771d400]
[0xf771d430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75301df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7533825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf751b4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76e5400]
[0xf76e5430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf74f81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf74fb825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74e34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77be400]
[0xf77be430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75d11df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75d4825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75bc4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77b2400]
[0xf77b2430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c51df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c8825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75b04d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7770400]
[0xf7770430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75831df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7586825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf756e4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf779e400]
[0xf779e430]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b11df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b4825]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/home/boinc/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759c4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4009, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
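In the stderr output above, six "Model crash detected, will try to restart..." lines precede the final "Sorry, too many model crashes!" message. A log of this length is easier to summarise by tallying its recurring messages. The following is a minimal sketch, not part of BOINC or the CPDN application: `MARKERS` and `count_events` are hypothetical names, and the marker strings are simply copied from the log lines above.

```python
from collections import Counter

# Marker substrings copied verbatim from the stderr output above.
# The dictionary keys are illustrative labels, not BOINC terminology.
MARKERS = {
    "suspend": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "sigabrt": "SIGABRT: abort called",
    "restart_attempt": "Model crash detected, will try to restart",
    "gave_up": "too many model crashes",
}

def count_events(stderr_text: str) -> Counter:
    """Count how many log lines contain each marker substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, marker in MARKERS.items():
            if marker in line:
                counts[name] += 1
    return counts

# A short excerpt in the same shape as the log above.
sample = """\
23:47:59 (31838): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Model crash detected, will try to restart...
SIGABRT: abort called
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

print(dict(count_events(sample)))
```

Running `count_events` on the full text between `<stderr_txt>` and `</stderr_txt>` gives a per-message tally for the whole task.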
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
03 Mar 2013 20:32:32 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 570,240 | 821,106 | 1.4399
03 Mar 2013 10:06:11 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 544,320 | 783,762 | 1.4399
02 Mar 2013 23:23:32 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 518,400 | 745,635 | 1.4383
02 Mar 2013 12:55:06 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 492,480 | 708,164 | 1.4380
02 Mar 2013 02:20:26 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 466,560 | 670,515 | 1.4371
01 Mar 2013 15:15:51 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 440,640 | 632,488 | 1.4354
01 Mar 2013 04:41:29 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 414,720 | 594,928 | 1.4345
28 Feb 2013 18:00:23 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 388,800 | 556,730 | 1.4319
28 Feb 2013 07:17:24 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 362,880 | 518,833 | 1.4298
27 Feb 2013 20:33:11 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 336,960 | 480,530 | 1.4261
27 Feb 2013 10:02:06 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 311,040 | 443,098 | 1.4246
26 Feb 2013 23:19:28 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 285,120 | 404,898 | 1.4201
26 Feb 2013 12:45:10 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 259,200 | 367,254 | 1.4169
26 Feb 2013 02:08:04 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 233,280 | 329,290 | 1.4116
25 Feb 2013 15:41:56 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 207,360 | 291,897 | 1.4077
25 Feb 2013 05:13:31 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 181,440 | 254,723 | 1.4039
24 Feb 2013 18:27:21 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 155,520 | 215,787 | 1.3875
24 Feb 2013 07:44:59 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 129,600 | 178,036 | 1.3737
23 Feb 2013 20:28:53 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 103,680 | 140,921 | 1.3592
23 Feb 2013 11:07:37 | 1240735 | 15619290 | hadcm3n_zbe9_1880_40_008251930_3 | 77,760 | 109,329 | 1.4060
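
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an inference from the figures. A minimal sketch that checks the reading against three rows of the trickle table (the function name `average_sec_per_timestep` is hypothetical):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
rows = [
    (570_240, 821_106, 1.4399),
    (103_680, 140_921, 1.3592),
    (77_760, 109_329, 1.4060),
]

def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimal places."""
    return round(cpu_time_sec / timestep, 4)

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the computed value next to the value reported on the page.
    print(timestep, computed, reported)
```

For these three rows the computed values agree with the reported column to four decimal places.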

