Task 15806602


Name: hadcm3n_n2cl_1920_40_008377207_0
Workunit: 8528066
Created: 30 May 2013, 11:05:04 UTC
Sent: 31 May 2013, 2:12:37 UTC
Report deadline: 30 Aug 2013, 9:39:48 UTC
Received: 9 Jul 2013, 3:31:51 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1229296
Run time: 19 days 8 hours 22 min 51 sec
CPU time: 18 days 12 hours 26 min 15 sec
Validate state: Invalid
Credit: 9,642.24
Device peak FLOPS: 2.37 GFLOPS
Application version: UK Met Office Coupled Model Full Resolution Ocean v6.07 (i686-pc-linux-gnu)
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:49:42 (7793): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:26:14 (29978): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:26:25 (29978): No heartbeat from core client for 30 sec - exiting
19:26:26 (29978): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
20:39:36 (17970): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:36:01 (21149): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
08:57:50 (3300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:18:25 (18936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:25:16 (10684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
20:29:28 (13038): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:43:45 (4133): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:44:16 (4133): No heartbeat from core client for 30 sec - exiting
21:44:17 (4133): No heartbeat from core client for 30 sec - exiting
21:44:18 (4133): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:20:42 (29510): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
19:20:57 (29510): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:27:16 (23043): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:13:25 (5070): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:14:08 (5070): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:32:10 (23890): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:32:17 (23890): No heartbeat from core client for 30 sec - exiting
19:44:13 (4580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:28:08 (8667): No heartbeat from core client for 30 sec - exiting
20:28:10 (8667): No heartbeat from core client for 30 sec - exiting
20:28:11 (8667): No heartbeat from core client for 30 sec - exiting
20:28:12 (8667): No heartbeat from core client for 30 sec - exiting
20:28:13 (8667): No heartbeat from core client for 30 sec - exiting
20:28:14 (8667): No heartbeat from core client for 30 sec - exiting
20:28:15 (8667): No heartbeat from core client for 30 sec - exiting
20:28:16 (8667): No heartbeat from core client for 30 sec - exiting
20:28:17 (8667): No heartbeat from core client for 30 sec - exiting
20:28:18 (8667): No heartbeat from core client for 30 sec - exiting
20:28:19 (8667): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:28:20 (8667): No heartbeat from core client for 30 sec - exiting
20:28:21 (8667): No heartbeat from core client for 30 sec - exiting
20:28:23 (8667): No heartbeat from core client for 30 sec - exiting
20:28:24 (8667): No heartbeat from core client for 30 sec - exiting
20:28:25 (8667): No heartbeat from core client for 30 sec - exiting
20:28:33 (8667): No heartbeat from core client for 30 sec - exiting
20:28:35 (8667): No heartbeat from core client for 30 sec - exiting
20:28:48 (8667): No heartbeat from core client for 30 sec - exiting
20:28:50 (8667): No heartbeat from core client for 30 sec - exiting
20:28:51 (8667): No heartbeat from core client for 30 sec - exiting
20:28:52 (8667): No heartbeat from core client for 30 sec - exiting
20:28:53 (8667): No heartbeat from core client for 30 sec - exiting
20:28:54 (8667): No heartbeat from core client for 30 sec - exiting
20:28:55 (8667): No heartbeat from core client for 30 sec - exiting
20:28:58 (8667): No heartbeat from core client for 30 sec - exiting
20:28:59 (8667): No heartbeat from core client for 30 sec - exiting
20:29:00 (8667): No heartbeat from core client for 30 sec - exiting
20:29:01 (8667): No heartbeat from core client for 30 sec - exiting
20:29:02 (8667): No heartbeat from core client for 30 sec - exiting
20:29:03 (8667): No heartbeat from core client for 30 sec - exiting
20:29:05 (8667): No heartbeat from core client for 30 sec - exiting
20:29:06 (8667): No heartbeat from core client for 30 sec - exiting
20:29:16 (8667): No heartbeat from core client for 30 sec - exiting
20:29:17 (8667): No heartbeat from core client for 30 sec - exiting
20:29:18 (8667): No heartbeat from core client for 30 sec - exiting
20:29:21 (8667): No heartbeat from core client for 30 sec - exiting
20:29:23 (8667): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77c3400]
[0xf77c3430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75dc25f]
/lib32/libc.so.6(abort+0x175)[0xf75df7b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75c74b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76e4400]
[0xf76e4430]
/lib32/libc.so.6(gsignal+0x4f)[0xf74fd25f]
/lib32/libc.so.6(abort+0x175)[0xf75007b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf74e84b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7759400]
[0xf7759430]
/lib32/libc.so.6(gsignal+0x4f)[0xf757225f]
/lib32/libc.so.6(abort+0x175)[0xf75757b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf755d4b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7702400]
[0xf7702430]
/lib32/libc.so.6(gsignal+0x4f)[0xf751b25f]
/lib32/libc.so.6(abort+0x175)[0xf751e7b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75064b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77d0400]
[0xf77d0430]
/lib32/libc.so.6(gsignal+0x4f)[0xf75e925f]
/lib32/libc.so.6(abort+0x175)[0xf75ec7b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75d44b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf777f400]
[0xf777f430]
/lib32/libc.so.6(gsignal+0x4f)[0xf759825f]
/lib32/libc.so.6(abort+0x175)[0xf759b7b5]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib32/libc.so.6(__libc_start_main+0xf3)[0xf75834b3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=31749, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)      | Host ID | Result ID | Result Name                      | Timestep | CPU Time (sec) | Average (sec/TS)
09 Jul 2013 01:19:26 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 803,520  | 1,593,091      | 1.9826
08 Jul 2013 05:11:30 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 777,600  | 1,550,438      | 1.9939
07 Jul 2013 16:37:55 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 751,680  | 1,506,148      | 2.0037
07 Jul 2013 04:20:01 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 725,760  | 1,462,251      | 2.0148
06 Jul 2013 15:39:18 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 699,840  | 1,417,521      | 2.0255
06 Jul 2013 05:41:49 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 673,920  | 1,373,761      | 2.0385
06 Jul 2013 04:43:07 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 648,000  | 1,329,532      | 2.0517
04 Jul 2013 14:26:12 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 622,080  | 1,284,687      | 2.0651
03 Jul 2013 13:22:39 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 596,160  | 1,240,295      | 2.0805
03 Jul 2013 00:38:30 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 570,240  | 1,195,441      | 2.0964
02 Jul 2013 12:04:03 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 544,320  | 1,151,221      | 2.1150
02 Jul 2013 11:14:08 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 518,400  | 1,096,764      | 2.1157
02 Jul 2013 10:40:41 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 492,480  | 1,040,939      | 2.1137
02 Jul 2013 10:16:35 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 466,560  | 985,124        | 2.1115
02 Jul 2013 09:56:20 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 440,640  | 929,360        | 2.1091
28 Jun 2013 02:45:46 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 414,720  | 873,614        | 2.1065
27 Jun 2013 02:06:10 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 388,800  | 818,001        | 2.1039
26 Jun 2013 02:06:13 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 362,880  | 762,326        | 2.1008
25 Jun 2013 01:02:39 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 336,960  | 708,428        | 2.1024
10 Jun 2013 02:20:06 | 1229296 | 15806602  | hadcm3n_n2cl_1920_40_008377207_0 | 311,040  | 653,107        | 2.0998
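
The Average (sec/TS) figures in the trickle table appear to be the cumulative CPU time divided by the timestep count reached at that trickle. The Python sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical and is not part of any CPDN or BOINC tool; the four-decimal rounding is an assumption based on how the values are displayed.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: the page's "Average (sec/TS)" column is computed this way.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, average shown on the page), copied from the table
rows = [
    (803_520, 1_593_091, 1.9826),
    (777_600, 1_550_438, 1.9939),
    (311_040, 653_107, 2.0998),
]

for timestep, cpu_time_sec, shown in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep:>7}  computed={computed:.4f}  shown={shown:.4f}")

# Timesteps elapsed between the two most recent trickles in the table
trickle_interval = rows[0][0] - rows[1][0]
print("timesteps between the two latest trickles:", trickle_interval)
```

For the rows checked, the computed values match the displayed ones to four decimal places, which is consistent with the assumed formula but does not confirm how the server actually derives the column.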

