climateprediction.net home page
Task 15790416

Task 15790416

Name hadcm3n_zik1_1920_40_008316144_3
Workunit 8467279
Created 19 May 2013, 22:46:07 UTC
Sent 19 May 2013, 22:46:08 UTC
Report deadline 19 Aug 2013, 6:13:19 UTC
Received 30 May 2013, 22:19:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1282401
Run time 10 days 14 hours 45 min 43 sec
CPU time 10 days 9 hours 28 min 26 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.01 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
17:25:14 (2335): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:35:20 (2999): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:56:34 (13127): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:12:22 (52349): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:40:35 (37890): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:40:36 (37890): No heartbeat from core client for 30 sec - exiting
06:40:37 (37890): No heartbeat from core client for 30 sec - exiting
06:40:38 (37890): No heartbeat from core client for 30 sec - exiting
06:40:39 (37890): No heartbeat from core client for 30 sec - exiting
06:40:40 (37890): No heartbeat from core client for 30 sec - exiting
06:40:41 (37890): No heartbeat from core client for 30 sec - exiting
06:40:42 (37890): No heartbeat from core client for 30 sec - exiting
06:40:43 (37890): No heartbeat from core client for 30 sec - exiting
06:40:44 (37890): No heartbeat from core client for 30 sec - exiting
06:40:45 (37890): No heartbeat from core client for 30 sec - exiting
06:40:46 (37890): No heartbeat from core client for 30 sec - exiting
06:40:47 (37890): No heartbeat from core client for 30 sec - exiting
06:40:48 (37890): No heartbeat from core client for 30 sec - exiting
06:40:49 (37890): No heartbeat from core client for 30 sec - exiting
06:40:50 (37890): No heartbeat from core client for 30 sec - exiting
06:40:51 (37890): No heartbeat from core client for 30 sec - exiting
06:40:52 (37890): No heartbeat from core client for 30 sec - exiting
06:40:53 (37890): No heartbeat from core client for 30 sec - exiting
06:40:54 (37890): No heartbeat from core client for 30 sec - exiting
06:40:55 (37890): No heartbeat from core client for 30 sec - exiting
06:40:56 (37890): No heartbeat from core client for 30 sec - exiting
06:40:57 (37890): No heartbeat from core client for 30 sec - exiting
06:45:35 (38928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:36:02 (39099): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:40:28 (41726): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:40:29 (41726): No heartbeat from core client for 30 sec - exiting
10:40:30 (41726): No heartbeat from core client for 30 sec - exiting
10:40:31 (41726): No heartbeat from core client for 30 sec - exiting
10:40:32 (41726): No heartbeat from core client for 30 sec - exiting
10:40:33 (41726): No heartbeat from core client for 30 sec - exiting
10:40:34 (41726): No heartbeat from core client for 30 sec - exiting
10:40:35 (41726): No heartbeat from core client for 30 sec - exiting
10:40:36 (41726): No heartbeat from core client for 30 sec - exiting
10:40:37 (41726): No heartbeat from core client for 30 sec - exiting
10:40:38 (41726): No heartbeat from core client for 30 sec - exiting
10:40:39 (41726): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
14:03:47 (41938): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:13:24 (44007): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:51:26 (44381): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:01:02 (44891): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:05:47 (45094): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:42:25 (45321): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:31:37 (45769): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:29:13 (46490): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:29:14 (46490): No heartbeat from core client for 30 sec - exiting
17:38:37 (47400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:23:44 (47660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:23:45 (47660): No heartbeat from core client for 30 sec - exiting
18:23:46 (47660): No heartbeat from core client for 30 sec - exiting
18:23:47 (47660): No heartbeat from core client for 30 sec - exiting
18:23:48 (47660): No heartbeat from core client for 30 sec - exiting
18:23:49 (47660): No heartbeat from core client for 30 sec - exiting
18:23:50 (47660): No heartbeat from core client for 30 sec - exiting
18:23:51 (47660): No heartbeat from core client for 30 sec - exiting
18:23:52 (47660): No heartbeat from core client for 30 sec - exiting
18:23:53 (47660): No heartbeat from core client for 30 sec - exiting
18:23:54 (47660): No heartbeat from core client for 30 sec - exiting
18:23:55 (47660): No heartbeat from core client for 30 sec - exiting
18:28:22 (48371): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:41:08 (48559): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:51:14 (48821): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:56:00 (49596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:56:01 (49596): No heartbeat from core client for 30 sec - exiting
19:56:02 (49596): No heartbeat from core client for 30 sec - exiting
19:56:03 (49596): No heartbeat from core client for 30 sec - exiting
19:56:04 (49596): No heartbeat from core client for 30 sec - exiting
19:56:05 (49596): No heartbeat from core client for 30 sec - exiting
19:56:06 (49596): No heartbeat from core client for 30 sec - exiting
19:56:07 (49596): No heartbeat from core client for 30 sec - exiting
19:56:08 (49596): No heartbeat from core client for 30 sec - exiting
19:56:09 (49596): No heartbeat from core client for 30 sec - exiting
19:56:10 (49596): No heartbeat from core client for 30 sec - exiting
19:56:11 (49596): No heartbeat from core client for 30 sec - exiting
19:56:12 (49596): No heartbeat from core client for 30 sec - exiting
19:56:13 (49596): No heartbeat from core client for 30 sec - exiting
19:56:14 (49596): No heartbeat from core client for 30 sec - exiting
19:56:15 (49596): No heartbeat from core client for 30 sec - exiting
19:56:16 (49596): No heartbeat from core client for 30 sec - exiting
19:56:17 (49596): No heartbeat from core client for 30 sec - exiting
19:56:18 (49596): No heartbeat from core client for 30 sec - exiting
19:56:19 (49596): No heartbeat from core client for 30 sec - exiting
19:56:20 (49596): No heartbeat from core client for 30 sec - exiting
19:56:21 (49596): No heartbeat from core client for 30 sec - exiting
19:56:22 (49596): No heartbeat from core client for 30 sec - exiting
19:56:23 (49596): No heartbeat from core client for 30 sec - exiting
19:56:24 (49596): No heartbeat from core client for 30 sec - exiting
19:56:25 (49596): No heartbeat from core client for 30 sec - exiting
20:00:33 (49783): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
21:12:14 (49978): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:17:02 (50742): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:17:03 (50742): No heartbeat from core client for 30 sec - exiting
21:17:04 (50742): No heartbeat from core client for 30 sec - exiting
21:17:05 (50742): No heartbeat from core client for 30 sec - exiting
21:17:06 (50742): No heartbeat from core client for 30 sec - exiting
21:17:07 (50742): No heartbeat from core client for 30 sec - exiting
21:17:08 (50742): No heartbeat from core client for 30 sec - exiting
21:17:09 (50742): No heartbeat from core client for 30 sec - exiting
21:17:10 (50742): No heartbeat from core client for 30 sec - exiting
21:17:11 (50742): No heartbeat from core client for 30 sec - exiting
21:17:12 (50742): No heartbeat from core client for 30 sec - exiting
21:17:13 (50742): No heartbeat from core client for 30 sec - exiting
21:17:14 (50742): No heartbeat from core client for 30 sec - exiting
21:17:15 (50742): No heartbeat from core client for 30 sec - exiting
21:17:16 (50742): No heartbeat from core client for 30 sec - exiting
21:17:17 (50742): No heartbeat from core client for 30 sec - exiting
21:17:18 (50742): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7793400]
[0xf7793425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b01df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b3825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759b4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76eb400]
[0xf76eb425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75081df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf750b825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf74f34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf779b400]
[0xf779b425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b81df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75bb825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75a34d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf772d400]
[0xf772d425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf754a1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf754d825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75354d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7787400]
[0xf7787425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75a41df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a7825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf758f4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf76ff400]
[0xf76ff425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf751c1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf751f825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75074d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=50942, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 May 2013 01:19:59 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 311,040 840,399 2.7019
29 May 2013 04:51:33 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 285,120 768,606 2.6957
28 May 2013 07:53:26 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 259,200 697,867 2.6924
27 May 2013 11:09:46 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 233,280 626,064 2.6837
26 May 2013 15:01:58 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 207,360 555,508 2.6790
25 May 2013 19:03:45 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 181,440 487,955 2.6893
25 May 2013 00:19:10 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 155,520 422,475 2.7165
24 May 2013 05:51:55 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 129,600 356,801 2.7531
23 May 2013 10:38:43 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 103,680 290,868 2.8054
22 May 2013 13:10:00 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 77,760 218,100 2.8048
21 May 2013 16:52:08 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 51,840 145,769 2.8119
20 May 2013 19:36:04 1282401 15790416 hadcm3n_zik1_1920_40_008316144_3 25,920 72,486 2.7965


©2024 cpdn.org