climateprediction.net home page
Task 15814125

Task 15814125

Name hadcm3n_n5il_1880_40_008375219_1
Workunit 8526078
Created 1 Jun 2013, 5:23:33 UTC
Sent 1 Jun 2013, 5:26:01 UTC
Report deadline 31 Aug 2013, 12:53:12 UTC
Received 2 Jun 2013, 6:40:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1282401
Run time 19 hours 9 min 12 sec
CPU time 18 hours 41 min 18 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 1.99 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
i686-pc-linux-gnu
Stderr
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
10:41:53 (8684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:45:18 (11296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:45:19 (11296): No heartbeat from core client for 30 sec - exiting
10:45:20 (11296): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
11:42:22 (11416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:46:55 (12028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:46:56 (12028): No heartbeat from core client for 30 sec - exiting
11:46:57 (12028): No heartbeat from core client for 30 sec - exiting
11:46:58 (12028): No heartbeat from core client for 30 sec - exiting
11:46:59 (12028): No heartbeat from core client for 30 sec - exiting
11:47:00 (12028): No heartbeat from core client for 30 sec - exiting
11:47:01 (12028): No heartbeat from core client for 30 sec - exiting
11:47:02 (12028): No heartbeat from core client for 30 sec - exiting
11:47:03 (12028): No heartbeat from core client for 30 sec - exiting
11:47:04 (12028): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
12:20:14 (12192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:44 (12565): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:52 (12565): No heartbeat from core client for 30 sec - exiting
12:30:02 (12725): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:38:45 (12870): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:47:30 (13058): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:52:43 (13233): No heartbeat from core client for 30 sec - exiting
12:52:44 (13233): No heartbeat from core client for 30 sec - exiting
12:52:45 (13233): No heartbeat from core client for 30 sec - exiting
12:52:46 (13233): No heartbeat from core client for 30 sec - exiting
12:52:47 (13233): No heartbeat from core client for 30 sec - exiting
12:52:48 (13233): No heartbeat from core client for 30 sec - exiting
12:52:49 (13233): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:15 (13368): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:16 (13368): No heartbeat from core client for 30 sec - exiting
12:57:17 (13368): No heartbeat from core client for 30 sec - exiting
12:57:18 (13368): No heartbeat from core client for 30 sec - exiting
13:02:21 (13529): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:03:21 (13685): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:08:21 (14317): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:08:22 (14317): No heartbeat from core client for 30 sec - exiting
14:08:23 (14317): No heartbeat from core client for 30 sec - exiting
14:08:24 (14317): No heartbeat from core client for 30 sec - exiting
14:08:25 (14317): No heartbeat from core client for 30 sec - exiting
14:08:26 (14317): No heartbeat from core client for 30 sec - exiting
14:12:41 (14452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:05:04 (14605): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:32:58 (17270): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:38:04 (17615): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:42:58 (17781): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:42:59 (17781): No heartbeat from core client for 30 sec - exiting
19:47:02 (17951): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:47:03 (17951): No heartbeat from core client for 30 sec - exiting
19:47:04 (17951): No heartbeat from core client for 30 sec - exiting
19:47:05 (17951): No heartbeat from core client for 30 sec - exiting
19:47:06 (17951): No heartbeat from core client for 30 sec - exiting
19:47:07 (17951): No heartbeat from core client for 30 sec - exiting
19:47:08 (17951): No heartbeat from core client for 30 sec - exiting
19:47:09 (17951): No heartbeat from core client for 30 sec - exiting
19:47:10 (17951): No heartbeat from core client for 30 sec - exiting
19:47:11 (17951): No heartbeat from core client for 30 sec - exiting
19:47:12 (17951): No heartbeat from core client for 30 sec - exiting
19:47:13 (17951): No heartbeat from core client for 30 sec - exiting
19:47:14 (17951): No heartbeat from core client for 30 sec - exiting
19:47:15 (17951): No heartbeat from core client for 30 sec - exiting
19:47:16 (17951): No heartbeat from core client for 30 sec - exiting
19:47:17 (17951): No heartbeat from core client for 30 sec - exiting
19:47:18 (17951): No heartbeat from core client for 30 sec - exiting
19:47:19 (17951): No heartbeat from core client for 30 sec - exiting
19:47:20 (17951): No heartbeat from core client for 30 sec - exiting
19:47:21 (17951): No heartbeat from core client for 30 sec - exiting
19:47:22 (17951): No heartbeat from core client for 30 sec - exiting
19:47:23 (17951): No heartbeat from core client for 30 sec - exiting
19:47:24 (17951): No heartbeat from core client for 30 sec - exiting
20:31:53 (18121): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:37:02 (18620): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:41:11 (18794): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:41:12 (18794): No heartbeat from core client for 30 sec - exiting
20:41:13 (18794): No heartbeat from core client for 30 sec - exiting
20:41:14 (18794): No heartbeat from core client for 30 sec - exiting
20:41:15 (18794): No heartbeat from core client for 30 sec - exiting
20:41:16 (18794): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
22:09:32 (18963): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:14:06 (19836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:14:07 (19836): No heartbeat from core client for 30 sec - exiting
22:14:08 (19836): No heartbeat from core client for 30 sec - exiting
22:44:11 (19990): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:48:26 (20367): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:12:52 (20526): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:17:35 (20885): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:34:06 (21046): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:34:07 (21046): No heartbeat from core client for 30 sec - exiting
23:34:08 (21046): No heartbeat from core client for 30 sec - exiting
23:42:56 (21310): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
23:47:12 (21502): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:52:04 (21663): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:34:58 (21778): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:39:48 (23310): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:44:14 (23417): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:44:15 (23417): No heartbeat from core client for 30 sec - exiting
02:48:22 (23592): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:48:23 (23592): No heartbeat from core client for 30 sec - exiting
02:48:24 (23592): No heartbeat from core client for 30 sec - exiting
02:48:25 (23592): No heartbeat from core client for 30 sec - exiting
02:48:26 (23592): No heartbeat from core client for 30 sec - exiting
02:48:27 (23592): No heartbeat from core client for 30 sec - exiting
02:48:28 (23592): No heartbeat from core client for 30 sec - exiting
02:48:29 (23592): No heartbeat from core client for 30 sec - exiting
02:48:30 (23592): No heartbeat from core client for 30 sec - exiting
02:48:31 (23592): No heartbeat from core client for 30 sec - exiting
02:57:21 (23751): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:40 (23960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:01:41 (23960): No heartbeat from core client for 30 sec - exiting
03:01:42 (23960): No heartbeat from core client for 30 sec - exiting
03:01:43 (23960): No heartbeat from core client for 30 sec - exiting
03:01:44 (23960): No heartbeat from core client for 30 sec - exiting
03:01:45 (23960): No heartbeat from core client for 30 sec - exiting
04:13:15 (24126): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:44:38 (24860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:49:04 (25726): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:49:05 (25726): No heartbeat from core client for 30 sec - exiting
05:49:06 (25726): No heartbeat from core client for 30 sec - exiting
05:49:07 (25726): No heartbeat from core client for 30 sec - exiting
05:49:08 (25726): No heartbeat from core client for 30 sec - exiting
05:49:09 (25726): No heartbeat from core client for 30 sec - exiting
05:53:34 (25883): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:53:35 (25883): No heartbeat from core client for 30 sec - exiting
05:53:36 (25883): No heartbeat from core client for 30 sec - exiting
05:53:37 (25883): No heartbeat from core client for 30 sec - exiting
05:53:38 (25883): No heartbeat from core client for 30 sec - exiting
05:53:39 (25883): No heartbeat from core client for 30 sec - exiting
05:53:40 (25883): No heartbeat from core client for 30 sec - exiting
05:53:41 (25883): No heartbeat from core client for 30 sec - exiting
05:53:42 (25883): No heartbeat from core client for 30 sec - exiting
05:53:43 (25883): No heartbeat from core client for 30 sec - exiting
05:53:44 (25883): No heartbeat from core client for 30 sec - exiting
05:53:45 (25883): No heartbeat from core client for 30 sec - exiting
05:53:46 (25883): No heartbeat from core client for 30 sec - exiting
05:57:56 (26043): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:57:57 (26043): No heartbeat from core client for 30 sec - exiting
05:57:58 (26043): No heartbeat from core client for 30 sec - exiting
05:57:59 (26043): No heartbeat from core client for 30 sec - exiting
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7780400]
[0xf7780425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf759d1df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75a0825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75884d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77dc400]
[0xf77dc425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75f91df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75fc825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75e44d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7794400]
[0xf7794425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75b11df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75b4825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf759c4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7755400]
[0xf7755425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75721df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7575825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf755d4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf77a6400]
[0xf77a6425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75c31df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf75c6825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf75ae4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
SIGABRT: abort called
Stack trace (9 frames):
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x840da8f]
[0xf7706400]
[0xf7706425]
/lib/i386-linux-gnu/libc.so.6(gsignal+0x4f)[0xf75231df]
/lib/i386-linux-gnu/libc.so.6(abort+0x175)[0xf7526825]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x83400c3]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x838f395]
/var/lib/boinc-client/projects/climateprediction.net/hadcm3n_um_6.07_i686-pc-linux-gnu[0x839bdf8]
/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0xf750e4d3]

Exiting...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=26202, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jun 2013 04:27:11 1282401 15814125 hadcm3n_n5il_1880_40_008375219_1 25,920 63,940 2.4668


©2024 climateprediction.net