climateprediction.net home page
Task 15825129

Task 15825129

Name hadcm3n_o5a3_1980_40_008388875_0
Workunit 8539734
Created 3 Jun 2013, 14:51:44 UTC
Sent 5 Jun 2013, 15:56:05 UTC
Report deadline 4 Sep 2013, 23:23:16 UTC
Received 7 Jun 2013, 13:01:24 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1284589
Run time 18 hours 30 min 27 sec
CPU time 14 hours 19 min 2 sec
Validate state Invalid
Credit 311.04
Device peak FLOPS 2.84 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
18:01:20 (3092): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:01:21 (3092): No heartbeat from core client for 30 sec - exiting
18:01:22 (3092): No heartbeat from core client for 30 sec - exiting
18:01:23 (3092): No heartbeat from core client for 30 sec - exiting
18:01:24 (3092): No heartbeat from core client for 30 sec - exiting
18:01:25 (3092): No heartbeat from core client for 30 sec - exiting
18:01:26 (3092): No heartbeat from core client for 30 sec - exiting
18:01:27 (3092): No heartbeat from core client for 30 sec - exiting
18:01:29 (3092): No heartbeat from core client for 30 sec - exiting
18:01:30 (3092): No heartbeat from core client for 30 sec - exiting
18:01:31 (3092): No heartbeat from core client for 30 sec - exiting
18:01:32 (3092): No heartbeat from core client for 30 sec - exiting
18:01:33 (3092): No heartbeat from core client for 30 sec - exiting
18:01:34 (3092): No heartbeat from core client for 30 sec - exiting
18:01:35 (3092): No heartbeat from core client for 30 sec - exiting
18:01:36 (3092): No heartbeat from core client for 30 sec - exiting
20:50:25 (1548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
20:54:39 (4880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:54:40 (4880): No heartbeat from core client for 30 sec - exiting
20:54:41 (4880): No heartbeat from core client for 30 sec - exiting
20:54:43 (4880): No heartbeat from core client for 30 sec - exiting
20:54:44 (4880): No heartbeat from core client for 30 sec - exiting
20:54:45 (4880): No heartbeat from core client for 30 sec - exiting
20:54:46 (4880): No heartbeat from core client for 30 sec - exiting
20:54:47 (4880): No heartbeat from core client for 30 sec - exiting
20:54:48 (4880): No heartbeat from core client for 30 sec - exiting
20:54:49 (4880): No heartbeat from core client for 30 sec - exiting
21:00:39 (4992): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:07:51 (2300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:00:32 (956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:00:46 (956): No heartbeat from core client for 30 sec - exiting
22:00:47 (956): No heartbeat from core client for 30 sec - exiting
22:00:48 (956): No heartbeat from core client for 30 sec - exiting
22:00:49 (956): No heartbeat from core client for 30 sec - exiting
22:00:50 (956): No heartbeat from core client for 30 sec - exiting
22:03:05 (1260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:40:11 (1508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:02:16 (704): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:04:00 (2884): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:10:08 (3048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:11:48 (3460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:21:51 (4912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
23:37:24 (5028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:38:59 (1780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:54:54 (4244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:03:31 (5016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:13:04 (2292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:13:05 (2292): No heartbeat from core client for 30 sec - exiting
00:13:07 (2292): No heartbeat from core client for 30 sec - exiting
01:01:05 (1064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:04:49 (4964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:21:54 (1260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:47:32 (1672): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:04:58 (2864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:34:46 (2284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:34:47 (2284): No heartbeat from core client for 30 sec - exiting
02:34:48 (2284): No heartbeat from core client for 30 sec - exiting
02:34:49 (2284): No heartbeat from core client for 30 sec - exiting
02:34:50 (2284): No heartbeat from core client for 30 sec - exiting
02:34:51 (2284): No heartbeat from core client for 30 sec - exiting
02:34:52 (2284): No heartbeat from core client for 30 sec - exiting
02:34:53 (2284): No heartbeat from core client for 30 sec - exiting
02:34:54 (2284): No heartbeat from core client for 30 sec - exiting
02:34:55 (2284): No heartbeat from core client for 30 sec - exiting
02:34:56 (2284): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
03:05:16 (5748): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
05:05:29 (936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Jun 2013 06:04:59 1284589 15825129 hadcm3n_o5a3_1980_40_008388875_0 25,920 34,759 1.3410


©2024 cpdn.org