Task 15800506

Name hadcm3n_39f6_1980_40_008320347_3
Workunit 8471482
Created 29 May 2013, 8:18:11 UTC
Sent 29 May 2013, 8:18:44 UTC
Report deadline 28 Aug 2013, 15:45:55 UTC
Received 14 Jun 2013, 4:12:08 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1262237
Run time 3 days 16 hours 2 min 27 sec
CPU time 3 days 2 hours 39 min 58 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 4.03 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognise the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:52:23 (4444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:28:30 (4896): Can't acquire lockfile (32) - waiting 35s
16:28:43 (5240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
13:25:05 (6060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:28:49 (5812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:43:01 (3532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:32:33 (4464): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:31:15 (2060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5000, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
13 Jun 2013 03:26:49  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  259,200   248,132         0.9573
12 Jun 2013 03:53:51  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  233,280   222,907         0.9555
11 Jun 2013 05:11:43  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  207,360   197,743         0.9536
10 Jun 2013 03:10:28  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  181,440   172,478         0.9506
09 Jun 2013 08:39:37  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  155,520   147,209         0.9466
06 Jun 2013 09:56:47  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  129,600   122,017         0.9415
04 Jun 2013 08:09:36  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  103,680   97,739          0.9427
03 Jun 2013 10:19:51  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  77,760    73,536          0.9457
02 Jun 2013 10:04:10  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  51,840    49,322          0.9514
01 Jun 2013 05:41:43  1262237  15800506   hadcm3n_39f6_1980_40_008320347_3  25,920    25,026          0.9655
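
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. That reading is an assumption inferred from the numbers, not something the page states. A minimal Python sketch that checks it against the ten rows listed; the names `TRICKLES` and `mismatches` are hypothetical and introduced only for this illustration:

```python
# Rows copied from the trickle table, oldest first:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS).
TRICKLES = [
    (25_920, 25_026, 0.9655),
    (51_840, 49_322, 0.9514),
    (77_760, 73_536, 0.9457),
    (103_680, 97_739, 0.9427),
    (129_600, 122_017, 0.9415),
    (155_520, 147_209, 0.9466),
    (181_440, 172_478, 0.9506),
    (207_360, 197_743, 0.9536),
    (233_280, 222_907, 0.9555),
    (259_200, 248_132, 0.9573),
]


def mismatches(rows, tolerance=5e-5):
    """Return the rows whose reported average differs from CPU time / timestep.

    The tolerance of 5e-5 allows for the four-decimal rounding in the table.
    Assumption: the average is cumulative CPU seconds over cumulative timesteps.
    """
    bad = []
    for timestep, cpu_seconds, reported in rows:
        computed = cpu_seconds / timestep
        if abs(computed - reported) > tolerance:
            bad.append((timestep, cpu_seconds, reported, computed))
    return bad


if __name__ == "__main__":
    for row in mismatches(TRICKLES):
        print("mismatch:", row)
```

For the ten rows in the table, `mismatches(TRICKLES)` returns an empty list, which is consistent with the assumed formula. Each trickle also covers the same interval of 25,920 timesteps.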

