Task 13373500

Name hadcm3n_t1fd_1980_40_007453134_3
Workunit 7650637
Created 10 Sep 2011, 20:21:52 UTC
Sent 10 Sep 2011, 20:31:08 UTC
Report deadline 11 Dec 2011, 3:58:19 UTC
Received 25 Sep 2011, 12:59:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1167875
Run time 20 hours 36 min 8 sec
CPU time 19 hours 0 min 47 sec
Validate state Invalid
Credit 622.08
Device peak FLOPS 4.24 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4740, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4356, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4904, iMonCtr=1
Model crash detected, will try to restart...
11:49:38 (4640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:24:06 (4716): No heartbeat from core client for 30 sec - exiting
12:24:07 (4716): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=1
Model crash detected, will try to restart...
14:49:34 (4628): No heartbeat from core client for 30 sec - exiting
14:49:35 (4628): No heartbeat from core client for 30 sec - exiting
14:49:36 (4628): No heartbeat from core client for 30 sec - exiting
14:49:37 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:49:38 (4628): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4580, iMonCtr=1
Model crash detected, will try to restart...
18:42:20 (4824): No heartbeat from core client for 30 sec - exiting
18:42:21 (4824): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:33:25 (4856): No heartbeat from core client for 30 sec - exiting
21:33:26 (4856): No heartbeat from core client for 30 sec - exiting
21:33:27 (4856): No heartbeat from core client for 30 sec - exiting
21:33:28 (4856): No heartbeat from core client for 30 sec - exiting
21:33:29 (4856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=1
Model crash detected, will try to restart...
20:39:43 (4872): No heartbeat from core client for 30 sec - exiting
20:39:44 (4872): No heartbeat from core client for 30 sec - exiting
20:39:45 (4872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:33:51 (4848): No heartbeat from core client for 30 sec - exiting
21:33:52 (4848): No heartbeat from core client for 30 sec - exiting
21:33:53 (4848): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:17:07 (4560): No heartbeat from core client for 30 sec - exiting
19:17:09 (4560): No heartbeat from core client for 30 sec - exiting
19:17:10 (4560): No heartbeat from core client for 30 sec - exiting
19:17:11 (4560): No heartbeat from core client for 30 sec - exiting
19:17:12 (4560): No heartbeat from core client for 30 sec - exiting
19:17:13 (4560): No heartbeat from core client for 30 sec - exiting
19:17:14 (4560): No heartbeat from core client for 30 sec - exiting
19:17:15 (4560): No heartbeat from core client for 30 sec - exiting
19:17:16 (4560): No heartbeat from core client for 30 sec - exiting
19:17:17 (4560): No heartbeat from core client for 30 sec - exiting
19:17:18 (4560): No heartbeat from core client for 30 sec - exiting
19:17:19 (4560): No heartbeat from core client for 30 sec - exiting
19:17:20 (4560): No heartbeat from core client for 30 sec - exiting
19:17:21 (4560): No heartbeat from core client for 30 sec - exiting
19:17:22 (4560): No heartbeat from core client for 30 sec - exiting
19:17:23 (4560): No heartbeat from core client for 30 sec - exiting
19:17:24 (4560): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
22:23:06 (4724): No heartbeat from core client for 30 sec - exiting
22:23:07 (4724): No heartbeat from core client for 30 sec - exiting
22:23:08 (4724): No heartbeat from core client for 30 sec - exiting
22:23:09 (4724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:08:13 (1680): No heartbeat from core client for 30 sec - exiting
19:08:14 (1680): No heartbeat from core client for 30 sec - exiting
19:08:15 (1680): No heartbeat from core client for 30 sec - exiting
19:08:16 (1680): No heartbeat from core client for 30 sec - exiting
19:08:17 (1680): No heartbeat from core client for 30 sec - exiting
19:08:18 (1680): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:59:40 (3408): No heartbeat from core client for 30 sec - exiting
10:59:41 (3408): No heartbeat from core client for 30 sec - exiting
10:59:42 (3408): No heartbeat from core client for 30 sec - exiting
10:59:43 (3408): No heartbeat from core client for 30 sec - exiting
10:59:44 (3408): No heartbeat from core client for 30 sec - exiting
10:59:45 (3408): No heartbeat from core client for 30 sec - exiting
10:59:46 (3408): No heartbeat from core client for 30 sec - exiting
10:59:47 (3408): No heartbeat from core client for 30 sec - exiting
10:59:48 (3408): No heartbeat from core client for 30 sec - exiting
10:59:49 (3408): No heartbeat from core client for 30 sec - exiting
10:59:50 (3408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4468, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish

</stderr_txt>
]]>
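The stderr output above repeats a handful of message types many times. As a reading aid, here is a minimal Python sketch that tallies them. The category labels, the `classify` and `summarize` helpers, and the regular expressions are assumptions made for illustration, not part of BOINC or the CPDN client, and the sample holds only a few lines copied from the log above.

```python
import re
from collections import Counter

# Hypothetical categories; the patterns are inferred from the log text above.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "suspend": re.compile(r"Suspend request from BOINC"),
    "quit": re.compile(r"Quit request from BOINC"),
    "crash_restart": re.compile(r"Model crash detected"),
    "signal_exit": re.compile(r"Signal \d+ received"),
}


def classify(line: str) -> str:
    """Return the first matching category label, or "other"."""
    for label, pattern in PATTERNS.items():
        if pattern.search(line):
            return label
    return "other"


def summarize(lines) -> Counter:
    """Count non-blank log lines per category."""
    return Counter(classify(line) for line in lines if line.strip())


# A few lines copied verbatim from the stderr output above.
sample = [
    "CPDN Monitor - Quit request from BOINC...",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "Model crash detected, will try to restart...",
    "11:49:38 (4640): No heartbeat from core client for 30 sec - exiting",
    "Signal 22 received, exiting...",
    "Called boinc_finish",
]

for label, count in sorted(summarize(sample).items()):
    print(label, count)
```

Passing the full stderr text, split into lines, to `summarize` would give a per-category count for the whole run.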
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
24 Sep 2011 18:13:11  1167875  13373500   hadcm3n_t1fd_1980_40_007453134_3  51,840    49,427          0.9535
19 Sep 2011 21:34:48  1167875  13373500   hadcm3n_t1fd_1980_40_007453134_3  25,920    23,968          0.9247
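
The Average (sec/TS) column appears to be the CPU time divided by the timestep count. The short Python sketch below checks that reading against the two trickle rows above. The function name `average_sec_per_timestep` is hypothetical, and rounding to four decimals is an assumption based on how the values are displayed.

```python
def average_sec_per_timestep(cpu_time_sec: float, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (time sent, timestep, CPU time in seconds), taken from the trickle rows above.
trickles = [
    ("24 Sep 2011 18:13:11", 51840, 49427),
    ("19 Sep 2011 21:34:48", 25920, 23968),
]

for sent, timestep, cpu_time in trickles:
    print(sent, average_sec_per_timestep(cpu_time, timestep))
```

Both rows reproduce the listed averages (0.9535 and 0.9247), which is consistent with the column being a plain ratio of the two preceding columns.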