climateprediction.net home page
Task 15892362

Task 15892362

Name hadcm3n_4g7s_1980_40_008326396_2
Workunit 8477531
Created 11 Jul 2013, 15:49:31 UTC
Sent 11 Jul 2013, 16:02:06 UTC
Report deadline 10 Oct 2013, 23:29:17 UTC
Received 14 Aug 2013, 20:09:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1228459
Run time 2 days 19 hours 33 min 44 sec
CPU time 2 days 18 hours 23 min 31 sec
Validate state Invalid
Credit 2,177.28
Device peak FLOPS 3.32 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
09:27:04 (7604): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:03:32 (1668): No heartbeat from core client for 30 sec - exiting
09:03:33 (1668): No heartbeat from core client for 30 sec - exiting
09:03:34 (1668): No heartbeat from core client for 30 sec - exiting
09:03:35 (1668): No heartbeat from core client for 30 sec - exiting
09:03:36 (1668): No heartbeat from core client for 30 sec - exiting
09:03:37 (1668): No heartbeat from core client for 30 sec - exiting
09:03:38 (1668): No heartbeat from core client for 30 sec - exiting
09:03:39 (1668): No heartbeat from core client for 30 sec - exiting
09:03:40 (1668): No heartbeat from core client for 30 sec - exiting
09:03:41 (1668): No heartbeat from core client for 30 sec - exiting
09:03:42 (1668): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
08:34:41 (7860): No heartbeat from core client for 30 sec - exiting
08:34:42 (7860): No heartbeat from core client for 30 sec - exiting
08:34:43 (7860): No heartbeat from core client for 30 sec - exiting
08:34:44 (7860): No heartbeat from core client for 30 sec - exiting
08:34:45 (7860): No heartbeat from core client for 30 sec - exiting
08:34:46 (7860): No heartbeat from core client for 30 sec - exiting
08:34:47 (7860): No heartbeat from core client for 30 sec - exiting
08:34:48 (7860): No heartbeat from core client for 30 sec - exiting
08:34:49 (7860): No heartbeat from core client for 30 sec - exiting
08:34:50 (7860): No heartbeat from core client for 30 sec - exiting
08:34:51 (7860): No heartbeat from core client for 30 sec - exiting
08:34:52 (7860): No heartbeat from core client for 30 sec - exiting
08:34:53 (7860): No heartbeat from core client for 30 sec - exiting
08:34:54 (7860): No heartbeat from core client for 30 sec - exiting
08:34:55 (7860): No heartbeat from core client for 30 sec - exiting
08:34:56 (7860): No heartbeat from core client for 30 sec - exiting
08:34:57 (7860): No heartbeat from core client for 30 sec - exiting
08:34:58 (7860): No heartbeat from core client for 30 sec - exiting
08:34:59 (7860): No heartbeat from core client for 30 sec - exiting
08:35:00 (7860): No heartbeat from core client for 30 sec - exiting
08:35:01 (7860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8672, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6984, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7424, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 181,440 215,257 1.1864
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 155,520 184,845 1.1886
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 129,600 153,445 1.1840
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 103,680 124,013 1.1961
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 77,760 96,131 1.2363
14 Aug 2013 20:09:49 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 51,840 66,443 1.2817
23 Jul 2013 20:20:44 1228459 15892362 hadcm3n_4g7s_1980_40_008326396_2 25,920 33,868 1.3066


©2024 climateprediction.net