climateprediction.net home page
Task 15906779


Name hadcm3n_o4pt_2020_40_008376282_1
Workunit 8527141
Created 25 Jul 2013, 3:30:03 UTC
Sent 25 Jul 2013, 3:40:16 UTC
Report deadline 24 Oct 2013, 11:07:27 UTC
Received 14 Aug 2013, 18:17:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1433549
Run time 6 days 23 hours 51 min 23 sec
CPU time 5 days 15 hours 29 min 49 sec
Validate state Invalid
Credit 1,244.16
Device peak FLOPS 2.04 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11148, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:56:22 (13328): No heartbeat from core client for 30 sec - exiting
20:56:23 (13328): No heartbeat from core client for 30 sec - exiting
20:56:24 (13328): No heartbeat from core client for 30 sec - exiting
20:56:25 (13328): No heartbeat from core client for 30 sec - exiting
20:56:26 (13328): No heartbeat from core client for 30 sec - exiting
20:56:27 (13328): No heartbeat from core client for 30 sec - exiting
20:56:28 (13328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:56:29 (13328): No heartbeat from core client for 30 sec - exiting
20:56:30 (13328): No heartbeat from core client for 30 sec - exiting
20:58:27 (13572): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:21:23 (11576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:06:16 (2828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4916, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:06:22 (10028): No heartbeat from core client for 30 sec - exiting
23:06:23 (10028): No heartbeat from core client for 30 sec - exiting
23:06:24 (10028): No heartbeat from core client for 30 sec - exiting
23:06:25 (10028): No heartbeat from core client for 30 sec - exiting
23:06:26 (10028): No heartbeat from core client for 30 sec - exiting
23:06:27 (10028): No heartbeat from core client for 30 sec - exiting
23:06:28 (10028): No heartbeat from core client for 30 sec - exiting
23:06:29 (10028): No heartbeat from core client for 30 sec - exiting
23:06:30 (10028): No heartbeat from core client for 30 sec - exiting
23:06:31 (10028): No heartbeat from core client for 30 sec - exiting
23:06:32 (10028): No heartbeat from core client for 30 sec - exiting
23:06:33 (10028): No heartbeat from core client for 30 sec - exiting
23:06:34 (10028): No heartbeat from core client for 30 sec - exiting
23:06:35 (10028): No heartbeat from core client for 30 sec - exiting
23:06:36 (10028): No heartbeat from core client for 30 sec - exiting
23:06:37 (10028): No heartbeat from core client for 30 sec - exiting
23:06:38 (10028): No heartbeat from core client for 30 sec - exiting
23:06:39 (10028): No heartbeat from core client for 30 sec - exiting
23:06:40 (10028): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:06:41 (10028): No heartbeat from core client for 30 sec - exiting
23:08:28 (10700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
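The stderr output above is long and repetitive, so a tally by message type can make the sequence of events easier to read. The following is a minimal sketch, not part of the original report: the category labels and the `tally` helper are hypothetical names chosen for illustration, and the patterns are taken only from message text that appears in the log. The `sample` string is a short excerpt, not the full log.

```python
import re
from collections import Counter

# Hypothetical category labels; each pattern matches message text seen in the log above.
PATTERNS = [
    ("suspend", re.compile(r"Suspend request from BOINC")),
    ("quit", re.compile(r"Quit request from BOINC")),
    ("no_heartbeat", re.compile(r"No heartbeat from core client")),
    ("monitor_no_heartbeat", re.compile(r"No 'heartbeat' from BOINC")),
    ("signal", re.compile(r"Signal \d+ received")),
    ("restart_attempt", re.compile(r"Model crash detected")),
    ("too_many_crashes", re.compile(r"too many model crashes")),
]


def tally(stderr_text):
    """Count log lines per category; lines matching no pattern are ignored."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS:
            if pattern.search(line):
                counts[label] += 1
                break  # a line is counted under the first matching category only
    return counts


# Short excerpt of the stderr text, one line per message type.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:56:22 (13328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

for label, count in tally(sample).items():
    print(f"{label}: {count}")
```

Pasting the full contents of `<stderr_txt>` into `sample` would give the counts for this task, including how many restart attempts preceded the final "too many model crashes" message.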
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
14 Aug 2013 18:21:32  1284597  15906779   hadcm3n_o4pt_2020_40_008376282_1  103,680   419,885         4.0498
14 Aug 2013 18:21:32  1284597  15906779   hadcm3n_o4pt_2020_40_008376282_1  77,760    325,386         4.1845
14 Aug 2013 18:21:32  1284597  15906779   hadcm3n_o4pt_2020_40_008376282_1  51,840    227,138         4.3815
14 Aug 2013 18:21:32  1284597  15906779   hadcm3n_o4pt_2020_40_008376282_1  25,920    112,413         4.3369
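The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. A minimal sketch that checks this against the four rows follows; the function name `average_sec_per_timestep` is a hypothetical helper, and rounding to four decimal places is an assumption based on how the values are displayed.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds, reported sec/TS).
trickles = [
    (103_680, 419_885, 4.0498),
    (77_760, 325_386, 4.1845),
    (51_840, 227_138, 4.3815),
    (25_920, 112_413, 4.3369),
]


def average_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per timestep, rounded to four decimals (assumed display precision)."""
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in trickles:
    computed = average_sec_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>7} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

All four computed values agree with the reported column, which suggests the average is cumulative from the start of the run rather than measured per trickle interval.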

