climateprediction.net home page
Task 16052345

Task 16052345

Name hadcm3n_odas_1900_40_008472359_3
Workunit 8623198
Created 1 Oct 2013, 16:00:06 UTC
Sent 1 Oct 2013, 16:07:35 UTC
Report deadline 31 Dec 2013, 23:34:46 UTC
Received 17 Dec 2013, 6:29:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1039191
Run time 23 days 9 hours 17 min 30 sec
CPU time 22 days 13 hours 34 min 35 sec
Validate state Invalid
Credit 4,043.52
Device peak FLOPS 0.85 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5272, iMonCtr=1
Model crash detected, will try to restart...
15:23:05 (4640): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:53:11 (3444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:52:30 (4804): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:44:16 (4292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:53:36 (4308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:40:38 (4432): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:14:54 (5624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:39:06 (5492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:47:02 (4828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5804, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5804, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5804, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5804, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2040, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5376, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 Nov 2013 17:24:09 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 336,960 1,926,225 5.7165
06 Nov 2013 14:01:04 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 311,040 1,787,760 5.7477
04 Nov 2013 22:24:04 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 285,120 1,646,279 5.7740
31 Oct 2013 22:28:40 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 259,200 1,505,282 5.8074
26 Oct 2013 17:06:19 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 233,280 1,352,390 5.7973
24 Oct 2013 20:26:46 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 207,360 1,200,913 5.7914
17 Oct 2013 16:11:58 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 181,440 1,048,849 5.7807
14 Oct 2013 15:22:45 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 155,520 899,395 5.7831
12 Oct 2013 20:23:51 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 129,600 750,303 5.7894
11 Oct 2013 01:06:08 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 103,680 600,699 5.7938
09 Oct 2013 05:16:05 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 77,760 450,739 5.7965
07 Oct 2013 09:55:57 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 51,840 300,682 5.8002
05 Oct 2013 14:33:00 1039191 16052345 hadcm3n_odas_1900_40_008472359_3 25,920 150,327 5.7997


©2024 cpdn.org