climateprediction.net home page
Task 16083908

Task 16083908

Name hadcm3n_o0y2_2020_40_008409526_1
Workunit 8560382
Created 25 Nov 2013, 22:21:13 UTC
Sent 25 Nov 2013, 23:31:08 UTC
Report deadline 25 Feb 2014, 6:58:19 UTC
Received 31 Dec 2013, 3:15:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1251442
Run time 14 days 14 hours 57 min 6 sec
CPU time 10 days 15 hours 13 min 31 sec
Validate state Invalid
Credit 3,421.44
Device peak FLOPS 2.27 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk.
 (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5712, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
15:35:32 (6232): No heartbeat from core client for 30 sec - exiting
15:35:33 (6232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:33:57 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7084, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7092, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7092, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2388, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2464, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:58:10 (1332): No heartbeat from core client for 30 sec - exiting
18:58:11 (1332): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:48:04 (5984): No heartbeat from core client for 30 sec - exiting
17:48:05 (5984): No heartbeat from core client for 30 sec - exiting
17:48:06 (5984): No heartbeat from core client for 30 sec - exiting
17:48:07 (5984): No heartbeat from core client for 30 sec - exiting
17:48:08 (5984): No heartbeat from core client for 30 sec - exiting
17:48:09 (5984): No heartbeat from core client for 30 sec - exiting
17:48:10 (5984): No heartbeat from core client for 30 sec - exiting
17:48:11 (5984): No heartbeat from core client for 30 sec - exiting
17:48:12 (5984): No heartbeat from core client for 30 sec - exiting
17:48:13 (5984): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
12:52:27 (6756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4300, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:42:34 (5528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:42:38 (5528): No heartbeat from core client for 30 sec - exiting
09:42:39 (5528): No heartbeat from core client for 30 sec - exiting
09:42:40 (5528): No heartbeat from core client for 30 sec - exiting
09:42:41 (5528): No heartbeat from core client for 30 sec - exiting
09:42:42 (5528): No heartbeat from core client for 30 sec - exiting
09:42:43 (5528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:15:26 (23156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:35:28 (6080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Dec 2013 21:58:02 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 285,120 860,922 3.0195
26 Dec 2013 01:37:02 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 259,200 783,899 3.0243
24 Dec 2013 13:34:46 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 233,280 699,615 2.9990
13 Dec 2013 18:56:02 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 181,440 542,019 2.9873
11 Dec 2013 13:34:58 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 155,520 455,886 2.9314
08 Dec 2013 21:24:22 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 129,600 380,271 2.9342
05 Dec 2013 22:03:07 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 103,680 298,810 2.8820
03 Dec 2013 12:59:09 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 77,760 223,075 2.8688
30 Nov 2013 01:58:12 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 51,840 150,551 2.9041
27 Nov 2013 20:59:38 1251442 16083908 hadcm3n_o0y2_2020_40_008409526_1 25,920 70,337 2.7136


©2024 cpdn.org