Task 13005983

Name hadcm3n_o0o1_1940_40_007308532_1
Workunit 7505956
Created 26 Jun 2011, 18:39:41 UTC
Sent 26 Jun 2011, 18:39:48 UTC
Report deadline 26 Sep 2011, 2:06:59 UTC
Received 17 Jul 2011, 3:13:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1135626
Run time 13 days 19 hours 43 min 55 sec
CPU time 12 days 16 hours 15 min 14 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 3.23 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:20:36 (772): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
17:20:37 (772): No heartbeat from core client for 30 sec - exiting
17:20:38 (772): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
06:14:59 (2308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
02:25:22 (3976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:28:47 (2204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:28:49 (2204): No heartbeat from core client for 30 sec - exiting
02:28:50 (2204): No heartbeat from core client for 30 sec - exiting
22:37:11 (3388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:37:13 (3388): No heartbeat from core client for 30 sec - exiting
22:37:14 (3388): No heartbeat from core client for 30 sec - exiting
07:10:56 (352): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:10:58 (352): No heartbeat from core client for 30 sec - exiting
07:10:59 (352): No heartbeat from core client for 30 sec - exiting
07:11:00 (352): No heartbeat from core client for 30 sec - exiting
07:11:01 (352): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3040, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
02:36:21 (2920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3372, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
06:13:20 (4032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
09 Jul 2011 15:52:43  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  388,800   546,593         1.4058
09 Jul 2011 04:48:22  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  362,880   509,394         1.4038
08 Jul 2011 15:16:43  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  336,960   473,046         1.4039
08 Jul 2011 05:12:23  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  311,040   438,689         1.4104
07 Jul 2011 18:07:31  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  285,120   400,784         1.4057
07 Jul 2011 15:42:10  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  259,200   363,910         1.4040
07 Jul 2011 15:42:10  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  233,280   326,824         1.4010
07 Jul 2011 15:42:10  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  207,360   289,855         1.3978
05 Jul 2011 20:51:25  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  181,440   252,723         1.3929
05 Jul 2011 09:58:28  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  155,520   215,717         1.3871
04 Jul 2011 22:48:17  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  129,600   178,713         1.3790
04 Jul 2011 11:41:03  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  103,680   142,015         1.3697
03 Jul 2011 14:07:16  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  77,760    108,967         1.4013
01 Jul 2011 03:16:18  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  51,840    74,291          1.4331
30 Jun 2011 16:05:13  1135626  13005983   hadcm3n_o0o1_1940_40_007308532_1  25,920    37,141          1.4329
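
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The function name seconds_per_timestep and the rounding to four decimal places are assumptions made for illustration, not part of the project's server code.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep (hypothetical helper)."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# (timestep, CPU time in seconds, listed average) copied from the trickle table.
rows = [
    (388_800, 546_593, 1.4058),
    (362_880, 509_394, 1.4038),
    (25_920, 37_141, 1.4329),
]

for timestep, cpu_time_sec, listed in rows:
    computed = seconds_per_timestep(cpu_time_sec, timestep)
    # Rounding to four decimals is assumed to mirror the table's display precision.
    print(f"{timestep:>7,}  computed {computed:.4f}  listed {listed:.4f}")
```

For these three rows the computed value matches the listed one to four decimal places, which is consistent with the column being a running average rather than a per-trickle rate.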


©2024 climateprediction.net