climateprediction.net home page
Task 14928355

Task 14928355

Name hadam3p_eu_9ojv_1985_1_008053974_1
Workunit 8209088
Created 17 Jul 2012, 12:01:24 UTC
Sent 17 Jul 2012, 12:37:38 UTC
Report deadline 29 Jun 2013, 17:57:38 UTC
Received 3 Sep 2012, 12:26:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1221890
Run time 2 days 6 hours 1 min 1 sec
CPU time 1 days 22 hours 27 min 54 sec
Validate state Invalid
Credit 995.30
Device peak FLOPS 2.52 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.8.42</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2208, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2344, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1872, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6040, selfPID=7152, iMonCtr=1
Model crash detected, will try to restart...
16:33:36 (2056): No heartbeat from core client for 30 sec - exiting
16:33:37 (2056): No heartbeat from core client for 30 sec - exiting
16:33:38 (2056): No heartbeat from core client for 30 sec - exiting
16:33:39 (2056): No heartbeat from core client for 30 sec - exiting
16:33:40 (2056): No heartbeat from core client for 30 sec - exiting
16:33:41 (2056): No heartbeat from core client for 30 sec - exiting
16:33:43 (2056): No heartbeat from core client for 30 sec - exiting
16:33:44 (2056): No heartbeat from core client for 30 sec - exiting
16:33:45 (2056): No heartbeat from core client for 30 sec - exiting
16:33:46 (2056): No heartbeat from core client for 30 sec - exiting
16:33:47 (2056): No heartbeat from core client for 30 sec - exiting
16:33:48 (2056): No heartbeat from core client for 30 sec - exiting
16:33:49 (2056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:34:24 (4124): No heartbeat from core client for 30 sec - exiting
20:34:25 (4124): No heartbeat from core client for 30 sec - exiting
20:34:26 (4124): No heartbeat from core client for 30 sec - exiting
20:34:27 (4124): No heartbeat from core client for 30 sec - exiting
20:34:28 (4124): No heartbeat from core client for 30 sec - exiting
20:34:29 (4124): No heartbeat from core client for 30 sec - exiting
20:34:31 (4124): No heartbeat from core client for 30 sec - exiting
20:34:32 (4124): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5352, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4536, selfPID=2924, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2632, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5060, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5076, selfPID=3204, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=124, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3932, selfPID=3120, iMonCtr=1
Model crash detected, will try to restart...
17:10:54 (5040): No heartbeat from core client for 30 sec - exiting
17:10:55 (5040): No heartbeat from core client for 30 sec - exiting
17:10:56 (5040): No heartbeat from core client for 30 sec - exiting
17:10:57 (5040): No heartbeat from core client for 30 sec - exiting
17:10:58 (5040): No heartbeat from core client for 30 sec - exiting
17:10:59 (5040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5312, selfPID=3612, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5716, selfPID=3776, iMonCtr=1
Model crash detected, will try to restart...
06:42:55 (3996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2836, iMonCtr=2
Model crash detected, will try to restart...
22:23:47 (3912): No heartbeat from core client for 30 sec - exiting
22:23:48 (3912): No heartbeat from core client for 30 sec - exiting
22:23:49 (3912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1180, iMonCtr=2
Model crash detected, will try to restart...
12:35:36 (1316): No heartbeat from core client for 30 sec - exiting
12:35:37 (1316): No heartbeat from core client for 30 sec - exiting
12:35:38 (1316): No heartbeat from core client for 30 sec - exiting
12:35:40 (1316): No heartbeat from core client for 30 sec - exiting
12:35:41 (1316): No heartbeat from core client for 30 sec - exiting
12:35:42 (1316): No heartbeat from core client for 30 sec - exiting
12:35:43 (1316): No heartbeat from core client for 30 sec - exiting
12:35:44 (1316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:10:29 (2932): No heartbeat from core client for 30 sec - exiting
17:10:30 (2932): No heartbeat from core client for 30 sec - exiting
17:10:31 (2932): No heartbeat from core client for 30 sec - exiting
17:10:32 (2932): No heartbeat from core client for 30 sec - exiting
17:10:33 (2932): No heartbeat from core client for 30 sec - exiting
17:10:35 (2932): No heartbeat from core client for 30 sec - exiting
17:10:36 (2932): No heartbeat from core client for 30 sec - exiting
17:10:37 (2932): No heartbeat from core client for 30 sec - exiting
17:10:38 (2932): No heartbeat from core client for 30 sec - exiting
17:10:39 (2932): No heartbeat from core client for 30 sec - exiting
17:10:40 (2932): No heartbeat from core client for 30 sec - exiting
17:10:41 (2932): No heartbeat from core client for 30 sec - exiting
17:10:42 (2932): No heartbeat from core client for 30 sec - exiting
17:10:43 (2932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5224, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5240, selfPID=3052, iMonCtr=1
Model crash detected, will try to restart...
19:47:10 (3976): No heartbeat from core client for 30 sec - exiting
19:47:11 (3976): No heartbeat from core client for 30 sec - exiting
19:47:12 (3976): No heartbeat from core client for 30 sec - exiting
19:47:13 (3976): No heartbeat from core client for 30 sec - exiting
19:47:14 (3976): No heartbeat from core client for 30 sec - exiting
19:47:15 (3976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CCPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5088, selfPID=4740, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5392, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5412, selfPID=3960, iMonCtr=1
Model crash detected, will try to restart...

zip error: Nothing to do! (../_1.zip)
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
31 Aug 2012 11:10:57 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 57,696 147,983 2.5649
27 Aug 2012 17:07:07 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 46,176 118,631 2.5691
25 Aug 2012 12:45:32 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 34,656 87,359 2.5207
20 Aug 2012 04:43:39 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 23,139 56,981 2.4626
19 Aug 2012 16:28:10 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 23,136 56,611 2.4469
15 Aug 2012 15:51:29 1221890 14928355 hadam3p_eu_9ojv_1985_1_008053974_1 11,616 28,560 2.4587


©2024 climateprediction.net