Task 13185700

Name hadam3p_eu_2iwb_1971_1_007384442_0
Workunit 7581872
Created 1 Aug 2011, 9:17:39 UTC
Sent 1 Aug 2011, 9:21:34 UTC
Report deadline 13 Jul 2012, 14:41:34 UTC
Received 23 May 2012, 12:53:57 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1047460
Run time 11 days 20 hours 22 min 32 sec
CPU time 8 days 14 hours 58 min 50 sec
Validate state Invalid
Credit 1,988.94
Device peak FLOPS 0.90 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09 (windows_intelx86)
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3884, selfPID=3884, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3468, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1412, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
20:55:19 (1756): No heartbeat from core client for 30 sec - exiting
20:55:20 (1756): No heartbeat from core client for 30 sec - exiting
20:55:21 (1756): No heartbeat from core client for 30 sec - exiting
20:55:22 (1756): No heartbeat from core client for 30 sec - exiting
20:55:23 (1756): No heartbeat from core client for 30 sec - exiting
20:55:24 (1756): No heartbeat from core client for 30 sec - exiting
20:55:25 (1756): No heartbeat from core client for 30 sec - exiting
20:55:27 (1756): No heartbeat from core client for 30 sec - exiting
20:55:28 (1756): No heartbeat from core client for 30 sec - exiting
20:55:29 (1756): No heartbeat from core client for 30 sec - exiting
20:55:30 (1756): No heartbeat from core client for 30 sec - exiting
20:55:31 (1756): No heartbeat from core client for 30 sec - exiting
20:55:32 (1756): No heartbeat from core client for 30 sec - exiting
20:55:33 (1756): No heartbeat from core client for 30 sec - exiting
20:55:34 (1756): No heartbeat from core client for 30 sec - exiting
20:55:35 (1756): No heartbeat from core client for 30 sec - exiting
20:55:36 (1756): No heartbeat from core client for 30 sec - exiting
20:55:37 (1756): No heartbeat from core client for 30 sec - exiting
20:55:39 (1756): No heartbeat from core client for 30 sec - exiting
20:55:40 (1756): No heartbeat from core client for 30 sec - exiting
20:55:41 (1756): No heartbeat from core client for 30 sec - exiting
20:55:42 (1756): No heartbeat from core client for 30 sec - exiting
20:55:43 (1756): No heartbeat from core client for 30 sec - exiting
20:55:44 (1756): No heartbeat from core client for 30 sec - exiting
20:55:45 (1756): No heartbeat from core client for 30 sec - exiting
20:55:46 (1756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:55:47 (1756): No heartbeat from core client for 30 sec - exiting
GCobal oortroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=2
Model crash detected, will try to restart...
ker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5848, iMonCtr=2
GCobaontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=2

Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5832, selfPID=5832, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4544, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5680, selfPID=5680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
17:54:35 (8112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:54:36 (8112): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=2
Model crash detected, will try to restart...
19:53:24 (4956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:53:25 (4956): No heartbeat from core client for 30 sec - exiting
19:53:26 (4956): No heartbeat from core client for 30 sec - exiting
19:53:27 (4956): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3856, selfPID=3856, iMonCtr=2
19:53:28 (4956): No heartbeat from core client for 30 sec - exiting
19:53:29 (4956): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5360, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1124, iMonCtr=2
Model crash detected, will try to restart...
lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=2
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5028, selfPID=3612, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8168, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4260, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=732, selfPID=732, iMonCtr=2
15:03:43 (4856): No heartbeat from core client for 30 sec - exiting
15:03:44 (4856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4944, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:48:00 (7652): No heartbeat from core client for 30 sec - exiting
21:48:01 (7652): No heartbeat from core client for 30 sec - exiting
21:48:02 (7652): No heartbeat from core client for 30 sec - exiting
21:48:03 (7652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6344, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6904, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8104, selfPID=8104, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5428, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CCPDN Monitor - Quit request from BOINC...
CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2
ontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4760, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3868, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CGnlobal WorkerDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6592, iMonCtr=2
Model crash detected, will try to restart...
1, checkPID=0, selfPID=7300, iMonCtr=2
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1712, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5808, selfPID=6000, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_2iwb_1971_1_007384442_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2iwb_1971_1_007384442_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                         Timestep  CPU Time (sec)  Average (sec/TS)
21 May 2012 20:10:15  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  115,296   742,761         6.4422
18 May 2012 19:31:58  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  103,776   668,627         6.4430
15 May 2012 14:50:16  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  92,257    591,702         6.4136
14 May 2012 21:59:44  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  92,256    590,704         6.4029
09 May 2012 20:45:32  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  80,736    515,515         6.3852
07 May 2012 14:24:51  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  69,216    441,408         6.3773
04 May 2012 08:29:47  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  57,696    362,519         6.2833
01 May 2012 10:54:14  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  46,176    286,801         6.2110
12 Aug 2011 08:10:31  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  34,656    211,417         6.1004
06 Aug 2011 05:39:22  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  23,136    148,719         6.4280
04 Aug 2011 21:03:39  1047460  13185700   hadam3p_eu_2iwb_1971_1_007384442_0  11,616    75,022          6.4585
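
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. That reading is an assumption taken from the column header and the numbers themselves, not from any project documentation. A minimal Python sketch that checks it against the three most recent trickle rows (the function name `sec_per_timestep` is made up for illustration):

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (115296, 742761, 6.4422),
    (103776, 668627, 6.4430),
    (92257, 591702, 6.4136),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition of the Average column: CPU seconds per model timestep."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value reported on the page.
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

If the assumption holds, the computed and reported values agree to four decimal places for each row.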