climateprediction.net home page
Task 12266199

Task 12266199

Name hadam3p_pnw_zepu_1977_1_006983194_0
Workunit 7186510
Created 23 Nov 2010, 16:58:00 UTC
Sent 10 Feb 2011, 22:10:52 UTC
Report deadline 24 Jan 2012, 3:30:52 UTC
Received 8 Mar 2011, 17:59:34 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1095810
Run time 5 days 9 hours 0 min 2 sec
CPU time 4 days 14 hours 31 min 7 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 2.03 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
14:00:35 (5168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:00:36 (5168): No heartbeat from core client for 30 sec - exiting
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4764, selfPID=4764, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5304, selfPID=5304, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3332, selfPID=3332, iMonCtr=2
08:51:39 (1896): No heartbeat from core client for 30 sec - exiting
08:51:40 (1896): No heartbeat from core client for 30 sec - exiting
08:51:42 (1896): No heartbeat from core client for 30 sec - exiting
08:51:43 (1896): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:23 (1032): No heartbeat from core client for 30 sec - exiting
12:57:24 (1032): No heartbeat from core client for 30 sec - exiting
12:57:26 (1032): No heartbeat from core client for 30 sec - exiting
12:57:27 (1032): No heartbeat from core client for 30 sec - exiting
12:57:28 (1032): No heartbeat from core client for 30 sec - exiting
12:57:29 (1032): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
RegController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=892, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1564, selfPID=3208, iMonCtr=1
Model crash detected, will try to restart...
14:08:57 (1500): No heartbeat from core client for 30 sec - exiting
14:09:03 (1500): No heartbeat from core client for 30 sec - exiting
14:09:04 (1500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2836, iMonCtr=2
Model crash detected, will try to restart...
CGntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2124, iMonCtr=2
Model crash detected, will try to restart...
lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3796, iMonCtr=2
08:59:53 (3404): No heartbeat from core client for 30 sec - exiting
08:59:54 (3404): No heartbeat from core client for 30 sec - exiting
08:59:55 (3404): No heartbeat from core client for 30 sec - exiting
08:59:56 (3404): No heartbeat from core client for 30 sec - exiting
08:59:57 (3404): No heartbeat from core client for 30 sec - exiting
08:59:59 (3404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:33:16 (2324): No heartbeat from core client for 30 sec - exiting
08:33:17 (2324): No heartbeat from core client for 30 sec - exiting
08:33:18 (2324): No heartbeat from core client for 30 sec - exiting
08:33:19 (2324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=864, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3164, selfPID=2128, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
18:58:25 (2128): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zepu_1977_1_006983194_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zepu_1977_1_006983194_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zepu_1977_1_006983194_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zepu_1977_1_006983194_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
08 Mar 2011 14:53:45 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 92,256 375,346 4.0685
08 Mar 2011 14:53:45 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 80,736 328,811 4.0727
27 Feb 2011 15:58:15 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 69,216 282,570 4.0824
25 Feb 2011 22:26:16 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 57,696 234,976 4.0727
20 Feb 2011 19:31:05 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 46,176 187,935 4.0700
18 Feb 2011 09:53:49 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 34,656 140,727 4.0607
16 Feb 2011 13:25:22 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 23,136 94,448 4.0823
14 Feb 2011 10:29:31 1095810 12266199 hadam3p_pnw_zepu_1977_1_006983194_0 11,616 47,008 4.0468


©2024 cpdn.org