Task 12236879

Name hadam3p_pnw_zbat_1964_1_006954765_0
Workunit 7158081
Created 23 Nov 2010, 9:07:09 UTC
Sent 18 Mar 2011, 13:53:32 UTC
Report deadline 28 Feb 2012, 19:13:32 UTC
Received 29 Mar 2011, 8:42:51 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1138840
Run time 5 days 0 hours 7 min 41 sec
CPU time 4 days 5 hours 24 min 29 sec
Validate state Invalid
Credit 2,505.24
Device peak FLOPS 2.59 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86
Stderr
<core_client_version>6.10.36</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:34:30 (5276): No heartbeat from core client for 30 sec - exiting
09:34:31 (5276): No heartbeat from core client for 30 sec - exiting
09:34:32 (5276): No heartbeat from core client for 30 sec - exiting
09:34:33 (5276): No heartbeat from core client for 30 sec - exiting
09:34:34 (5276): No heartbeat from core client for 30 sec - exiting
09:34:35 (5276): No heartbeat from core client for 30 sec - exiting
09:34:36 (5276): No heartbeat from core client for 30 sec - exiting
09:34:37 (5276): No heartbeat from core client for 30 sec - exiting
09:34:38 (5276): No heartbeat from core client for 30 sec - exiting
09:34:39 (5276): No heartbeat from core client for 30 sec - exiting
09:34:40 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5844, selfPID=5284, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:23:16 (7084): No heartbeat from core client for 30 sec - exiting
09:23:17 (7084): No heartbeat from core client for 30 sec - exiting
09:23:18 (7084): No heartbeat from core client for 30 sec - exiting
09:23:19 (7084): No heartbeat from core client for 30 sec - exiting
09:23:20 (7084): No heartbeat from core client for 30 sec - exiting
09:23:21 (7084): No heartbeat from core client for 30 sec - exiting
09:23:22 (7084): No heartbeat from core client for 30 sec - exiting
09:23:23 (7084): No heartbeat from core client for 30 sec - exiting
09:23:24 (7084): No heartbeat from core client for 30 sec - exiting
09:23:25 (7084): No heartbeat from core client for 30 sec - exiting
09:23:26 (7084): No heartbeat from core client for 30 sec - exiting
09:23:27 (7084): No heartbeat from core client for 30 sec - exiting
09:23:28 (7084): No heartbeat from core client for 30 sec - exiting
09:23:29 (7084): No heartbeat from core client for 30 sec - exiting
09:23:30 (7084): No heartbeat from core client for 30 sec - exiting
09:23:31 (7084): No heartbeat from core client for 30 sec - exiting
09:23:32 (7084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5592, selfPID=5592, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9588, selfPID=9588, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=1008, iMonCtr=1
Model crash detected, will try to restart...
09:04:10 (5580): No heartbeat from core client for 30 sec - exiting
09:04:11 (5580): No heartbeat from core client for 30 sec - exiting
09:04:12 (5580): No heartbeat from core client for 30 sec - exiting
09:04:13 (5580): No heartbeat from core client for 30 sec - exiting
09:04:14 (5580): No heartbeat from core client for 30 sec - exiting
09:04:15 (5580): No heartbeat from core client for 30 sec - exiting
09:04:16 (5580): No heartbeat from core client for 30 sec - exiting
09:04:17 (5580): No heartbeat from core client for 30 sec - exiting
09:04:18 (5580): No heartbeat from core client for 30 sec - exiting
09:04:19 (5580): No heartbeat from core client for 30 sec - exiting
09:04:20 (5580): No heartbeat from core client for 30 sec - exiting
09:04:21 (5580): No heartbeat from core client for 30 sec - exiting
09:04:22 (5580): No heartbeat from core client for 30 sec - exiting
09:04:23 (5580): No heartbeat from core client for 30 sec - exiting
09:04:24 (5580): No heartbeat from core client for 30 sec - exiting
09:04:25 (5580): No heartbeat from core client for 30 sec - exiting
09:04:26 (5580): No heartbeat from core client for 30 sec - exiting
09:04:27 (5580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:21:24 (4320): No heartbeat from core client for 30 sec - exiting
07:21:25 (4320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5216, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1364, selfPID=7640, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 10
22:52:06 (7640): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zbat_1964_1_006954765_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zbat_1964_1_006954765_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
28 Mar 2011 19:55:23  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  115,296   353,060         3.0622
27 Mar 2011 09:36:50  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  103,776   318,856         3.0725
25 Mar 2011 18:58:02  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  92,256    283,982         3.0782
25 Mar 2011 07:45:16  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  80,736    248,918         3.0831
24 Mar 2011 18:00:34  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  69,216    214,133         3.0937
24 Mar 2011 06:39:24  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  57,696    179,207         3.1061
23 Mar 2011 13:50:33  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  46,176    144,085         3.1203
22 Mar 2011 18:19:34  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  34,656    108,490         3.1305
21 Mar 2011 13:03:27  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  23,136    72,495          3.1334
19 Mar 2011 18:47:21  1138840  12236879   hadam3p_pnw_zbat_1964_1_006954765_0  11,616    36,617          3.1523
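
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against three of the rows. The function name and the rounding rule are assumptions made for illustration, not something the page states.

```python
# Rows copied from the trickle table: (timestep, cumulative CPU seconds).
trickles = [
    (115296, 353060),  # 28 Mar 2011 19:55:23
    (103776, 318856),  # 27 Mar 2011 09:36:50
    (11616, 36617),    # 19 Mar 2011 18:47:21
]


def average_sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places.

    Hypothetical helper: the rounding is assumed from the table's values.
    """
    return round(cpu_seconds / timestep, 4)


averages = [average_sec_per_timestep(ts, cpu) for ts, cpu in trickles]

for (ts, cpu), avg in zip(trickles, averages):
    print(f"timestep={ts:>7}  cpu_sec={cpu:>7}  avg={avg:.4f}")
```

On these rows the computed values match the table's 3.0622, 3.0725, and 3.1523, which is consistent with the column being a running average rather than a per-interval rate.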