Task 15794605


Name hadam3p_pnw_q97l_2039_1_008359356_1
Workunit 8510215
Created 23 May 2013, 18:41:51 UTC
Sent 23 May 2013, 18:41:52 UTC
Report deadline 6 May 2014, 0:01:52 UTC
Received 31 May 2013, 11:03:41 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1149805
Run time 3 days 20 hours 58 min 54 sec
CPU time 3 days 15 hours 31 min 40 sec
Validate state Invalid
Credit 2,505.24
Device peak FLOPS 2.96 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<stderr_txt>
09:47:04 (768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:47:08 (768): No heartbeat from core client for 30 sec - exiting
09:47:09 (768): No heartbeat from core client for 30 sec - exiting
09:47:10 (768): No heartbeat from core client for 30 sec - exiting
09:47:11 (768): No heartbeat from core client for 30 sec - exiting
09:47:12 (768): No heartbeat from core client for 30 sec - exiting
09:47:13 (768): No heartbeat from core client for 30 sec - exiting
09:47:14 (768): No heartbeat from core client for 30 sec - exiting
09:47:16 (768): No heartbeat from core client for 30 sec - exiting
09:52:46 (4424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:52:47 (4424): No heartbeat from core client for 30 sec - exiting
09:52:48 (4424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=836, selfPID=3488, iMonCtr=1
Model crash detected, will try to restart...
09:44:00 (3284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
14:08:36 (2832): No heartbeat from core client for 30 sec - exiting
14:08:37 (2832): No heartbeat from core client for 30 sec - exiting
14:08:38 (2832): No heartbeat from core client for 30 sec - exiting
14:08:40 (2832): No heartbeat from core client for 30 sec - exiting
14:08:41 (2832): No heartbeat from core client for 30 sec - exiting
14:08:42 (2832): No heartbeat from core client for 30 sec - exiting
14:08:43 (2832): No heartbeat from core client for 30 sec - exiting
14:08:44 (2832): No heartbeat from core client for 30 sec - exiting
14:08:45 (2832): No heartbeat from core client for 30 sec - exiting
14:08:46 (2832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:08:47 (2832): No heartbeat from core client for 30 sec - exiting
14:08:48 (2832): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:55:04 (5056): No heartbeat from core client for 30 sec - exiting
10:55:05 (5056): No heartbeat from core client for 30 sec - exiting
10:55:06 (5056): No heartbeat from core client for 30 sec - exiting
10:55:07 (5056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:56:58 (308): No heartbeat from core client for 30 sec - exiting
12:56:59 (308): No heartbeat from core client for 30 sec - exiting
12:57:00 (308): No heartbeat from core client for 30 sec - exiting
12:57:01 (308): No heartbeat from core client for 30 sec - exiting
12:57:02 (308): No heartbeat from core client for 30 sec - exiting
12:57:03 (308): No heartbeat from core client for 30 sec - exiting
12:57:04 (308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:05 (308): No heartbeat from core client for 30 sec - exiting
12:57:06 (308): No heartbeat from core client for 30 sec - exiting
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=4744, iMonCtr=1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3700, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3700, selfPID=4676, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 10
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_q97l_2039_1_008359356_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_q97l_2039_1_008359356_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
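The stderr above repeats the same "No heartbeat" message many times under several different numbers in parentheses. A minimal sketch for tallying those lines follows. It assumes the parenthesised number is the process ID of the reporting process, and it runs on a handful of lines copied from the log rather than on the full output. The names `SAMPLE`, `HEARTBEAT`, and `count_heartbeat_losses` are hypothetical and are not part of BOINC or CPDN.

```python
import re
from collections import Counter

# A few lines copied verbatim from the stderr above (not the full log).
SAMPLE = """\
09:47:04 (768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:47:08 (768): No heartbeat from core client for 30 sec - exiting
09:52:46 (4424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
"""

# Matches "HH:MM:SS (<number>): No heartbeat from core client ...".
# Assumption: the number in parentheses is a process ID.
HEARTBEAT = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)


def count_heartbeat_losses(text):
    """Return a Counter mapping the parenthesised number to its line count."""
    counts = Counter()
    for line in text.splitlines():
        match = HEARTBEAT.match(line)
        if match:
            counts[int(match.group(2))] += 1
    return counts


counts = count_heartbeat_losses(SAMPLE)
for pid, n in sorted(counts.items()):
    print(f"{pid}: {n} heartbeat-loss line(s)")
```

Grouping by that number makes it easier to see how many separate processes reported a lost heartbeat, as opposed to one process logging the same message repeatedly.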
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
30 May 2013 14:12:30  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  115,296   289,257         2.5088
29 May 2013 19:53:18  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  103,776   259,460         2.5002
29 May 2013 11:23:53  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  92,256    229,926         2.4923
28 May 2013 16:01:43  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  80,736    201,115         2.4910
27 May 2013 19:11:18  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  69,216    172,682         2.4948
26 May 2013 22:04:53  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  57,696    144,958         2.5124
26 May 2013 12:56:21  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  46,176    116,131         2.5150
26 May 2013 07:49:47  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  34,656    88,128          2.5429
25 May 2013 10:47:45  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  23,136    58,871          2.5446
24 May 2013 14:49:58  1149805  15794605   hadam3p_pnw_q97l_2039_1_008359356_1  11,616    30,536          2.6288
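The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. A short sketch that checks this against three rows copied from the table follows. The relationship is inferred from the numbers themselves and is not stated on the page. The names `ROWS` and `average_sec_per_timestep` are hypothetical.

```python
# (timestep, cpu_time_sec, reported_average) copied from three table rows above.
ROWS = [
    (115_296, 289_257, 2.5088),
    (103_776, 259_460, 2.5002),
    (11_616, 30_536, 2.6288),
]


def average_sec_per_timestep(timestep, cpu_time_sec):
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


# Compare each computed value with the reported one; a difference below
# 1e-4 is consistent with the page rounding to four decimal places.
results = []
for timestep, cpu_time_sec, reported in ROWS:
    computed = average_sec_per_timestep(timestep, cpu_time_sec)
    results.append((timestep, computed, abs(computed - reported) < 1e-4))

for timestep, computed, ok in results:
    print(f"timestep {timestep}: computed {computed:.4f}, matches reported: {ok}")
```

Under this reading, the average falls from about 2.63 to about 2.51 seconds per timestep over the run, since each trickle reports the running total rather than the rate for the most recent interval.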

