climateprediction.net home page
Task 15003833

Task 15003833

Name hadam3p_pnw_bx4l_1973_1_008091351_0
Workunit 8246465
Created 26 Jul 2012, 14:38:21 UTC
Sent 26 Jul 2012, 14:45:16 UTC
Report deadline 8 Jul 2013, 20:05:16 UTC
Received 18 Aug 2012, 23:58:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1229231
Run time 21 days 14 hours 11 min 7 sec
CPU time 13 days 10 hours 42 min 36 sec
Validate state Invalid
Credit 2,004.61
Device peak FLOPS 0.78 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
12:45:35 (4588): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:45:17 (3820): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:48:10 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:48:11 (3488): No heartbeat from core client for 30 sec - exiting
23:48:12 (3488): No heartbeat from core client for 30 sec - exiting
23:48:13 (3488): No heartbeat from core client for 30 sec - exiting
23:48:14 (3488): No heartbeat from core client for 30 sec - exiting
23:48:20 (3488): No heartbeat from core client for 30 sec - exiting
14:05:41 (5912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:05:42 (5912): No heartbeat from core client for 30 sec - exiting
14:05:43 (5912): No heartbeat from core client for 30 sec - exiting
14:05:44 (5912): No heartbeat from core client for 30 sec - exiting
14:05:45 (5912): No heartbeat from core client for 30 sec - exiting
14:05:46 (5912): No heartbeat from core client for 30 sec - exiting
14:05:47 (5912): No heartbeat from core client for 30 sec - exiting
14:05:48 (5912): No heartbeat from core client for 30 sec - exiting
14:05:49 (5912): No heartbeat from core client for 30 sec - exiting
14:05:50 (5912): No heartbeat from core client for 30 sec - exiting
14:05:51 (5912): No heartbeat from core client for 30 sec - exiting
14:05:52 (5912): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
15:22:47 (4516): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:22:48 (4516): No heartbeat from core client for 30 sec - exiting
15:22:49 (4516): No heartbeat from core client for 30 sec - exiting
15:22:50 (4516): No heartbeat from core client for 30 sec - exiting
18:17:06 (4484): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:17:07 (4484): No heartbeat from core client for 30 sec - exiting
18:17:08 (4484): No heartbeat from core client for 30 sec - exiting
18:17:09 (4484): No heartbeat from core client for 30 sec - exiting
18:17:10 (4484): No heartbeat from core client for 30 sec - exiting
18:17:11 (4484): No heartbeat from core client for 30 sec - exiting
18:17:12 (4484): No heartbeat from core client for 30 sec - exiting
05:52:38 (4492): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:52:39 (4492): No heartbeat from core client for 30 sec - exiting
05:52:40 (4492): No heartbeat from core client for 30 sec - exiting
07:58:29 (4440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:58:30 (4440): No heartbeat from core client for 30 sec - exiting
07:58:31 (4440): No heartbeat from core client for 30 sec - exiting
07:58:32 (4440): No heartbeat from core client for 30 sec - exiting
07:58:33 (4440): No heartbeat from core client for 30 sec - exiting
07:58:34 (4440): No heartbeat from core client for 30 sec - exiting
07:58:35 (4440): No heartbeat from core client for 30 sec - exiting
08:34:05 (3132): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:34:06 (3132): No heartbeat from core client for 30 sec - exiting
11:02:02 (1928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:58:52 (5004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:06:02 (2784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:06:03 (2784): No heartbeat from core client for 30 sec - exiting
04:06:04 (2784): No heartbeat from core client for 30 sec - exiting
04:06:05 (2784): No heartbeat from core client for 30 sec - exiting
04:06:06 (2784): No heartbeat from core client for 30 sec - exiting
04:06:07 (2784): No heartbeat from core client for 30 sec - exiting
04:06:08 (2784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4328, selfPID=184, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
16:14:37 (4168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:14:38 (4168): No heartbeat from core client for 30 sec - exiting
16:14:39 (4168): No heartbeat from core client for 30 sec - exiting
16:14:40 (4168): No heartbeat from core client for 30 sec - exiting
16:14:41 (4168): No heartbeat from core client for 30 sec - exiting
16:14:42 (4168): No heartbeat from core client for 30 sec - exiting
16:33:08 (5280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=4436, iMonCtr=2
23:15:55 (2084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:24:13 (6060): No heartbeat from core client for 30 sec - exiting
23:24:14 (6060): No heartbeat from core client for 30 sec - exiting
23:24:15 (6060): No heartbeat from core client for 30 sec - exiting
23:24:16 (6060): No heartbeat from core client for 30 sec - exiting
23:24:17 (6060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:24:18 (6060): No heartbeat from core client for 30 sec - exiting
23:24:19 (6060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:06:21 (5580): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1092, selfPID=2044, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
04:59:23 (4648): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3756, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1000, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3416, selfPID=4924, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2140, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4344, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1872, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4448, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4120, selfPID=4364, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_bx4l_1973_1_008091351_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bx4l_1973_1_008091351_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bx4l_1973_1_008091351_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_bx4l_1973_1_008091351_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Aug 2012 18:54:11 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 92,256 1,050,767 11.3897
14 Aug 2012 11:52:02 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 80,736 922,400 11.4249
11 Aug 2012 20:47:27 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 69,216 791,314 11.4325
09 Aug 2012 14:52:23 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 57,696 659,370 11.4283
07 Aug 2012 00:08:12 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 46,176 523,004 11.3263
03 Aug 2012 07:21:14 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 34,656 390,154 11.2579
01 Aug 2012 00:38:49 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 23,137 261,578 11.3056
31 Jul 2012 23:33:32 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 23,136 260,075 11.2411
29 Jul 2012 18:46:53 1229231 15003833 hadam3p_pnw_bx4l_1973_1_008091351_0 11,616 129,055 11.1101


©2024 cpdn.org