climateprediction.net home page
Task 17549993

Task 17549993

Name hadam3p_pnw_hd24_2011_1_009287682_0
Workunit 9371870
Created 4 Dec 2014, 11:40:07 UTC
Sent 4 Dec 2014, 12:25:21 UTC
Report deadline 16 Nov 2015, 17:45:21 UTC
Received 5 Jan 2015, 11:14:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1270117
Run time 2 days 6 hours 31 min 12 sec
CPU time 2 days 3 hours 22 min 42 sec
Validate state Invalid
Credit 1,508.39
Device peak FLOPS 3.38 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>6.8.44</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5360, selfPID=5360, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5220, selfPID=5220, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5424, selfPID=5424, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3512, selfPID=3512, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2344, selfPID=2344, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5000, selfPID=5000, iMonCtr=2
12:17:28 (3560): No heartbeat from client for 30 sec - exiting
12:17:28 (3560): timer handler: client dead, exiting
12:17:29 (3560): No heartbeat from client for 30 sec - exiting
12:17:29 (3560): timer handler: client dead, exiting
12:17:30 (3560): No heartbeat from client for 30 sec - exiting
12:17:30 (3560): timer handler: client dead, exiting
12:17:31 (3560): No heartbeat from client for 30 sec - exiting
12:17:31 (3560): timer handler: client dead, exiting
12:17:32 (3560): No heartbeat from client for 30 sec - exiting
12:17:32 (3560): timer handler: client dead, exiting
12:17:33 (3560): No heartbeat from client for 30 sec - exiting
12:17:33 (3560): timer handler: client dead, exiting
12:17:34 (3560): No heartbeat from client for 30 sec - exiting
12:17:34 (3560): timer handler: client dead, exiting
12:17:36 (3560): No heartbeat from client for 30 sec - exiting
12:17:36 (3560): timer handler: client dead, exiting
12:17:37 (3560): No heartbeat from client for 30 sec - exiting
12:17:37 (3560): timer handler: client dead, exiting
12:17:38 (3560): No heartbeat from client for 30 sec - exiting
12:17:38 (3560): timer handler: client dead, exiting
12:17:39 (3560): No heartbeat from client for 30 sec - exiting
12:17:39 (3560): timer handler: client dead, exiting
12:17:40 (3560): No heartbeat from client for 30 sec - exiting
12:17:40 (3560): timer handler: client dead, exiting
12:17:41 (3560): No heartbeat from client for 30 sec - exiting
12:17:41 (3560): timer handler: client dead, exiting
12:17:42 (3560): No heartbeat from client for 30 sec - exiting
12:17:42 (3560): timer handler: client dead, exiting
12:17:43 (3560): No heartbeat from client for 30 sec - exiting
12:17:43 (3560): timer handler: client dead, exiting
12:17:44 (3560): No heartbeat from client for 30 sec - exiting
12:17:44 (3560): timer handler: client dead, exiting
12:17:45 (3560): No heartbeat from client for 30 sec - exiting
12:17:45 (3560): timer handler: client dead, exiting
12:17:46 (3560): No heartbeat from client for 30 sec - exiting
12:17:46 (3560): timer handler: client dead, exiting
12:17:48 (3560): No heartbeat from client for 30 sec - exiting
12:17:48 (3560): timer handler: client dead, exiting
12:17:49 (3560): No heartbeat from client for 30 sec - exiting
12:17:49 (3560): timer handler: client dead, exiting
12:17:50 (3560): No heartbeat from client for 30 sec - exiting
12:17:50 (3560): timer handler: client dead, exiting
12:17:51 (3560): No heartbeat from client for 30 sec - exiting
12:17:51 (3560): timer handler: client dead, exiting
12:17:52 (3560): No heartbeat from client for 30 sec - exiting
12:17:52 (3560): timer handler: client dead, exiting
12:17:53 (3560): No heartbeat from client for 30 sec - exiting
12:17:53 (3560): timer handler: client dead, exiting
12:17:54 (3560): No heartbeat from client for 30 sec - exiting
12:17:54 (3560): timer handler: client dead, exiting
12:17:55 (3560): No heartbeat from client for 30 sec - exiting
12:17:55 (3560): timer handler: client dead, exiting
12:17:56 (3560): No heartbeat from client for 30 sec - exiting
12:17:56 (3560): timer handler: client dead, exiting
12:17:57 (3560): No heartbeat from client for 30 sec - exiting
12:17:57 (3560): timer handler: client dead, exiting
12:17:58 (3560): No heartbeat from client for 30 sec - exiting
12:17:58 (3560): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5280, selfPID=5280, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2168, selfPID=2168, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=776, selfPID=776, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional WorCPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:59:20 (3920): No heartbeat from client for 30 sec - exiting
13:59:20 (3920): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5184, selfPID=4700, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
11:50:41 (3320): No heartbeat from client for 30 sec - exiting
11:50:41 (3320): timer handler: client dead, exiting
11:50:42 (3320): No heartbeat from client for 30 sec - exiting
11:50:42 (3320): timer handler: client dead, exiting
11:50:43 (3320): No heartbeat from client for 30 sec - exiting
11:50:43 (3320): timer handler: client dead, exiting
11:50:44 (3320): No heartbeat from client for 30 sec - exiting
11:50:44 (3320): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4680, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=4744, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6028, selfPID=6028, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5516, selfPID=5516, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7524, selfPID=7524, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
12:19:11 (3996): No heartbeat from client for 30 sec - exiting
12:19:11 (3996): timer handler: client dead, exiting
12:19:12 (3996): No heartbeat from client for 30 sec - exiting
12:19:12 (3996): timer handler: client dead, exiting
12:19:13 (3996): No heartbeat from client for 30 sec - exiting
12:19:13 (3996): timer handler: client dead, exiting
12:19:14 (3996): No heartbeat from client for 30 sec - exiting
12:19:14 (3996): timer handler: client dead, exiting
12:19:15 (3996): No heartbeat from client for 30 sec - exiting
12:19:15 (3996): timer handler: client dead, exiting
12:19:16 (3996): No heartbeat from client for 30 sec - exiting
12:19:16 (3996): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4204, selfPID=4204, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4884, selfPID=4884, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2736, iMonCtr=2
Model crash detected, will try to restart...
12:12:41 (3720): No heartbeat from client for 30 sec - exiting
12:12:41 (3720): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=5340, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5408, selfPID=4896, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
12:13:07 (4896): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_hd24_2011_1_009287682_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 Dec 2014 13:49:53 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 69,419 162,540 2.3414
21 Dec 2014 18:12:57 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 57,899 135,653 2.3429
16 Dec 2014 16:47:11 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 46,379 108,537 2.3402
09 Dec 2014 18:56:10 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 34,859 81,274 2.3315
08 Dec 2014 15:29:26 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 23,339 54,522 2.3361
05 Dec 2014 16:12:13 1270117 17549993 hadam3p_pnw_hd24_2011_1_009287682_0 11,819 27,901 2.3607


©2024 cpdn.org