climateprediction.net home page
Task 15179865

Task 15179865

Name hadam3p_saf_1h5u_1977_1_006959674_1
Workunit 7162990
Created 23 Aug 2012, 18:14:17 UTC
Sent 23 Aug 2012, 18:16:27 UTC
Report deadline 5 Aug 2013, 23:36:27 UTC
Received 31 Aug 2012, 17:24:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1224497
Run time 2 days 3 hours 21 min 26 sec
CPU time 8 hours 38 min 25 sec
Validate state Invalid
Credit 1,496.58
Device peak FLOPS 3.69 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:28:00 (3456): No heartbeat from core client for 30 sec - exiting
22:28:01 (3456): No heartbeat from core client for 30 sec - exiting
22:28:02 (3456): No heartbeat from core client for 30 sec - exiting
22:28:03 (3456): No heartbeat from core client for 30 sec - exiting
22:28:04 (3456): No heartbeat from core client for 30 sec - exiting
22:28:05 (3456): No heartbeat from core client for 30 sec - exiting
22:28:06 (3456): No heartbeat from core client for 30 sec - exiting
22:28:07 (3456): No heartbeat from core client for 30 sec - exiting
22:28:08 (3456): No heartbeat from core client for 30 sec - exiting
22:28:09 (3456): No heartbeat from core client for 30 sec - exiting
22:28:10 (3456): No heartbeat from core client for 30 sec - exiting
22:28:11 (3456): No heartbeat from core client for 30 sec - exiting
22:28:12 (3456): No heartbeat from core client for 30 sec - exiting
22:28:13 (3456): No heartbeat from core client for 30 sec - exiting
22:28:14 (3456): No heartbeat from core client for 30 sec - exiting
22:28:15 (3456): No heartbeat from core client for 30 sec - exiting
22:28:16 (3456): No heartbeat from core client for 30 sec - exiting
22:28:17 (3456): No heartbeat from core client for 30 sec - exiting
22:28:18 (3456): No heartbeat from core client for 30 sec - exiting
22:28:19 (3456): No heartbeat from core client for 30 sec - exiting
22:28:20 (3456): No heartbeat from core client for 30 sec - exiting
22:28:21 (3456): No heartbeat from core client for 30 sec - exiting
22:28:22 (3456): No heartbeat from core client for 30 sec - exiting
22:28:23 (3456): No heartbeat from core client for 30 sec - exiting
22:28:24 (3456): No heartbeat from core client for 30 sec - exiting
22:28:25 (3456): No heartbeat from core client for 30 sec - exiting
22:28:27 (3456): No heartbeat from core client for 30 sec - exiting
22:28:28 (3456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:28:29 (3456): No heartbeat from core client for 30 sec - exiting
22:28:30 (3456): No heartbeat from core client for 30 sec - exiting
05:35:16 (5004): No heartbeat from core client for 30 sec - exiting
05:35:17 (5004): No heartbeat from core client for 30 sec - exiting
05:35:18 (5004): No heartbeat from core client for 30 sec - exiting
05:35:19 (5004): No heartbeat from core client for 30 sec - exiting
05:35:20 (5004): No heartbeat from core client for 30 sec - exiting
05:35:21 (5004): No heartbeat from core client for 30 sec - exiting
05:35:22 (5004): No heartbeat from core client for 30 sec - exiting
05:35:23 (5004): No heartbeat from core client for 30 sec - exiting
05:35:25 (5004): No heartbeat from core client for 30 sec - exiting
05:35:26 (5004): No heartbeat from core client for 30 sec - exiting
05:35:27 (5004): No heartbeat from core client for 30 sec - exiting
05:35:28 (5004): No heartbeat from core client for 30 sec - exiting
05:35:29 (5004): No heartbeat from core client for 30 sec - exiting
05:35:30 (5004): No heartbeat from core client for 30 sec - exiting
05:35:31 (5004): No heartbeat from core client for 30 sec - exiting
05:35:32 (5004): No heartbeat from core client for 30 sec - exiting
05:35:33 (5004): No heartbeat from core client for 30 sec - exiting
05:35:34 (5004): No heartbeat from core client for 30 sec - exiting
05:35:35 (5004): No heartbeat from core client for 30 sec - exiting
05:35:37 (5004): No heartbeat from core client for 30 sec - exiting
05:35:38 (5004): No heartbeat from core client for 30 sec - exiting
05:35:39 (5004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3068, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5536, selfPID=4968, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:07:50 (4828): No heartbeat from core client for 30 sec - exiting
07:07:51 (4828): No heartbeat from core client for 30 sec - exiting
07:07:52 (4828): No heartbeat from core client for 30 sec - exiting
07:07:53 (4828): No heartbeat from core client for 30 sec - exiting
07:07:54 (4828): No heartbeat from core client for 30 sec - exiting
07:07:55 (4828): No heartbeat from core client for 30 sec - exiting
07:07:56 (4828): No heartbeat from core client for 30 sec - exiting
07:07:57 (4828): No heartbeat from core client for 30 sec - exiting
07:07:58 (4828): No heartbeat from core client for 30 sec - exiting
07:08:00 (4828): No heartbeat from core client for 30 sec - exiting
07:08:01 (4828): No heartbeat from core client for 30 sec - exiting
07:08:02 (4828): No heartbeat from core client for 30 sec - exiting
07:08:03 (4828): No heartbeat from core client for 30 sec - exiting
07:08:04 (4828): No heartbeat from core client for 30 sec - exiting
07:08:05 (4828): No heartbeat from core client for 30 sec - exiting
07:08:06 (4828): No heartbeat from core client for 30 sec - exiting
07:08:07 (4828): No heartbeat from core client for 30 sec - exiting
07:08:08 (4828): No heartbeat from core client for 30 sec - exiting
07:08:09 (4828): No heartbeat from core client for 30 sec - exiting
07:08:10 (4828): No heartbeat from core client for 30 sec - exiting
07:08:12 (4828): No heartbeat from core client for 30 sec - exiting
07:08:13 (4828): No heartbeat from core client for 30 sec - exiting
07:08:14 (4828): No heartbeat from core client for 30 sec - exiting
07:08:15 (4828): No heartbeat from core client for 30 sec - exiting
07:08:16 (4828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:08:17 (4828): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:24:10 (4616): No heartbeat from core client for 30 sec - exiting
09:24:12 (4616): No heartbeat from core client for 30 sec - exiting
09:24:13 (4616): No heartbeat from core client for 30 sec - exiting
09:24:14 (4616): No heartbeat from core client for 30 sec - exiting
09:24:15 (4616): No heartbeat from core client for 30 sec - exiting
09:24:16 (4616): No heartbeat from core client for 30 sec - exiting
09:24:17 (4616): No heartbeat from core client for 30 sec - exiting
09:24:18 (4616): No heartbeat from core client for 30 sec - exiting
09:24:19 (4616): No heartbeat from core client for 30 sec - exiting
09:24:20 (4616): No heartbeat from core client for 30 sec - exiting
09:24:21 (4616): No heartbeat from core client for 30 sec - exiting
09:24:22 (4616): No heartbeat from core client for 30 sec - exiting
09:24:23 (4616): No heartbeat from core client for 30 sec - exiting
09:24:24 (4616): No heartbeat from core client for 30 sec - exiting
09:24:26 (4616): No heartbeat from core client for 30 sec - exiting
09:24:27 (4616): No heartbeat from core client for 30 sec - exiting
09:24:28 (4616): No heartbeat from core client for 30 sec - exiting
09:24:29 (4616): No heartbeat from core client for 30 sec - exiting
09:24:30 (4616): No heartbeat from core client for 30 sec - exiting
09:24:31 (4616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:28:00 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
09:28:02 (4744): No heartbeat from core client for 30 sec - exiting
09:28:03 (4744): No heartbeat from core client for 30 sec - exiting
09:28:04 (4744): No heartbeat from core client for 30 sec - exiting
09:28:05 (4744): No heartbeat from core client for 30 sec - exiting
09:28:06 (4744): No heartbeat from core client for 30 sec - exiting
09:28:07 (4744): No heartbeat from core client for 30 sec - exiting
09:28:08 (4744): No heartbeat from core client for 30 sec - exiting
09:28:09 (4744): No heartbeat from core client for 30 sec - exiting
09:28:10 (4744): No heartbeat from core client for 30 sec - exiting
09:28:12 (4744): No heartbeat from core client for 30 sec - exiting
09:28:13 (4744): No heartbeat from core client for 30 sec - exiting
09:28:14 (4744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5512, selfPID=5024, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=3536, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=3688, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=1156, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1h5u_1977_1_006959674_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
30 Aug 2012 09:51:17 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 92,256 150,385 1.6301
30 Aug 2012 01:34:13 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 80,736 132,600 1.6424
29 Aug 2012 16:23:00 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 69,216 113,907 1.6457
29 Aug 2012 08:15:18 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 57,696 95,478 1.6548
29 Aug 2012 03:03:57 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 46,176 76,678 1.6606
28 Aug 2012 01:24:13 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 34,656 57,295 1.6532
27 Aug 2012 09:51:35 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 23,136 38,538 1.6657
24 Aug 2012 06:50:45 1224497 15179865 hadam3p_saf_1h5u_1977_1_006959674_1 11,616 19,429 1.6726


©2024 cpdn.org