climateprediction.net home page
Task 13274925

Task 13274925

Name hadam3p_saf_26hn_1986_1_007408648_1
Workunit 7606078
Created 18 Aug 2011, 14:49:15 UTC
Sent 18 Aug 2011, 15:37:24 UTC
Report deadline 30 Jul 2012, 20:57:24 UTC
Received 1 Dec 2011, 16:30:31 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1107482
Run time 1 days 20 hours 51 min 4 sec
CPU time 4 hours 8 min 31 sec
Validate state Invalid
Credit 935.95
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
14:09:04 (4548): No heartbeat from core client for 30 sec - exiting
14:09:05 (4548): No heartbeat from core client for 30 sec - exiting
14:09:06 (4548): No heartbeat from core client for 30 sec - exiting
14:09:07 (4548): No heartbeat from core client for 30 sec - exiting
14:09:08 (4548): No heartbeat from core client for 30 sec - exiting
14:09:09 (4548): No heartbeat from core client for 30 sec - exiting
14:09:10 (4548): No heartbeat from core client for 30 sec - exiting
14:09:11 (4548): No heartbeat from core client for 30 sec - exiting
14:09:12 (4548): No heartbeat from core client for 30 sec - exiting
14:09:13 (4548): No heartbeat from core client for 30 sec - exiting
14:09:14 (4548): No heartbeat from core client for 30 sec - exiting
14:09:15 (4548): No heartbeat from core client for 30 sec - exiting
14:09:16 (4548): No heartbeat from core client for 30 sec - exiting
14:09:17 (4548): No heartbeat from core client for 30 sec - exiting
14:09:18 (4548): No heartbeat from core client for 30 sec - exiting
14:09:19 (4548): No heartbeat from core client for 30 sec - exiting
14:09:20 (4548): No heartbeat from core client for 30 sec - exiting
14:09:21 (4548): No heartbeat from core client for 30 sec - exiting
14:09:22 (4548): No heartbeat from core client for 30 sec - exiting
14:09:23 (4548): No heartbeat from core client for 30 sec - exiting
14:09:24 (4548): No heartbeat from core client for 30 sec - exiting
14:09:25 (4548): No heartbeat from core client for 30 sec - exiting
14:09:26 (4548): No heartbeat from core client for 30 sec - exiting
14:09:28 (4548): No heartbeat from core client for 30 sec - exiting
14:09:29 (4548): No heartbeat from core client for 30 sec - exiting
14:09:30 (4548): No heartbeat from core client for 30 sec - exiting
14:09:31 (4548): No heartbeat from core client for 30 sec - exiting
14:09:32 (4548): No heartbeat from core client for 30 sec - exiting
14:09:33 (4548): No heartbeat from core client for 30 sec - exiting
14:09:34 (4548): No heartbeat from core client for 30 sec - exiting
14:09:35 (4548): No heartbeat from core client for 30 sec - exiting
14:09:36 (4548): No heartbeat from core client for 30 sec - exiting
14:09:37 (4548): No heartbeat from core client for 30 sec - exiting
14:09:38 (4548): No heartbeat from core client for 30 sec - exiting
14:09:40 (4548): No heartbeat from core client for 30 sec - exiting
14:09:41 (4548): No heartbeat from core client for 30 sec - exiting
14:09:42 (4548): No heartbeat from core client for 30 sec - exiting
14:09:43 (4548): No heartbeat from core client for 30 sec - exiting
14:09:44 (4548): No heartbeat from core client for 30 sec - exiting
14:09:45 (4548): No heartbeat from core client for 30 sec - exiting
14:09:46 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4764, selfPID=4764, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4672, selfPID=4444, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6488, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
09:33:26 (2816): No heartbeat from core client for 30 sec - exiting
09:33:27 (2816): No heartbeat from core client for 30 sec - exiting
09:33:28 (2816): No heartbeat from core client for 30 sec - exiting
09:33:29 (2816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
15:56:27 (2856): No heartbeat from core client for 30 sec - exiting
15:56:28 (2856): No heartbeat from core client for 30 sec - exiting
15:56:29 (2856): No heartbeat from core client for 30 sec - exiting
15:56:30 (2856): No heartbeat from core client for 30 sec - exiting
15:56:31 (2856): No heartbeat from core client for 30 sec - exiting
15:56:32 (2856): No heartbeat from core client for 30 sec - exiting
15:56:33 (2856): No heartbeat from core client for 30 sec - exiting
15:56:34 (2856): No heartbeat from core client for 30 sec - exiting
15:56:35 (2856): No heartbeat from core client for 30 sec - exiting
15:56:36 (2856): No heartbeat from core client for 30 sec - exiting
15:56:37 (2856): No heartbeat from core client for 30 sec - exiting
15:56:38 (2856): No heartbeat from core client for 30 sec - exiting
15:56:39 (2856): No heartbeat from core client for 30 sec - exiting
15:56:40 (2856): No heartbeat from core client for 30 sec - exiting
15:56:41 (2856): No heartbeat from core client for 30 sec - exiting
15:56:42 (2856): No heartbeat from core client for 30 sec - exiting
15:56:43 (2856): No heartbeat from core client for 30 sec - exiting
15:56:44 (2856): No heartbeat from core client for 30 sec - exiting
15:56:45 (2856): No heartbeat from core client for 30 sec - exiting
15:56:46 (2856): No heartbeat from core client for 30 sec - exiting
15:56:47 (2856): No heartbeat from core client for 30 sec - exiting
15:56:48 (2856): No heartbeat from core client for 30 sec - exiting
15:56:49 (2856): No heartbeat from core client for 30 sec - exiting
15:56:50 (2856): No heartbeat from core client for 30 sec - exiting
15:56:51 (2856): No heartbeat from core client for 30 sec - exiting
15:56:52 (2856): No heartbeat from core client for 30 sec - exiting
15:56:53 (2856): No heartbeat from core client for 30 sec - exiting
15:56:54 (2856): No heartbeat from core client for 30 sec - exiting
15:56:55 (2856): No heartbeat from core client for 30 sec - exiting
15:56:56 (2856): No heartbeat from core client for 30 sec - exiting
15:56:57 (2856): No heartbeat from core client for 30 sec - exiting
15:56:58 (2856): No heartbeat from core client for 30 sec - exiting
15:56:59 (2856): No heartbeat from core client for 30 sec - exiting
15:57:00 (2856): No heartbeat from core client for 30 sec - exiting
15:57:01 (2856): No heartbeat from core client for 30 sec - exiting
15:57:02 (2856): No heartbeat from core client for 30 sec - exiting
15:57:03 (2856): No heartbeat from core client for 30 sec - exiting
15:57:04 (2856): No heartbeat from core client for 30 sec - exiting
15:57:05 (2856): No heartbeat from core client for 30 sec - exiting
15:57:06 (2856): No heartbeat from core client for 30 sec - exiting
15:57:07 (2856): No heartbeat from core client for 30 sec - exiting
15:57:08 (2856): No heartbeat from core client for 30 sec - exiting
15:57:09 (2856): No heartbeat from core client for 30 sec - exiting
15:57:10 (2856): No heartbeat from core client for 30 sec - exiting
15:57:11 (2856): No heartbeat from core client for 30 sec - exiting
15:57:12 (2856): No heartbeat from core client for 30 sec - exiting
15:57:13 (2856): No heartbeat from core client for 30 sec - exiting
15:57:15 (2856): No heartbeat from core client for 30 sec - exiting
15:57:16 (2856): No heartbeat from core client for 30 sec - exiting
15:57:17 (2856): No heartbeat from core client for 30 sec - exiting
15:57:18 (2856): No heartbeat from core client for 30 sec - exiting
15:57:19 (2856): No heartbeat from core client for 30 sec - exiting
15:57:20 (2856): No heartbeat from core client for 30 sec - exiting
15:57:21 (2856): No heartbeat from core client for 30 sec - exiting
15:57:22 (2856): No heartbeat from core client for 30 sec - exiting
15:57:23 (2856): No heartbeat from core client for 30 sec - exiting
15:57:24 (2856): No heartbeat from core client for 30 sec - exiting
15:57:25 (2856): No heartbeat from core client for 30 sec - exiting
15:57:27 (2856): No heartbeat from core client for 30 sec - exiting
15:57:28 (2856): No heartbeat from core client for 30 sec - exiting
15:57:29 (2856): No heartbeat from core client for 30 sec - exiting
15:57:30 (2856): No heartbeat from core client for 30 sec - exiting
15:57:31 (2856): No heartbeat from core client for 30 sec - exiting
15:57:32 (2856): No heartbeat from core client for 30 sec - exiting
15:57:33 (2856): No heartbeat from core client for 30 sec - exiting
15:57:34 (2856): No heartbeat from core client for 30 sec - exiting
15:57:35 (2856): No heartbeat from core client for 30 sec - exiting
15:57:36 (2856): No heartbeat from core client for 30 sec - exiting
15:57:37 (2856): No heartbeat from core client for 30 sec - exiting
15:57:38 (2856): No heartbeat from core client for 30 sec - exiting
15:57:39 (2856): No heartbeat from core client for 30 sec - exiting
15:57:40 (2856): No heartbeat from core client for 30 sec - exiting
15:57:41 (2856): No heartbeat from core client for 30 sec - exiting
15:57:42 (2856): No heartbeat from core client for 30 sec - exiting
15:57:43 (2856): No heartbeat from core client for 30 sec - exiting
15:57:44 (2856): No heartbeat from core client for 30 sec - exiting
15:57:45 (2856): No heartbeat from core client for 30 sec - exiting
15:57:46 (2856): No heartbeat from core client for 30 sec - exiting
15:57:47 (2856): No heartbeat from core client for 30 sec - exiting
15:57:48 (2856): No heartbeat from core client for 30 sec - exiting
15:57:49 (2856): No heartbeat from core client for 30 sec - exiting
15:57:50 (2856): No heartbeat from core client for 30 sec - exiting
15:57:51 (2856): No heartbeat from core client for 30 sec - exiting
15:57:52 (2856): No heartbeat from core client for 30 sec - exiting
15:57:53 (2856): No heartbeat from core client for 30 sec - exiting
15:57:54 (2856): No heartbeat from core client for 30 sec - exiting
15:57:55 (2856): No heartbeat from core client for 30 sec - exiting
15:57:56 (2856): No heartbeat from core client for 30 sec - exiting
15:57:57 (2856): No heartbeat from core client for 30 sec - exiting
15:57:58 (2856): No heartbeat from core client for 30 sec - exiting
15:57:59 (2856): No heartbeat from core client for 30 sec - exiting
15:58:00 (2856): No heartbeat from core client for 30 sec - exiting
15:58:01 (2856): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:15:07 (4260): No heartbeat from core client for 30 sec - exiting
13:15:08 (4260): No heartbeat from core client for 30 sec - exiting
13:15:09 (4260): No heartbeat from core client for 30 sec - exiting
13:15:10 (4260): No heartbeat from core client for 30 sec - exiting
13:15:11 (4260): No heartbeat from core client for 30 sec - exiting
13:15:12 (4260): No heartbeat from core client for 30 sec - exiting
13:15:13 (4260): No heartbeat from core client for 30 sec - exiting
13:15:14 (4260): No heartbeat from core client for 30 sec - exiting
13:15:15 (4260): No heartbeat from core client for 30 sec - exiting
13:15:16 (4260): No heartbeat from core client for 30 sec - exiting
13:15:17 (4260): No heartbeat from core client for 30 sec - exiting
13:15:18 (4260): No heartbeat from core client for 30 sec - exiting
13:15:19 (4260): No heartbeat from core client for 30 sec - exiting
13:15:20 (4260): No heartbeat from core client for 30 sec - exiting
13:15:21 (4260): No heartbeat from core client for 30 sec - exiting
13:15:22 (4260): No heartbeat from core client for 30 sec - exiting
13:15:23 (4260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2644, selfPID=3400, iMonCtr=1
Model crash detected, will try to restart...
18:45:37 (4024): No heartbeat from core client for 30 sec - exiting
18:45:38 (4024): No heartbeat from core client for 30 sec - exiting
18:45:39 (4024): No heartbeat from core client for 30 sec - exiting
18:45:40 (4024): No heartbeat from core client for 30 sec - exiting
18:45:41 (4024): No heartbeat from core client for 30 sec - exiting
18:45:42 (4024): No heartbeat from core client for 30 sec - exiting
18:45:43 (4024): No heartbeat from core client for 30 sec - exiting
18:45:44 (4024): No heartbeat from core client for 30 sec - exiting
18:45:45 (4024): No heartbeat from core client for 30 sec - exiting
18:45:46 (4024): No heartbeat from core client for 30 sec - exiting
18:45:47 (4024): No heartbeat from core client for 30 sec - exiting
18:45:48 (4024): No heartbeat from core client for 30 sec - exiting
18:45:49 (4024): No heartbeat from core client for 30 sec - exiting
18:45:50 (4024): No heartbeat from core client for 30 sec - exiting
18:45:51 (4024): No heartbeat from core client for 30 sec - exiting
18:45:52 (4024): No heartbeat from core client for 30 sec - exiting
18:45:53 (4024): No heartbeat from core client for 30 sec - exiting
18:45:54 (4024): No heartbeat from core client for 30 sec - exiting
18:45:55 (4024): No heartbeat from core client for 30 sec - exiting
18:45:56 (4024): No heartbeat from core client for 30 sec - exiting
18:45:57 (4024): No heartbeat from core client for 30 sec - exiting
18:45:58 (4024): No heartbeat from core client for 30 sec - exiting
18:45:59 (4024): No heartbeat from core client for 30 sec - exiting
18:46:00 (4024): No heartbeat from core client for 30 sec - exiting
18:46:01 (4024): No heartbeat from core client for 30 sec - exiting
18:46:02 (4024): No heartbeat from core client for 30 sec - exiting
18:46:04 (4024): No heartbeat from core client for 30 sec - exiting
18:46:05 (4024): No heartbeat from core client for 30 sec - exiting
18:46:06 (4024): No heartbeat from core client for 30 sec - exiting
18:46:07 (4024): No heartbeat from core client for 30 sec - exiting
18:46:08 (4024): No heartbeat from core client for 30 sec - exiting
18:46:09 (4024): No heartbeat from core client for 30 sec - exiting
18:46:10 (4024): No heartbeat from core client for 30 sec - exiting
18:46:11 (4024): No heartbeat from core client for 30 sec - exiting
18:46:12 (4024): No heartbeat from core client for 30 sec - exiting
18:46:13 (4024): No heartbeat from core client for 30 sec - exiting
18:46:14 (4024): No heartbeat from core client for 30 sec - exiting
18:46:16 (4024): No heartbeat from core client for 30 sec - exiting
18:46:17 (4024): No heartbeat from core client for 30 sec - exiting
18:46:18 (4024): No heartbeat from core client for 30 sec - exiting
18:46:19 (4024): No heartbeat from core client for 30 sec - exiting
18:46:20 (4024): No heartbeat from core client for 30 sec - exiting
18:46:21 (4024): No heartbeat from core client for 30 sec - exiting
18:46:22 (4024): No heartbeat from core client for 30 sec - exiting
18:46:23 (4024): No heartbeat from core client for 30 sec - exiting
18:46:24 (4024): No heartbeat from core client for 30 sec - exiting
18:46:25 (4024): No heartbeat from core client for 30 sec - exiting
18:46:26 (4024): No heartbeat from core client for 30 sec - exiting
18:46:28 (4024): No heartbeat from core client for 30 sec - exiting
18:46:29 (4024): No heartbeat from core client for 30 sec - exiting
18:46:30 (4024): No heartbeat from core client for 30 sec - exiting
18:46:31 (4024): No heartbeat from core client for 30 sec - exiting
18:46:32 (4024): No heartbeat from core client for 30 sec - exiting
18:46:33 (4024): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1572, selfPID=2324, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2344, selfPID=1504, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5596, selfPID=4656, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4888, selfPID=2224, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5360, selfPID=3552, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=3536, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5664, selfPID=5984, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4292, selfPID=452, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4180, selfPID=4908, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2896, selfPID=2896, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3032, selfPID=3032, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:21:27 (6008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5540, selfPID=5540, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5516, selfPID=5516, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5872, selfPID=4416, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=5604, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4168, selfPID=4168, iMonCtr=2
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_26hn_1986_1_007408648_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 Nov 2011 14:07:18 1107482 13274925 hadam3p_saf_26hn_1986_1_007408648_1 57,696 128,377 2.2251
24 Nov 2011 10:52:12 1107482 13274925 hadam3p_saf_26hn_1986_1_007408648_1 46,176 106,993 2.3171
21 Nov 2011 14:45:05 1107482 13274925 hadam3p_saf_26hn_1986_1_007408648_1 34,656 84,952 2.4513
19 Nov 2011 08:48:40 1107482 13274925 hadam3p_saf_26hn_1986_1_007408648_1 23,136 59,812 2.5852
06 Sep 2011 09:58:13 1107482 13274925 hadam3p_saf_26hn_1986_1_007408648_1 11,616 29,633 2.5511


©2024 cpdn.org