Task 13274941

Name: hadam3p_saf_2agm_1980_1_007403272_2
Workunit: 7600702
Created: 18 Aug 2011, 15:01:04 UTC
Sent: 18 Aug 2011, 15:37:24 UTC
Report deadline: 30 Jul 2012, 20:57:24 UTC
Received: 1 Dec 2011, 16:30:31 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 0 (0x00000000)
Computer ID: 1107482
Run time: 1 days 5 hours 55 min 43 sec
CPU time: 4 hours 18 min 50 sec
Validate state: Invalid
Credit: 562.19
Device peak FLOPS: 2.94 GFLOPS
Application version: UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4324, selfPID=4324, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3292, selfPID=6704, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2992, selfPID=4980, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
09:33:26 (2964): No heartbeat from core client for 30 sec - exiting
09:33:27 (2964): No heartbeat from core client for 30 sec - exiting
09:33:28 (2964): No heartbeat from core client for 30 sec - exiting
09:33:29 (2964): No heartbeat from core client for 30 sec - exiting
15:56:27 (3904): No heartbeat from core client for 30 sec - exiting
15:56:28 (3904): No heartbeat from core client for 30 sec - exiting
15:56:29 (3904): No heartbeat from core client for 30 sec - exiting
15:56:30 (3904): No heartbeat from core client for 30 sec - exiting
15:56:31 (3904): No heartbeat from core client for 30 sec - exiting
15:56:32 (3904): No heartbeat from core client for 30 sec - exiting
15:56:33 (3904): No heartbeat from core client for 30 sec - exiting
15:56:34 (3904): No heartbeat from core client for 30 sec - exiting
15:56:36 (3904): No heartbeat from core client for 30 sec - exiting
15:56:37 (3904): No heartbeat from core client for 30 sec - exiting
15:56:38 (3904): No heartbeat from core client for 30 sec - exiting
15:56:39 (3904): No heartbeat from core client for 30 sec - exiting
15:56:40 (3904): No heartbeat from core client for 30 sec - exiting
15:56:41 (3904): No heartbeat from core client for 30 sec - exiting
15:56:42 (3904): No heartbeat from core client for 30 sec - exiting
15:56:43 (3904): No heartbeat from core client for 30 sec - exiting
15:56:44 (3904): No heartbeat from core client for 30 sec - exiting
15:56:45 (3904): No heartbeat from core client for 30 sec - exiting
15:56:46 (3904): No heartbeat from core client for 30 sec - exiting
15:56:47 (3904): No heartbeat from core client for 30 sec - exiting
15:56:48 (3904): No heartbeat from core client for 30 sec - exiting
15:56:49 (3904): No heartbeat from core client for 30 sec - exiting
15:56:50 (3904): No heartbeat from core client for 30 sec - exiting
15:56:51 (3904): No heartbeat from core client for 30 sec - exiting
15:56:52 (3904): No heartbeat from core client for 30 sec - exiting
15:56:53 (3904): No heartbeat from core client for 30 sec - exiting
15:56:54 (3904): No heartbeat from core client for 30 sec - exiting
15:56:55 (3904): No heartbeat from core client for 30 sec - exiting
15:56:56 (3904): No heartbeat from core client for 30 sec - exiting
15:56:57 (3904): No heartbeat from core client for 30 sec - exiting
15:56:58 (3904): No heartbeat from core client for 30 sec - exiting
15:56:59 (3904): No heartbeat from core client for 30 sec - exiting
15:57:00 (3904): No heartbeat from core client for 30 sec - exiting
15:57:01 (3904): No heartbeat from core client for 30 sec - exiting
15:57:02 (3904): No heartbeat from core client for 30 sec - exiting
15:57:03 (3904): No heartbeat from core client for 30 sec - exiting
15:57:04 (3904): No heartbeat from core client for 30 sec - exiting
15:57:05 (3904): No heartbeat from core client for 30 sec - exiting
15:57:06 (3904): No heartbeat from core client for 30 sec - exiting
15:57:07 (3904): No heartbeat from core client for 30 sec - exiting
15:57:08 (3904): No heartbeat from core client for 30 sec - exiting
15:57:09 (3904): No heartbeat from core client for 30 sec - exiting
15:57:10 (3904): No heartbeat from core client for 30 sec - exiting
15:57:11 (3904): No heartbeat from core client for 30 sec - exiting
15:57:12 (3904): No heartbeat from core client for 30 sec - exiting
15:57:13 (3904): No heartbeat from core client for 30 sec - exiting
15:57:14 (3904): No heartbeat from core client for 30 sec - exiting
15:57:15 (3904): No heartbeat from core client for 30 sec - exiting
15:57:16 (3904): No heartbeat from core client for 30 sec - exiting
15:57:17 (3904): No heartbeat from core client for 30 sec - exiting
15:57:18 (3904): No heartbeat from core client for 30 sec - exiting
15:57:19 (3904): No heartbeat from core client for 30 sec - exiting
15:57:20 (3904): No heartbeat from core client for 30 sec - exiting
15:57:21 (3904): No heartbeat from core client for 30 sec - exiting
15:57:22 (3904): No heartbeat from core client for 30 sec - exiting
15:57:23 (3904): No heartbeat from core client for 30 sec - exiting
15:57:24 (3904): No heartbeat from core client for 30 sec - exiting
15:57:26 (3904): No heartbeat from core client for 30 sec - exiting
15:57:27 (3904): No heartbeat from core client for 30 sec - exiting
15:57:28 (3904): No heartbeat from core client for 30 sec - exiting
15:57:29 (3904): No heartbeat from core client for 30 sec - exiting
15:57:30 (3904): No heartbeat from core client for 30 sec - exiting
15:57:31 (3904): No heartbeat from core client for 30 sec - exiting
15:57:32 (3904): No heartbeat from core client for 30 sec - exiting
15:57:33 (3904): No heartbeat from core client for 30 sec - exiting
15:57:34 (3904): No heartbeat from core client for 30 sec - exiting
15:57:35 (3904): No heartbeat from core client for 30 sec - exiting
15:57:36 (3904): No heartbeat from core client for 30 sec - exiting
15:57:37 (3904): No heartbeat from core client for 30 sec - exiting
15:57:38 (3904): No heartbeat from core client for 30 sec - exiting
15:57:39 (3904): No heartbeat from core client for 30 sec - exiting
15:57:40 (3904): No heartbeat from core client for 30 sec - exiting
15:57:41 (3904): No heartbeat from core client for 30 sec - exiting
15:57:42 (3904): No heartbeat from core client for 30 sec - exiting
15:57:43 (3904): No heartbeat from core client for 30 sec - exiting
15:57:44 (3904): No heartbeat from core client for 30 sec - exiting
15:57:45 (3904): No heartbeat from core client for 30 sec - exiting
15:57:46 (3904): No heartbeat from core client for 30 sec - exiting
15:57:47 (3904): No heartbeat from core client for 30 sec - exiting
15:57:48 (3904): No heartbeat from core client for 30 sec - exiting
15:57:49 (3904): No heartbeat from core client for 30 sec - exiting
15:57:50 (3904): No heartbeat from core client for 30 sec - exiting
15:57:51 (3904): No heartbeat from core client for 30 sec - exiting
15:57:52 (3904): No heartbeat from core client for 30 sec - exiting
15:57:53 (3904): No heartbeat from core client for 30 sec - exiting
15:57:54 (3904): No heartbeat from core client for 30 sec - exiting
15:57:55 (3904): No heartbeat from core client for 30 sec - exiting
15:57:56 (3904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:57:57 (3904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
13:15:07 (4284): No heartbeat from core client for 30 sec - exiting
13:15:08 (4284): No heartbeat from core client for 30 sec - exiting
13:15:09 (4284): No heartbeat from core client for 30 sec - exiting
13:15:10 (4284): No heartbeat from core client for 30 sec - exiting
13:15:11 (4284): No heartbeat from core client for 30 sec - exiting
13:15:12 (4284): No heartbeat from core client for 30 sec - exiting
13:15:13 (4284): No heartbeat from core client for 30 sec - exiting
13:15:14 (4284): No heartbeat from core client for 30 sec - exiting
13:15:15 (4284): No heartbeat from core client for 30 sec - exiting
13:15:16 (4284): No heartbeat from core client for 30 sec - exiting
13:15:17 (4284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:15:18 (4284): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3424, selfPID=4940, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1160, selfPID=5596, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2716, selfPID=464, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3688, selfPID=3688, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1132, selfPID=1132, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6960, selfPID=6960, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6112, selfPID=6112, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=4552, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1556, selfPID=1556, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5552, selfPID=5552, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4092, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=968, selfPID=968, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_2agm_1980_1_007403272_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
25 Nov 2011 15:12:35  1107482  13274941   hadam3p_saf_2agm_1980_1_007403272_2  34,656    76,497          2.2073
24 Nov 2011 10:52:12  1107482  13274941   hadam3p_saf_2agm_1980_1_007403272_2  23,136    54,754          2.3666
21 Nov 2011 14:45:05  1107482  13274941   hadam3p_saf_2agm_1980_1_007403272_2  11,616    33,106          2.8500
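
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. This is inferred from the three rows above, not stated by the page. A minimal sketch that recomputes the column under that assumption follows; the function name and the tuple layout are illustrative only and are not part of any BOINC or CPDN tool.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (34656, 76497, 2.2073),
    (23136, 54754, 2.3666),
    (11616, 33106, 2.8500),
]


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed formula: cumulative CPU seconds divided by timesteps completed."""
    return round(cpu_time_sec / timestep, 4)


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        # Compare with a tolerance of half a unit in the fourth decimal place.
        matches = abs(computed - reported) < 5e-5
        print(f"timestep={timestep} computed={computed:.4f} "
              f"reported={reported:.4f} match={matches}")
```

Under this assumption the recomputed values agree with all three reported averages to four decimal places.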