Task 12606985

Name hadam3p_saf_1s91_1996_1_007002845_2
Workunit 7206161
Created 20 Feb 2011, 22:04:38 UTC
Sent 20 Feb 2011, 22:57:25 UTC
Report deadline 3 Feb 2012, 4:17:25 UTC
Received 14 Mar 2011, 19:56:41 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1135560
Run time 15 hours 2 min 1 sec
CPU time 1 hour 53 min 53 sec
Validate state Invalid
Credit 0.00
Device peak FLOPS 0.74 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4440, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4924, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:07:41 (3864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4296, selfPID=3584, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4384, iMonCtr=2
Model crash detected, will try to restart...
11:12:36 (5068): No heartbeat from core client for 30 sec - exiting
11:12:37 (5068): No heartbeat from core client for 30 sec - exiting
11:12:38 (5068): No heartbeat from core client for 30 sec - exiting
11:12:39 (5068): No heartbeat from core client for 30 sec - exiting
11:12:40 (5068): No heartbeat from core client for 30 sec - exiting
11:12:41 (5068): No heartbeat from core client for 30 sec - exiting
11:12:42 (5068): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:35:05 (2940): No heartbeat from core client for 30 sec - exiting
20:35:06 (2940): No heartbeat from core client for 30 sec - exiting
20:35:07 (2940): No heartbeat from core client for 30 sec - exiting
20:35:08 (2940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4248, iMonCtr=2
Model crash detected, will try to restart...
22:28:21 (4004): No heartbeat from core client for 30 sec - exiting
22:28:22 (4004): No heartbeat from core client for 30 sec - exiting
22:28:23 (4004): No heartbeat from core client for 30 sec - exiting
22:28:24 (4004): No heartbeat from core client for 30 sec - exiting
22:28:25 (4004): No heartbeat from core client for 30 sec - exiting
22:28:26 (4004): No heartbeat from core client for 30 sec - exiting
22:28:27 (4004): No heartbeat from core client for 30 sec - exiting
22:28:28 (4004): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5972, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5544, selfPID=1072, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:46:34 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5732, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=2
Model crash detected, will try to restart...
21:54:51 (4552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:30:43 (1248): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4400, selfPID=4400, iMonCtr=2
22:00:12 (2756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5976, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4364, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
10:07:46 (4468): No heartbeat from core client for 30 sec - exiting
10:07:47 (4468): No heartbeat from core client for 30 sec - exiting
10:07:48 (4468): No heartbeat from core client for 30 sec - exiting
10:07:49 (4468): No heartbeat from core client for 30 sec - exiting
10:07:50 (4468): No heartbeat from core client for 30 sec - exiting
10:07:51 (4468): No heartbeat from core client for 30 sec - exiting
10:07:52 (4468): No heartbeat from core client for 30 sec - exiting
10:07:53 (4468): No heartbeat from core client for 30 sec - exiting
10:07:54 (4468): No heartbeat from core client for 30 sec - exiting
10:07:55 (4468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5224, selfPID=3728, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:29:24 (3728): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_1.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_2.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_3.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_4.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_1s91_1996_1_007002845_2_13.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
No trickles!

