climateprediction.net home page
Task 12331311

Task 12331311

Name hadam3p_pnw_zw04_1963_1_007043996_0
Workunit 7247312
Created 25 Nov 2010, 12:42:15 UTC
Sent 5 Dec 2010, 10:36:46 UTC
Report deadline 17 Nov 2011, 15:56:46 UTC
Received 21 Dec 2010, 15:31:19 UTC
Server state Over
Outcome No reply
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1050294
Run time 5 days 7 hours 5 min 55 sec
CPU time 4 days 2 hours 39 min 36 sec
Validate state Invalid
Credit 2,254.93
Device peak FLOPS 2.26 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.08
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3104, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
11:58:32 (3852): called boinc_finish
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6060, selfPID=6060, iMonCtr=2
16:28:14 (4836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4820, selfPID=4952, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6100, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 3
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=5308, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 4
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=488, selfPID=220, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3952, selfPID=5764, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=836, selfPID=1480, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2268, selfPID=3712, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 5
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1124, selfPID=4984, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 6
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5012, selfPID=5900, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 7
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6016, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=2280, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 8
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5640, selfPID=3092, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 9
00:05:42 (3092): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_pnw_zw04_1963_1_007043996_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zw04_1963_1_007043996_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_zw04_1963_1_007043996_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Dec 2010 19:46:19 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 103,776 345,677 3.3310
18 Dec 2010 22:22:39 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 92,256 306,774 3.3252
17 Dec 2010 23:52:34 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 80,736 271,432 3.3620
16 Dec 2010 18:47:03 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 69,216 233,945 3.3799
13 Dec 2010 20:51:03 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 57,696 194,183 3.3656
13 Dec 2010 00:28:32 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 46,176 157,849 3.4184
12 Dec 2010 00:32:01 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 34,656 119,832 3.4578
11 Dec 2010 11:44:54 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 23,136 81,405 3.5185
09 Dec 2010 17:36:48 1050294 12331311 hadam3p_pnw_zw04_1963_1_007043996_0 11,616 40,487 3.4855


©2024 cpdn.org