Task 15176725

Name hadam3p_pnw_6uo6_2003_1_008152254_1
Workunit 8307378
Created 23 Aug 2012, 2:33:51 UTC
Sent 25 Aug 2012, 8:07:11 UTC
Report deadline 7 Aug 2013, 13:27:11 UTC
Received 23 Sep 2012, 15:14:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1220923
Run time 4 days 11 hours 27 min 16 sec
CPU time 4 days 8 hours 22 min 35 sec
Validate state Invalid
Credit 2,755.56
Device peak FLOPS 2.77 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2792, selfPID=5728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3384, selfPID=5400, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=5948, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6032, selfPID=5864, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2572, selfPID=5804, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=5712, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4780, selfPID=3544, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:00:29 (1552): No heartbeat from core client for 30 sec - exiting
17:00:30 (1552): No heartbeat from core client for 30 sec - exiting
17:00:31 (1552): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=444, selfPID=444, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5360, selfPID=1304, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 11
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_6uo6_2003_1_008152254/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_pnw_6uo6_2003_1_008152254/dataout/region_restart.day after 11 attempts

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO tmp/xaakm.pipe_dummy 2048
Leaving CPDN_Main::Monitor...
Regional yearly means requires 12 input files got 0
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_6uo6_2003_1_008152254_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
23 Sep 2012 07:10:51  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1   126,816         349,031            2.7523
15 Sep 2012 03:21:19  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1   115,296         318,946            2.7663
13 Sep 2012 18:11:47  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1   103,776         287,414            2.7696
05 Sep 2012 14:31:48  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    92,256         258,288            2.7997
03 Sep 2012 15:02:36  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    80,736         227,038            2.8121
02 Sep 2012 07:05:24  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    69,216         193,952            2.8021
01 Sep 2012 07:30:11  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    57,696         158,851            2.7532
29 Aug 2012 17:06:33  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    46,176         125,854            2.7255
28 Aug 2012 13:47:07  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    34,656          93,921            2.7101
26 Aug 2012 14:56:23  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    23,136          62,388            2.6966
25 Aug 2012 21:02:39  1220923  15176725   hadam3p_pnw_6uo6_2003_1_008152254_1    11,616          30,810            2.6524
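
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the numbers, not something the page states. A minimal Python sketch to check it, using values copied from the rows above (the names `TRICKLES` and `sec_per_timestep` are hypothetical and not part of BOINC or CPDN):

```python
# (timestep, cumulative CPU seconds, reported sec/TS), copied from the trickle rows.
TRICKLES = [
    (126816, 349031, 2.7523),
    (115296, 318946, 2.7663),
    (103776, 287414, 2.7696),
    (92256, 258288, 2.7997),
    (80736, 227038, 2.8121),
    (69216, 193952, 2.8021),
    (57696, 158851, 2.7532),
    (46176, 125854, 2.7255),
    (34656, 93921, 2.7101),
    (23136, 62388, 2.6966),
    (11616, 30810, 2.6524),
]


def sec_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Assumed definition: cumulative CPU seconds per completed timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = sec_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7}  computed {computed:.4f}  reported {reported:.4f}")
```

Under that assumption, the computed values agree with the reported column when rounded to four decimal places.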