Task 16408540

Name hadam3p_anz_n1j3_2012_1_008591340_0
Workunit 8737852
Created 26 Mar 2014, 17:55:28 UTC
Sent 30 Mar 2014, 15:34:13 UTC
Report deadline 12 Mar 2015, 20:54:13 UTC
Received 9 Apr 2014, 19:30:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1322693
Run time 5 days 17 hours 55 min 22 sec
CPU time 5 days 17 hours 0 min 15 sec
Validate state Invalid
Credit 4,484.28
Device peak FLOPS 2.65 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
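
The Run time and CPU time fields above are written as day/hour/minute/second strings. The following is a minimal sketch, assuming exactly that string format, that converts them to seconds so they can be compared with values reported in seconds elsewhere on the page. The helper name `duration_to_seconds` is hypothetical and not part of BOINC or CPDN.

```python
import re

# Seconds per unit, keyed by the unit stems used on this page (assumption).
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text: str) -> int:
    """Convert a string such as '5 days 17 hours 55 min 22 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("5 days 17 hours 55 min 22 sec")
cpu_time = duration_to_seconds("5 days 17 hours 0 min 15 sec")
print(run_time, cpu_time, run_time - cpu_time)  # 496522 493215 3307
```

Under these assumptions the task's wall-clock run time exceeds its CPU time by 3,307 seconds, a little over 55 minutes.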
Stderr
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5848, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8416, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8184, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6764, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5676, selfPID=4876, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8188, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3100, selfPID=6564, iMonCtr=1
Model crash detected, will try to restart...
06:38:36 (944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Colonal troller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPDD==292, iMonCnCtr=
2
del crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3860, selfPID=4960, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6136, selfPID=4048, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4936, selfPID=5096, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiGinglobal Worker1, checkProcess is not running, nCtr=2
, bRetVal = 1, checkPID=0, selfPID=3564, iMon
Ctr=2
Leaving CPDN_Main::Monitor...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_n1j3_2012_1_008591340/dataout/atmos_restart.day after 11 attempts
06:28:04 (5536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_n1j3_2012_1_008591340/dataout/atmos_restart.day after 11 attempts
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7252, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_n1j3_2012_1_008591340_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n1j3_2012_1_008591340_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_n1j3_2012_1_008591340_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
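
The stderr output above repeats a small number of message types. The following is a minimal sketch, assuming the text between `<stderr_txt>` and `</stderr_txt>` is available as a string, that tallies those messages by substring. The function name `tally` and the category labels are hypothetical, and `sample` is only a short excerpt of the lines above, not the full log.

```python
from collections import Counter

# Substrings copied from the log above; the labels are illustrative only.
PATTERNS = {
    "suspend": "Suspend request from BOINC",
    "crash": "Model crash detected",
    "no_heartbeat": "No heartbeat from core client",
    "restart_file_missing": "cannot open input file",
}

def tally(stderr_text: str) -> Counter:
    """Count how many lines contain each known message substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts

# Raw string so the backslashes in the Windows path are kept literally.
sample = r"""Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5848, iMonCtr=2
Model crash detected, will try to restart...
06:38:36 (944): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadam3p_anz_n1j3_2012_1_008591340/dataout/atmos_restart.day after 11 attempts
"""

counts = tally(sample)
for label in PATTERNS:
    print(label, counts[label])
```

Matching on substrings rather than whole lines means that lines garbled by interleaved output from several processes may be undercounted.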
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
08 Apr 2014 19:01:42  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  103,979   456,911         4.3943
07 Apr 2014 19:55:22  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  92,459    407,153         4.4036
06 Apr 2014 17:14:09  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  80,939    357,325         4.4147
05 Apr 2014 23:32:28  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  69,419    305,932         4.4070
05 Apr 2014 09:05:16  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  57,899    253,958         4.3862
04 Apr 2014 17:41:20  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  46,379    203,875         4.3958
03 Apr 2014 23:48:10  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  34,859    154,912         4.4440
02 Apr 2014 15:39:29  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  23,339    103,640         4.4406
01 Apr 2014 23:50:16  1322693  16408540   hadam3p_anz_n1j3_2012_1_008591340_0  11,819    51,015          4.3164
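
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. This is an inference from the numbers, not something the page states. The following is a minimal sketch that checks the assumption against three rows copied from the table; the variable names are illustrative.

```python
# (timestep, cpu_time_sec, reported_average) copied from three table rows.
trickles = [
    (103979, 456911, 4.3943),  # 08 Apr 2014
    (92459, 407153, 4.4036),   # 07 Apr 2014
    (11819, 51015, 4.3164),    # 01 Apr 2014
]

# Recompute the average under the assumed formula: CPU seconds / timesteps.
computed = [cpu_sec / timestep for timestep, cpu_sec, _ in trickles]

for (timestep, cpu_sec, reported), avg in zip(trickles, computed):
    print(f"{timestep:>7} timesteps: computed {avg:.4f}, reported {reported:.4f}")
```

For these three rows the recomputed value agrees with the reported one to four decimal places.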


©2024 climateprediction.net