Task 15207116

Name hadam3p_saf_15pj_1966_1_006916031_1
Workunit 7119347
Created 30 Aug 2012, 19:22:44 UTC
Sent 30 Aug 2012, 19:24:41 UTC
Report deadline 13 Aug 2013, 0:44:41 UTC
Received 17 Sep 2012, 19:04:19 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1142011
Run time 2 days 23 hours 14 min 20 sec
CPU time 2 days 21 hours 35 min 53 sec
Validate state Invalid
Credit 1,870.33
Device peak FLOPS 2.84 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
GClntroll W::ker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7072, iMonCtr=2
 crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7940, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6416, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6348, selfPID=2944, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
21:31:01 (4812): No heartbeat from core client for 30 sec - exiting
21:31:02 (4812): No heartbeat from core client for 30 sec - exiting
21:31:03 (4812): No heartbeat from core client for 30 sec - exiting
21:31:04 (4812): No heartbeat from core client for 30 sec - exiting
21:31:05 (4812): No heartbeat from core client for 30 sec - exiting
21:31:06 (4812): No heartbeat from core client for 30 sec - exiting
21:31:07 (4812): No heartbeat from core client for 30 sec - exiting
21:31:08 (4812): No heartbeat from core client for 30 sec - exiting
21:31:09 (4812): No heartbeat from core client for 30 sec - exiting
21:31:11 (4812): No heartbeat from core client for 30 sec - exiting
21:31:12 (4812): No heartbeat from core client for 30 sec - exiting
21:31:13 (4812): No heartbeat from core client for 30 sec - exiting
21:31:14 (4812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6888, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6820, selfPID=5948, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4400, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2988, iMonCtr=2
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2052, selfPID=3028, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1504, selfPID=1536, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6156, selfPID=3320, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_15pj_1966_1_006916031\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_saf_um_6.  0060A39A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  005B2CD0  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  005B1E9A  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00592819  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  00492287  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  0052E7B2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  0052F2DA  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  002A9BD2  Unknown               Unknown  Unknown
hadam3p_saf_um_6.  005EE638  Unknown               Unknown  Unknown
kernel32.dll       74D7339A  Unknown               Unknown  Unknown
ntdll.dll          77049EF2  Unknown               Unknown  Unknown
ntdll.dll          77049EC5  Unknown               Unknown  Unknown
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_saf_15pj_1966_1_006916031\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_saf_um_6.  006DC52A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00684460  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  0068362A  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00662469  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  005666EB  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  00602AE2  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  006035AF  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  003A9860  Unknown               Unknown  Unknown
hadrm3p_saf_um_6.  006C0893  Unknown               Unknown  Unknown
kernel32.dll       74D7339A  Unknown               Unknown  Unknown
ntdll.dll          77049EF2  Unknown               Unknown  Unknown
ntdll.dll          77049EC5  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1196, selfPID=6300, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_saf_15pj_1966_1_006916031_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_saf_15pj_1966_1_006916031_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                          Timestep  CPU Time (sec)  Average (sec/TS)
14 Sep 2012 21:49:38  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  115,296   248,573         2.1560
12 Sep 2012 21:34:01  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  103,776   225,265         2.1707
10 Sep 2012 18:12:35  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  92,256    199,841         2.1662
09 Sep 2012 13:06:01  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  80,736    174,475         2.1611
07 Sep 2012 17:30:44  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  69,216    148,506         2.1455
03 Sep 2012 20:52:18  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  57,696    123,850         2.1466
02 Sep 2012 18:08:52  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  46,176    99,425          2.1532
02 Sep 2012 11:01:13  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  34,656    76,248          2.2001
01 Sep 2012 16:57:25  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  23,136    49,552          2.1418
31 Aug 2012 21:42:52  1142011  15207116   hadam3p_saf_15pj_1966_1_006916031_1  11,616    24,226          2.0856
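
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the rows of the table. It is an illustration only: the name `average_sec_per_timestep` and the `TRICKLES` list are hypothetical helpers introduced here, not part of the climateprediction.net or BOINC software, and the division rule is an assumption inferred from the numbers.

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (115296, 248573, 2.1560),
    (103776, 225265, 2.1707),
    (92256, 199841, 2.1662),
    (80736, 174475, 2.1611),
    (69216, 148506, 2.1455),
    (57696, 123850, 2.1466),
    (46176, 99425, 2.1532),
    (34656, 76248, 2.2001),
    (23136, 49552, 2.1418),
    (11616, 24226, 2.0856),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Assumed rule: cumulative CPU seconds divided by completed timesteps."""
    return cpu_time_sec / timestep


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in TRICKLES:
        computed = average_sec_per_timestep(cpu_time_sec, timestep)
        # Show the recomputed value next to the value the page reports.
        print(f"{timestep:>7}  {cpu_time_sec:>7}  "
              f"computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption, every recomputed value agrees with the reported one to four decimal places, so the column can be treated as a running average rather than a per-trickle rate.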


©2024 climateprediction.net