climateprediction.net home page
Task 13300556

Task 13300556

Name hadam3p_eu_2m1e_1987_1_007424998_0
Workunit 7622633
Created 26 Aug 2011, 13:31:24 UTC
Sent 26 Aug 2011, 13:31:33 UTC
Report deadline 7 Aug 2012, 18:51:33 UTC
Received 18 Sep 2011, 12:12:20 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1159510
Run time 11 days 23 hours 34 min 19 sec
CPU time 7 days 15 hours 34 min 7 sec
Validate state Invalid
Credit 1,392.75
Device peak FLOPS 1.22 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1296, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3748, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3524, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2680, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2156, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3140, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3260, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1784, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3400, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3188, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2420, selfPID=1656, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN procesController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2840, selfPID=3160, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3776, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3848, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1832, selfPID=3496, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3188, selfPID=3248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=2
ontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=884, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3132, selfPID=3248, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1508, selfPID=2628, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2084, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2464, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3044, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=432, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=2
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2284, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=364, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3100, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2312, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3700, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=624, iMonCtr=2
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2640, iMonCtr=2
Leaving CPDN_Main::Monitor...
forrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2m1e_1987_1_007424998\tmp\xaakm.namelists

Image              PC        Routine            Line        Source             
hadam3p_eu_um_6.0  0159A39A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01542CD0  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01541E9A  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01522819  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01422287  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014BE7B2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  014BF2DA  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  01239BD2  Unknown               Unknown  Unknown
hadam3p_eu_um_6.0  0157E638  Unknown               Unknown  Unknown
kernel32.dll       76D1D309  Unknown               Unknown  Unknown
ntdll.dll          76DF16C3  Unknown               Unknown  Unknown
ntdll.dll          76DF1696  Unknown               Unknown  Unknown
rrtl: severe (24): end-of-file during read, unit 9, file C:\ProgramData\BOINC\projects\climateprediction.net\hadam3p_eu_2m1e_1987_1_007424998\tmp\xaakg.namelists

Image              PC        Routine            Line        Source             
hadrm3p_eu_um_6.0  016CC52A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01674460  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  0167362A  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01652469  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015566EB  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015F2AE2  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  015F35AF  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  01399860  Unknown               Unknown  Unknown
hadrm3p_eu_um_6.0  016B0893  Unknown               Unknown  Unknown
kernel32.dll       76D1D309  Unknown               Unknown  Unknown
ntdll.dll          76DF16C3  Unknown               Unknown  Unknown
ntdll.dll          76DF1696  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2784, selfPID=2068, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_2m1e_1987_1_007424998_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Sep 2011 08:56:59 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 80,736 584,827 7.2437
13 Sep 2011 18:13:58 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 69,216 500,180 7.2264
10 Sep 2011 17:09:03 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 57,696 415,714 7.2052
06 Sep 2011 18:57:56 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 46,176 332,817 7.2076
04 Sep 2011 03:07:23 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 34,656 250,748 7.2353
01 Sep 2011 18:51:49 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 23,136 170,264 7.3593
29 Aug 2011 11:53:49 1159510 13300556 hadam3p_eu_2m1e_1987_1_007424998_0 11,616 85,549 7.3648


©2024 cpdn.org