climateprediction.net (CPDN) home page
Task 19217206

Name hadam3p_anz_k7qo_201212_12_306_010271401_0
Workunit 10271401
Created 25 Jan 2016, 14:54:27 UTC
Sent 26 Jan 2016, 9:51:24 UTC
Report deadline 7 Jan 2017, 15:11:24 UTC
Received 19 Feb 2016, 15:04:17 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1149985
Run time 4 days 5 hours 4 min 2 sec
CPU time 3 days 21 hours 39 min 30 sec
Validate state Invalid
Credit 2,000.18
Device peak FLOPS 2.32 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 (windows_intelx86)
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3796, iMonCtr=2
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2764, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2704, selfPID=4448, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3600, iMonCtr=2
GGlobal Worker:: DN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=2
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4256, selfPID=1484, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1380, selfPID=4264, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2072, selfPID=3288, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3276, selfPID=3116, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3216, selfPID=4648, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2408, selfPID=2280, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4708, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4484, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2628, selfPID=2884, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3572, selfPID=3576, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=3068, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2132, selfPID=2132, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2132, selfPID=2540, iMonCtr=1
Model crash detected, will try to restart...

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
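The upload-failure message above repeats one small XML fragment per output file. As a minimal sketch (not part of the BOINC client or the CPDN site), the fragments could be pulled out with a regular expression for a quick summary. The `MESSAGE` excerpt, the `XFER_ERROR` pattern, and the `parse_xfer_errors` helper are hypothetical names introduced here for illustration; only the first two entries of the message are reproduced.

```python
import re

# Excerpt of the <message> block above (first two of the eight entries).
MESSAGE = """\
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_5.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_k7qo_201212_12_306_010271401_0_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
"""

# Hypothetical pattern: one match per <file_xfer_error> fragment.
XFER_ERROR = re.compile(
    r"<file_xfer_error>\s*"
    r"<file_name>(?P<name>[^<]+)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+)</error_code>\s*"
    r"</file_xfer_error>"
)


def parse_xfer_errors(text):
    """Return a list of (file_name, error_code) tuples found in text."""
    return [(m.group("name"), int(m.group("code")))
            for m in XFER_ERROR.finditer(text)]


errors = parse_xfer_errors(MESSAGE)
for name, code in errors:
    print(f"{name}: error {code}")
```

A regular expression is used rather than an XML parser because the message is a run of sibling fragments preceded by the free text "upload failure:", so it is not a single well-formed XML document.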
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                                 Timestep  CPU Time (sec)  Average (sec/TS)
16 Feb 2016 17:33:46  1149985  19217206   hadam3p_anz_k7qo_201212_12_306_010271401_0  46,379    286,575         6.1790
11 Feb 2016 10:56:40  1149985  19217206   hadam3p_anz_k7qo_201212_12_306_010271401_0  34,859    216,787         6.2190
05 Feb 2016 11:51:07  1149985  19217206   hadam3p_anz_k7qo_201212_12_306_010271401_0  23,339    143,895         6.1654
01 Feb 2016 15:38:22  1149985  19217206   hadam3p_anz_k7qo_201212_12_306_010271401_0  11,819    72,329          6.1197
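The page does not say how the Average (sec/TS) column is derived. The figures in the trickle table are consistent with it being CPU time divided by timestep, rounded to four decimal places. The sketch below checks that assumption against the four rows; `TRICKLES` and `sec_per_timestep` are hypothetical names used only for this illustration.

```python
# Rows from the trickle table: (timestep, CPU time in seconds, reported average).
TRICKLES = [
    (46379, 286575, 6.1790),
    (34859, 216787, 6.2190),
    (23339, 143895, 6.1654),
    (11819, 72329, 6.1197),
]


def sec_per_timestep(timestep, cpu_seconds):
    """Assumed relationship: average = cumulative CPU seconds / timesteps completed."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = round(sec_per_timestep(timestep, cpu_seconds), 4)
    print(f"timestep {timestep}: computed {computed:.4f}, reported {reported:.4f}")
```

If the assumption holds, each computed value matches the reported one, which suggests the column is a running average over the whole task rather than a rate for the most recent interval.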

