climateprediction.net home page
Task 16287187

Task 16287187

Name hadam3p_pnw_ue24_2001_1_008511272_1
Workunit 8661079
Created 7 Feb 2014, 13:32:40 UTC
Sent 7 Feb 2014, 13:35:05 UTC
Report deadline 20 Jan 2015, 18:55:05 UTC
Received 7 Mar 2014, 18:43:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1257976
Run time 3 days 1 hours 34 min 34 sec
CPU time 2 days 7 hours 30 min 28 sec
Validate state Invalid
Credit 1,258.08
Device peak FLOPS 2.27 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.22
windows_intelx86
Stderr
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3540, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3796, selfPID=3664, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3608, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=120, selfPID=3384, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=168, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=404, selfPID=3464, iMonCtr=1
Model crash detected, will try to restart...
22:13:53 (3392): No heartbeat from client for 30 sec - exiting
22:13:53 (3392): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3136, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
19:15:22 (3596): No heartbeat from client for 30 sec - exiting
19:15:22 (3596): timer handler: client dead, exiting
19:15:23 (3596): No heartbeat from client for 30 sec - exiting
19:15:23 (3596): timer handler: client dead, exiting
19:15:24 (3596): No heartbeat from client for 30 sec - exiting
19:15:24 (3596): timer handler: client dead, exiting
19:15:25 (3596): No heartbeat from client for 30 sec - exiting
19:15:25 (3596): timer handler: client dead, exiting
19:15:26 (3596): No heartbeat from client for 30 sec - exiting
19:15:26 (3596): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2740, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
11:56:19 (3552): No heartbeat from client for 30 sec - exiting
11:56:19 (3552): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4004, selfPID=3472, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3880, selfPID=3368, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3960, selfPID=3416, iMonCtr=1
Model crash detected, will try to restart...
19:26:05 (3476): No heartbeat from client for 30 sec - exiting
19:26:05 (3476): timer handler: client dead, exiting
19:26:06 (3476): No heartbeat from client for 30 sec - exiting
19:26:06 (3476): timer handler: client dead, exiting
19:26:08 (3476): No heartbeat from client for 30 sec - exiting
19:26:08 (3476): timer handler: client dead, exiting
19:26:09 (3476): No heartbeat from client for 30 sec - exiting
19:26:09 (3476): timer handler: client dead, exiting
19:26:10 (3476): No heartbeat from client for 30 sec - exiting
19:26:10 (3476): timer handler: client dead, exiting
19:26:11 (3476): No heartbeat from client for 30 sec - exiting
19:26:11 (3476): timer handler: client dead, exiting
19:26:12 (3476): No heartbeat from client for 30 sec - exiting
19:26:12 (3476): timer handler: client dead, exiting
19:26:13 (3476): No heartbeat from client for 30 sec - exiting
19:26:13 (3476): timer handler: client dead, exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2848, selfPID=3288, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1720, selfPID=3568, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3352, selfPID=3624, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3208, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:34:09 (3208): called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_6.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_7.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_8.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_ue24_2001_1_008511272_1_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Mar 2014 12:24:09 1257976 16287187 hadam3p_pnw_ue24_2001_1_008511272_1 57,899 173,723 3.0004
25 Feb 2014 16:46:05 1257976 16287187 hadam3p_pnw_ue24_2001_1_008511272_1 46,379 137,966 2.9748
21 Feb 2014 17:29:45 1257976 16287187 hadam3p_pnw_ue24_2001_1_008511272_1 34,859 101,988 2.9257
19 Feb 2014 18:34:18 1257976 16287187 hadam3p_pnw_ue24_2001_1_008511272_1 23,339 68,323 2.9274
12 Feb 2014 14:12:37 1257976 16287187 hadam3p_pnw_ue24_2001_1_008511272_1 11,819 34,715 2.9372


©2024 cpdn.org