Task 18638007

Name hadam3p_pnw_pjla_2013_1_009973119_0
Workunit 9979477
Created 29 Jun 2015, 17:29:22 UTC
Sent 30 Jun 2015, 7:36:10 UTC
Report deadline 11 Jun 2016, 12:56:10 UTC
Received 20 Aug 2015, 4:48:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1346020
Run time 3 days 11 hours 48 min 50 sec
CPU time 3 days 10 hours 22 min 39 sec
Validate state Invalid
Credit 2,759.97
Device peak FLOPS 1.85 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86
Stderr
<core_client_version>7.4.42</core_client_version>
<![CDATA[
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6344, selfPID=6064, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3216, selfPID=5992, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5936, selfPID=4484, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5844, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4336, selfPID=5988, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3000, selfPID=5256, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3632, selfPID=6024, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3496, selfPID=5576, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4904, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=852, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6244, selfPID=4668, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
12:34:53 (968): start_timer_thread(): CreateThread() failed, errno 0
12:34:54 (2752): start_timer_thread(): CreateThread() failed, errno 0
15:43:31 (3720): start_timer_thread(): CreateThread() failed, errno 0
15:43:32 (1468): start_timer_thread(): CreateThread() failed, errno 0
17:05:51 (4224): start_timer_thread(): CreateThread() failed, errno 0
17:05:53 (2708): start_timer_thread(): CreateThread() failed, errno 0
Suspended CPDN Monitor - Suspend request from BOINC...
12:01:49 (1876): start_timer_thread(): CreateThread() failed, errno 0
12:01:51 (3636): start_timer_thread(): CreateThread() failed, errno 0
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1

Model crashed: SETPOS: Unit 63 to Word Address -198 Failed with Error Code -1
Leaving CPDN_Main::Monitor...
16:26:10 (2448): called boinc_finish(0)

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_13.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_14.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_15.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_16.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_17.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_pnw_pjla_2013_1_009973119_0_18.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
05 Aug 2015 19:37:30 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 127,019 | 265,986 | 2.0941
02 Aug 2015 10:20:12 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 115,499 | 199,454 | 1.7269
28 Jul 2015 09:13:22 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 103,979 | 179,046 | 1.7219
28 Jul 2015 00:52:11 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 92,459 | 160,999 | 1.7413
25 Jul 2015 00:30:05 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 80,939 | 138,382 | 1.7097
16 Jul 2015 12:34:35 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 69,419 | 116,708 | 1.6812
16 Jul 2015 04:56:07 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 57,899 | 98,590 | 1.7028
08 Jul 2015 13:29:18 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 46,379 | 78,718 | 1.6973
07 Jul 2015 04:48:58 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 34,859 | 57,594 | 1.6522
01 Jul 2015 00:33:59 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 23,339 | 36,283 | 1.5546
30 Jun 2015 12:51:37 | 1346020 | 18638007 | hadam3p_pnw_pjla_2013_1_009973119_0 | 11,819 | 18,387 | 1.5557
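
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` and the rounding to four decimals are assumptions made for illustration, not something the page states.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
TRICKLES = [
    (127019, 265986, 2.0941),
    (115499, 199454, 1.7269),
    (11819, 18387, 1.5557),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds per model timestep (hypothetical helper).

    Rounding to 4 decimals is assumed to mirror the table's display.
    """
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Print the recomputed value next to the value shown on the page.
    print(timestep, computed, reported)
```

Because the figure is cumulative, the jump from about 1.73 to 2.09 between the last two trickles reflects extra CPU time spent over that interval rather than a change in the first rows.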

