Task 17593139

Name hadam3p_anz_m5og_2012_1_009306886_0
Workunit 9391074
Created 17 Dec 2014, 19:50:56 UTC
Sent 21 Dec 2014, 23:38:59 UTC
Report deadline 4 Dec 2015, 4:58:59 UTC
Received 10 Jan 2015, 13:48:54 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1143523
Run time 2 days 16 hours 24 min 43 sec
CPU time 2 days 12 hours 17 min
Validate state Invalid
Credit 1,006.54
Device peak FLOPS 2.89 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Australia New Zealand v6.10 windows_intelx86
Stderr
<core_client_version>7.4.36</core_client_version>
<![CDATA[
<stderr_txt>
22:19:14 (5356): No heartbeat from core client for 30 sec - exiting
22:19:15 (5356): No heartbeat from core client for 30 sec - exiting
22:19:16 (5356): No heartbeat from core client for 30 sec - exiting
22:19:17 (5356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=2
13:00:12 (6052): No heartbeat from core client for 30 sec - exiting
13:00:13 (6052): No heartbeat from core client for 30 sec - exiting
13:00:14 (6052): No heartbeat from core client for 30 sec - exiting
13:00:15 (6052): No heartbeat from core client for 30 sec - exiting
13:00:16 (6052): No heartbeat from core client for 30 sec - exiting
13:00:17 (6052): No heartbeat from core client for 30 sec - exiting
13:00:18 (6052): No heartbeat from core client for 30 sec - exiting
13:00:19 (6052): No heartbeat from core client for 30 sec - exiting
13:00:20 (6052): No heartbeat from core client for 30 sec - exiting
13:00:21 (6052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:42:22 (5788): No heartbeat from core client for 30 sec - exiting
19:42:23 (5788): No heartbeat from core client for 30 sec - exiting
19:42:24 (5788): No heartbeat from core client for 30 sec - exiting
19:42:25 (5788): No heartbeat from core client for 30 sec - exiting
19:42:26 (5788): No heartbeat from core client for 30 sec - exiting
19:42:27 (5788): No heartbeat from core client for 30 sec - exiting
19:42:28 (5788): No heartbeat from core client for 30 sec - exiting
19:42:29 (5788): No heartbeat from core client for 30 sec - exiting
19:42:30 (5788): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:49:41 (4628): No heartbeat from core client for 30 sec - exiting
13:49:42 (4628): No heartbeat from core client for 30 sec - exiting
13:49:43 (4628): No heartbeat from core client for 30 sec - exiting
13:49:44 (4628): No heartbeat from core client for 30 sec - exiting
13:49:45 (4628): No heartbeat from core client for 30 sec - exiting
13:49:46 (4628): No heartbeat from core client for 30 sec - exiting
13:49:47 (4628): No heartbeat from core client for 30 sec - exiting
13:49:48 (4628): No heartbeat from core client for 30 sec - exiting
13:49:49 (4628): No heartbeat from core client for 30 sec - exiting
13:49:50 (4628): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN proces17:58:42 (4596): No heartbeat from core client for 30 sec - exiting
17:58:43 (4596): No heartbeat from core client for 30 sec - exiting
17:58:44 (4596): No heartbeat from core client for 30 sec - exiting
17:58:45 (4596): No heartbeat from core client for 30 sec - exiting
17:58:46 (4596): No heartbeat from core client for 30 sec - exiting
17:58:47 (4596): No heartbeat from core client for 30 sec - exiting
17:58:48 (4596): No heartbeat from core client for 30 sec - exiting
17:58:49 (4596): No heartbeat from core client for 30 sec - exiting
17:58:50 (4596): No heartbeat from core client for 30 sec - exiting
17:58:51 (4596): No heartbeat from core client for 30 sec - exiting
17:58:52 (4596): No heartbeat from core client for 30 sec - exiting
17:58:53 (4596): No heartbeat from core client for 30 sec - exiting
17:58:54 (4596): No heartbeat from core client for 30 sec - exiting
17:58:55 (4596): No heartbeat from core client for 30 sec - exiting
17:58:56 (4596): No heartbeat from core client for 30 sec - exiting
17:58:57 (4596): No heartbeat from core client for 30 sec - exiting
17:58:58 (4596): No heartbeat from core client for 30 sec - exiting
17:58:59 (4596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:26:42 (5696): No heartbeat from core client for 30 sec - exiting
22:26:43 (5696): No heartbeat from core client for 30 sec - exiting
22:26:44 (5696): No heartbeat from core client for 30 sec - exiting
22:26:45 (5696): No heartbeat from core client for 30 sec - exiting
22:26:46 (5696): No heartbeat from core client for 30 sec - exiting
22:26:47 (5696): No heartbeat from core client for 30 sec - exiting
22:26:48 (5696): No heartbeat from core client for 30 sec - exiting
22:26:49 (5696): No heartbeat from core client for 30 sec - exiting
22:26:50 (5696): No heartbeat from core client for 30 sec - exiting
22:26:51 (5696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5744, selfPID=4348, iMonCtr=1
Model crash detected, will try to restart...
13:01:03 (5384): No heartbeat from core client for 30 sec - exiting
13:01:04 (5384): No heartbeat from core client for 30 sec - exiting
13:01:05 (5384): No heartbeat from core client for 30 sec - exiting
13:01:06 (5384): No heartbeat from core client for 30 sec - exiting
13:01:07 (5384): No heartbeat from core client for 30 sec - exiting
13:01:08 (5384): No heartbeat from core client for 30 sec - exiting
13:01:09 (5384): No heartbeat from core client for 30 sec - exiting
13:01:10 (5384): No heartbeat from core client for 30 sec - exiting
13:01:11 (5384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=2
Model crash detected, will try to restart...
20:40:22 (4232): No heartbeat from core client for 30 sec - exiting
20:40:23 (4232): No heartbeat from core client for 30 sec - exiting
20:40:24 (4232): No heartbeat from core client for 30 sec - exiting
20:40:25 (4232): No heartbeat from core client for 30 sec - exiting
20:40:26 (4232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:33:28 (5220): No heartbeat from core client for 30 sec - exiting
17:33:29 (5220): No heartbeat from core client for 30 sec - exiting
17:33:30 (5220): No heartbeat from core client for 30 sec - exiting
17:33:31 (5220): No heartbeat from core client for 30 sec - exiting
17:33:32 (5220): No heartbeat from core client for 30 sec - exiting
17:33:33 (5220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:03:03 (5440): No heartbeat from core client for 30 sec - exiting
18:03:04 (5440): No heartbeat from core client for 30 sec - exiting
18:03:05 (5440): No heartbeat from core client for 30 sec - exiting
18:03:06 (5440): No heartbeat from core client for 30 sec - exiting
18:03:07 (5440): No heartbeat from core client for 30 sec - exiting
18:03:08 (5440): No heartbeat from core client for 30 sec - exiting
18:03:09 (5440): No heartbeat from core client for 30 sec - exiting
18:03:10 (5440): No heartbeat from core client for 30 sec - exiting
18:03:11 (5440): No heartbeat from core client for 30 sec - exiting
18:03:12 (5440): No heartbeat from core client for 30 sec - exiting
18:03:13 (5440): No heartbeat from core client for 30 sec - exiting
18:03:14 (5440): No heartbeat from core client for 30 sec - exiting
18:03:15 (5440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:26:14 (5652): No heartbeat from core client for 30 sec - exiting
20:26:15 (5652): No heartbeat from core client for 30 sec - exiting
20:26:16 (5652): No heartbeat from core client for 30 sec - exiting
20:26:17 (5652): No heartbeat from core client for 30 sec - exiting
20:26:18 (5652): No heartbeat from core client for 30 sec - exiting
20:26:19 (5652): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:32:47 (4204): No heartbeat from core client for 30 sec - exiting
15:32:48 (4204): No heartbeat from core client for 30 sec - exiting
15:32:49 (4204): No heartbeat from core client for 30 sec - exiting
15:32:50 (4204): No heartbeat from core client for 30 sec - exiting
15:32:51 (4204): No heartbeat from core client for 30 sec - exiting
15:32:52 (4204): No heartbeat from core client for 30 sec - exiting
15:32:53 (4204): No heartbeat from core client for 30 sec - exiting
15:32:54 (4204): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:40:56 (5524): No heartbeat from core client for 30 sec - exiting
17:40:57 (5524): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:34:53 (4372): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6092, selfPID=6092, iMonCtr=2
16:26:06 (5064): No heartbeat from core client for 30 sec - exiting
16:26:07 (5064): No heartbeat from core client for 30 sec - exiting
16:26:08 (5064): No heartbeat from core client for 30 sec - exiting
16:26:09 (5064): No heartbeat from core client for 30 sec - exiting
16:26:10 (5064): No heartbeat from core client for 30 sec - exiting
16:26:11 (5064): No heartbeat from core client for 30 sec - exiting
16:26:12 (5064): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5196, selfPID=5468, iMonCtr=1
Model crash detected, will try to restart...
17:38:35 (4556): No heartbeat from core client for 30 sec - exiting
17:38:36 (4556): No heartbeat from core client for 30 sec - exiting
17:38:37 (4556): No heartbeat from core client for 30 sec - exiting
17:38:38 (4556): No heartbeat from core client for 30 sec - exiting
17:38:39 (4556): No heartbeat from core client for 30 sec - exiting
17:38:40 (4556): No heartbeat from core client for 30 sec - exiting
17:38:41 (4556): No heartbeat from core client for 30 sec - exiting
17:38:42 (4556): No heartbeat from core client for 30 sec - exiting
17:38:43 (4556): No heartbeat from core client for 30 sec - exiting
17:38:44 (4556): No heartbeat from core client for 30 sec - exiting
17:38:45 (4556): No heartbeat from core client for 30 sec - exiting
17:38:46 (4556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5304, iMonCtr=2
22:03:37 (5532): No heartbeat from core client for 30 sec - exiting
22:03:38 (5532): No heartbeat from core client for 30 sec - exiting
22:03:39 (5532): No heartbeat from core client for 30 sec - exiting
22:03:40 (5532): No heartbeat from core client for 30 sec - exiting
22:03:41 (5532): No heartbeat from core client for 30 sec - exiting
22:03:42 (5532): No heartbeat from core client for 30 sec - exiting
22:03:43 (5532): No heartbeat from core client for 30 sec - exiting
22:03:44 (5532): No heartbeat from core client for 30 sec - exiting
22:03:45 (5532): No heartbeat from core client for 30 sec - exiting
22:03:46 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4276, selfPID=808, iMonCtr=1
Model crash detected, will try to restart...
11:41:24 (5216): No heartbeat from core client for 30 sec - exiting
11:41:25 (5216): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5200, selfPID=5500, iMonCtr=1
Model crash detected, will try to restart...
13:01:15 (5532): No heartbeat from core client for 30 sec - exiting
13:01:16 (5532): No heartbeat from core client for 30 sec - exiting
13:01:17 (5532): No heartbeat from core client for 30 sec - exiting
13:01:18 (5532): No heartbeat from core client for 30 sec - exiting
13:01:19 (5532): No heartbeat from core client for 30 sec - exiting
13:01:20 (5532): No heartbeat from core client for 30 sec - exiting
13:01:21 (5532): No heartbeat from core client for 30 sec - exiting
13:01:22 (5532): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:26:14 (5456): No heartbeat from core client for 30 sec - exiting
18:26:15 (5456): No heartbeat from core client for 30 sec - exiting
18:26:16 (5456): No heartbeat from core client for 30 sec - exiting
18:26:17 (5456): No heartbeat from core client for 30 sec - exiting
18:26:18 (5456): No heartbeat from core client for 30 sec - exiting
18:26:19 (5456): No heartbeat from core client for 30 sec - exiting
18:26:20 (5456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5156, selfPID=5588, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1004, selfPID=5724, iMonCtr=1
Model crash detected, will try to restart...
20:07:30 (3096): No heartbeat from core client for 30 sec - exiting
20:07:31 (3096): No heartbeat from core client for 30 sec - exiting
20:07:32 (3096): No heartbeat from core client for 30 sec - exiting
20:07:33 (3096): No heartbeat from core client for 30 sec - exiting
20:07:34 (3096): No heartbeat from core client for 30 sec - exiting
20:07:35 (3096): No heartbeat from core client for 30 sec - exiting
20:07:36 (3096): No heartbeat from core client for 30 sec - exiting
20:07:37 (3096): No heartbeat from core client for 30 sec - exiting
20:07:38 (3096): No heartbeat from core client for 30 sec - exiting
20:07:39 (3096): No heartbeat from core client for 30 sec - exiting
20:07:40 (3096): No heartbeat from core client for 30 sec - exiting
20:07:41 (3096): No heartbeat from core client for 30 sec - exiting
20:07:42 (3096): No heartbeat from core client for 30 sec - exiting
20:07:43 (3096): No heartbeat from core client for 30 sec - exiting
20:07:44 (3096): No heartbeat from core client for 30 sec - exiting
20:07:45 (3096): No heartbeat from core client for 30 sec - exiting
20:07:46 (3096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4072, iMonCtr=2
Model crash detected, will try to restart...
20:43:11 (5016): No heartbeat from core client for 30 sec - exiting
20:43:12 (5016): No heartbeat from core client for 30 sec - exiting
20:43:13 (5016): No heartbeat from core client for 30 sec - exiting
20:43:14 (5016): No heartbeat from core client for 30 sec - exiting
20:43:15 (5016): No heartbeat from core client for 30 sec - exiting
20:43:16 (5016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:36:41 (1472): No heartbeat from core client for 30 sec - exiting
21:36:42 (1472): No heartbeat from core client for 30 sec - exiting
21:36:43 (1472): No heartbeat from core client for 30 sec - exiting
21:36:44 (1472): No heartbeat from core client for 30 sec - exiting
21:36:45 (1472): No heartbeat from core client for 30 sec - exiting
21:36:46 (1472): No heartbeat from core client for 30 sec - exiting
21:36:47 (1472): No heartbeat from core client for 30 sec - exiting
21:36:48 (1472): No heartbeat from core client for 30 sec - exiting
21:36:49 (1472): No heartbeat from core client for 30 sec - exiting
21:36:50 (1472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5768, iMonCtr=2
Model crash detected, will try to restart...
21:31:35 (5740): No heartbeat from core client for 30 sec - exiting
21:31:36 (5740): No heartbeat from core client for 30 sec - exiting
21:31:37 (5740): No heartbeat from core client for 30 sec - exiting
21:31:38 (5740): No heartbeat from core client for 30 sec - exiting
21:31:39 (5740): No heartbeat from core client for 30 sec - exiting
21:31:40 (5740): No heartbeat from core client for 30 sec - exiting
21:31:41 (5740): No heartbeat from core client for 30 sec - exiting
21:31:42 (5740): No heartbeat from core client for 30 sec - exiting
21:31:43 (5740): No heartbeat from core client for 30 sec - exiting
21:31:44 (5740): No heartbeat from core client for 30 sec - exiting
21:31:45 (5740): No heartbeat from core client for 30 sec - exiting
21:31:46 (5740): No heartbeat from core client for 30 sec - exiting
21:31:47 (5740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: READHIST: End of file in READ from history file for namelist NLIHISTO                                                                                                                                                                                           tmp/xaakm.pipe_dummy                                                            2048    
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3944, selfPID=5136, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt>
<message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_3.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_4.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_5.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_6.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_7.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_8.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_9.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_10.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_11.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_anz_m5og_2012_1_009306886_0_12.zip</file_name>
  <error_code>-161 (not found)</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jan 2015 01:01:17 1143523 17593139 hadam3p_anz_m5og_2012_1_009306886_0 23,339 166,973 7.1542
31 Dec 2014 00:06:27 1143523 17593139 hadam3p_anz_m5og_2012_1_009306886_0 11,819 84,933 7.1861
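
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the two trickle rows above. The function name `avg_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return average CPU seconds per model timestep (assumed definition)."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


# Rows copied from the trickle table: (timestep, CPU time in seconds)
trickles = [(23339, 166973), (11819, 84933)]

for timestep, cpu_time_sec in trickles:
    average = avg_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>6} timesteps: {average:.4f} sec/TS")
```

Both rows reproduce the reported averages to four decimal places, which suggests the column is a running average over the whole task rather than a per-interval rate.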

