climateprediction.net home page
Task 12248891

Task 12248891

Name hadam3p_eu_xel3_1998_1_006966511_0
Workunit 7169827
Created 23 Nov 2010, 11:34:27 UTC
Sent 30 Jan 2011, 2:43:48 UTC
Report deadline 12 Jan 2012, 8:03:48 UTC
Received 21 Apr 2011, 8:10:57 UTC
Server state Over
Outcome Didn't need
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1102379
Run time 3 days 15 hours 14 min 4 sec
CPU time 3 days 9 hours 29 min 8 sec
Validate state Invalid
Credit 1,988.94
Device peak FLOPS 2.92 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.08
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
15:30:18 (2544): No heartbeat from core client for 30 sec - exiting
15:30:19 (2544): No heartbeat from core client for 30 sec - exiting
15:30:20 (2544): No heartbeat from core client for 30 sec - exiting
15:30:21 (2544): No heartbeat from core client for 30 sec - exiting
15:30:22 (2544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:57:55 (3320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4856, selfPID=4856, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1860, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
05:17:18 (6072): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3120, selfPID=3120, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4500, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4568, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4388, selfPID=4388, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4184, selfPID=3844, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...

zip error: Could not create output file (was replacing the original zip file)
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3876, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4492, selfPID=3912, iMonCtr=1
Model crash detected, will try to restart...
18:03:48 (3940): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:12:19 (5800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3268, selfPID=3268, iMonCtr=2
18:13:50 (4576): No heartbeat from core client for 30 sec - exiting
18:13:51 (4576): No heartbeat from core client for 30 sec - exiting
18:13:52 (4576): No heartbeat from core client for 30 sec - exiting
18:13:53 (4576): No heartbeat from core client for 30 sec - exiting
18:13:54 (4576): No heartbeat from core client for 30 sec - exiting
18:13:55 (4576): No heartbeat from core client for 30 sec - exiting
18:13:56 (4576): No heartbeat from core client for 30 sec - exiting
18:13:57 (4576): No heartbeat from core client for 30 sec - exiting
18:13:58 (4576): No heartbeat from core client for 30 sec - exiting
18:13:59 (4576): No heartbeat from core client for 30 sec - exiting
18:14:00 (4576): No heartbeat from core client for 30 sec - exiting
18:14:01 (4576): No heartbeat from core client for 30 sec - exiting
18:14:02 (4576): No heartbeat from core client for 30 sec - exiting
18:14:03 (4576): No heartbeat from core client for 30 sec - exiting
18:14:04 (4576): No heartbeat from core client for 30 sec - exiting
18:14:05 (4576): No heartbeat from core client for 30 sec - exiting
18:14:06 (4576): No heartbeat from core client for 30 sec - exiting
18:14:07 (4576): No heartbeat from core client for 30 sec - exiting
18:14:08 (4576): No heartbeat from core client for 30 sec - exiting
18:14:09 (4576): No heartbeat from core client for 30 sec - exiting
18:14:10 (4576): No heartbeat from core client for 30 sec - exiting
18:14:11 (4576): No heartbeat from core client for 30 sec - exiting
18:14:12 (4576): No heartbeat from core client for 30 sec - exiting
18:14:13 (4576): No heartbeat from core client for 30 sec - exiting
18:14:14 (4576): No heartbeat from core client for 30 sec - exiting
18:14:15 (4576): No heartbeat from core client for 30 sec - exiting
18:14:16 (4576): No heartbeat from core client for 30 sec - exiting
18:14:17 (4576): No heartbeat from core client for 30 sec - exiting
18:14:18 (4576): No heartbeat from core client for 30 sec - exiting
18:14:19 (4576): No heartbeat from core client for 30 sec - exiting
18:14:20 (4576): No heartbeat from core client for 30 sec - exiting
18:14:21 (4576): No heartbeat from core client for 30 sec - exiting
18:14:22 (4576): No heartbeat from core client for 30 sec - exiting
18:14:23 (4576): No heartbeat from core client for 30 sec - exiting
18:14:24 (4576): No heartbeat from core client for 30 sec - exiting
18:14:25 (4576): No heartbeat from core client for 30 sec - exiting
18:14:26 (4576): No heartbeat from core client for 30 sec - exiting
18:14:27 (4576): No heartbeat from core client for 30 sec - exiting
18:14:28 (4576): No heartbeat from core client for 30 sec - exiting
18:14:29 (4576): No heartbeat from core client for 30 sec - exiting
18:14:30 (4576): No heartbeat from core client for 30 sec - exiting
18:14:31 (4576): No heartbeat from core client for 30 sec - exiting
18:14:32 (4576): No heartbeat from core client for 30 sec - exiting
18:14:33 (4576): No heartbeat from core client for 30 sec - exiting
18:14:34 (4576): No heartbeat from core client for 30 sec - exiting
18:14:35 (4576): No heartbeat from core client for 30 sec - exiting
18:14:36 (4576): No heartbeat from core client for 30 sec - exiting
18:14:37 (4576): No heartbeat from core client for 30 sec - exiting
18:14:38 (4576): No heartbeat from core client for 30 sec - exiting
18:14:39 (4576): No heartbeat from core client for 30 sec - exiting
18:14:40 (4576): No heartbeat from core client for 30 sec - exiting
18:14:41 (4576): No heartbeat from core client for 30 sec - exiting
18:14:42 (4576): No heartbeat from core client for 30 sec - exiting
18:14:43 (4576): No heartbeat from core client for 30 sec - exiting
18:14:44 (4576): No heartbeat from core client for 30 sec - exiting
18:14:45 (4576): No heartbeat from core client for 30 sec - exiting
18:14:46 (4576): No heartbeat from core client for 30 sec - exiting
18:14:47 (4576): No heartbeat from core client for 30 sec - exiting
18:14:48 (4576): No heartbeat from core client for 30 sec - exiting
18:14:49 (4576): No heartbeat from core client for 30 sec - exiting
18:14:50 (4576): No heartbeat from core client for 30 sec - exiting
18:14:51 (4576): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4812, selfPID=4812, iMonCtr=2
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4376, selfPID=4020, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4312, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4388, selfPID=4008, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2052, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5040, selfPID=3288, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No Process Handle
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4744, selfPID=4744, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4828, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1288, selfPID=2896, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4672, selfPID=4040, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=4052, iMonCtr=1
Model crash detected, will try to restart...
10:05:09 (4060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:05:10 (4060): No heartbeat from core client for 30 sec - exiting
10:10:04 (5084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4680, selfPID=4680, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=2
Model crash detected, will try to restart...
10:19:12 (4060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:20:12 (3304): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1768, selfPID=3440, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4552, selfPID=4552, iMonCtr=2
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5824, iMonCtr=2
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=1
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2600, selfPID=5808, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
12:58:19 (5808): called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
  <file_name>hadam3p_eu_xel3_1998_1_006966511_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_xel3_1998_1_006966511_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Apr 2011 08:11:39 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 115,296 290,778 2.5220
10 Apr 2011 14:49:59 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 103,776 261,770 2.5225
09 Apr 2011 02:35:53 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 92,261 232,470 2.5197
09 Apr 2011 02:35:53 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 92,256 232,101 2.5158
01 Apr 2011 07:54:20 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 80,736 202,549 2.5088
28 Mar 2011 09:53:18 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 69,216 172,926 2.4984
24 Mar 2011 15:45:26 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 57,696 143,616 2.4892
22 Mar 2011 13:44:34 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 46,176 114,959 2.4896
20 Mar 2011 16:09:01 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 34,658 87,571 2.5267
20 Mar 2011 15:58:42 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 34,656 87,237 2.5172
19 Mar 2011 04:04:28 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 23,136 58,735 2.5387
12 Mar 2011 01:27:04 1102379 12248891 hadam3p_eu_xel3_1998_1_006966511_0 11,616 29,932 2.5768


©2024 climateprediction.net