climateprediction.net home page
Task 15867137

Task 15867137

Name hadam3p_eu_qimb_2008_1_008397411_0
Workunit 8548270
Created 26 Jun 2013, 10:25:16 UTC
Sent 26 Jun 2013, 13:07:38 UTC
Report deadline 8 Jun 2014, 18:27:38 UTC
Received 11 Jul 2013, 5:01:05 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 0 (0x00000000)
Computer ID 1218845
Run time 6 days 12 hours 36 min 22 sec
CPU time 6 days 7 hours 49 min 41 sec
Validate state Invalid
Credit 1,591.48
Device peak FLOPS 2.66 GFLOPS
Application version UK Met Office HadAM3P-HadRM3P Europe v6.09
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<stderr_txt>
06:29:26 (4388): No heartbeat from core client for 30 sec - exiting
06:29:28 (4388): No heartbeat from core client for 30 sec - exiting
06:29:29 (4388): No heartbeat from core client for 30 sec - exiting
06:29:30 (4388): No heartbeat from core client for 30 sec - exiting
06:29:31 (4388): No heartbeat from core client for 30 sec - exiting
06:29:32 (4388): No heartbeat from core client for 30 sec - exiting
06:29:33 (4388): No heartbeat from core client for 30 sec - exiting
06:29:34 (4388): No heartbeat from core client for 30 sec - exiting
06:29:35 (4388): No heartbeat from core client for 30 sec - exiting
06:29:36 (4388): No heartbeat from core client for 30 sec - exiting
06:29:37 (4388): No heartbeat from core client for 30 sec - exiting
06:29:38 (4388): No heartbeat from core client for 30 sec - exiting
06:29:40 (4388): No heartbeat from core client for 30 sec - exiting
06:29:41 (4388): No heartbeat from core client for 30 sec - exiting
06:29:42 (4388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=748, selfPID=5460, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
ColobalnWorkller :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, self=2456, iMonCtr=2
=2
Model crash detected, will try to restart...
09:53:12 (5800): No heartbeat from core client for 30 sec - exiting
09:53:13 (5800): No heartbeat from core client for 30 sec - exiting
09:53:14 (5800): No heartbeat from core client for 30 sec - exiting
09:53:15 (5800): No heartbeat from core client for 30 sec - exiting
09:53:16 (5800): No heartbeat from core client for 30 sec - exiting
09:53:17 (5800): No heartbeat from core client for 30 sec - exiting
09:53:19 (5800): No heartbeat from core client for 30 sec - exiting
09:53:20 (5800): No heartbeat from core client for 30 sec - exiting
09:53:21 (5800): No heartbeat from core client for 30 sec - exiting
09:53:22 (5800): No heartbeat from core client for 30 sec - exiting
09:53:23 (5800): No heartbeat from core client for 30 sec - exiting
09:53:24 (5800): No heartbeat from core client for 30 sec - exiting
09:53:25 (5800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Colobal Worroelrr:: : CPDN processis not running, exiting, bRetVal = 1, checkPID=0, sselfPD=3084, iMonCtr=2

Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
16:42:50 (4392): No heartbeat from core client for 30 sec - exiting
16:42:51 (4392): No heartbeat from core client for 30 sec - exiting
16:42:52 (4392): No heartbeat from core client for 30 sec - exiting
16:42:54 (4392): No heartbeat from core client for 30 sec - exiting
16:42:55 (4392): No heartbeat from core client for 30 sec - exiting
16:42:56 (4392): No heartbeat from core client for 30 sec - exiting
16:42:57 (4392): No heartbeat from core client for 30 sec - exiting
16:42:58 (4392): No heartbeat from core client for 30 sec - exiting
16:42:59 (4392): No heartbeat from core client for 30 sec - exiting
16:43:00 (4392): No heartbeat from core client for 30 sec - exiting
16:43:01 (4392): No heartbeat from core client for 30 sec - exiting
16:43:02 (4392): No heartbeat from core client for 30 sec - exiting
16:43:03 (4392): No heartbeat from core client for 30 sec - exiting
16:43:04 (4392): No heartbeat from core client for 30 sec - exiting
16:43:05 (4392): No heartbeat from core client for 30 sec - exiting
16:43:07 (4392): No heartbeat from core client for 30 sec - exiting
16:43:08 (4392): No heartbeat from core client for 30 sec - exiting
16:43:09 (4392): No heartbeat from core client for 30 sec - exiting
16:43:10 (4392): No heartbeat from core client for 30 sec - exiting
16:43:11 (4392): No heartbeat from core client for 30 sec - exiting
16:43:12 (4392): No heartbeat from core client for 30 sec - exiting
16:43:13 (4392): No heartbeat from core client for 30 sec - exiting
16:43:14 (4392): No heartbeat from core client for 30 sec - exiting
16:43:15 (4392): No heartbeat from core client for 30 sec - exiting
16:43:16 (4392): No heartbeat from core client for 30 sec - exiting
16:43:18 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2584, selfPID=3292, iMonCtr=1
Model crash detected, will try to restart...
12:19:15 (4436): No heartbeat from core client for 30 sec - exiting
12:19:16 (4436): No heartbeat from core client for 30 sec - exiting
12:19:18 (4436): No heartbeat from core client for 30 sec - exiting
12:19:19 (4436): No heartbeat from core client for 30 sec - exiting
12:19:20 (4436): No heartbeat from core client for 30 sec - exiting
12:19:21 (4436): No heartbeat from core client for 30 sec - exiting
12:19:22 (4436): No heartbeat from core client for 30 sec - exiting
12:19:23 (4436): No heartbeat from core client for 30 sec - exiting
12:19:24 (4436): No heartbeat from core client for 30 sec - exiting
12:19:25 (4436): No heartbeat from core client for 30 sec - exiting
12:19:26 (4436): No heartbeat from core client for 30 sec - exiting
12:19:27 (4436): No heartbeat from core client for 30 sec - exiting
12:19:28 (4436): No heartbeat from core client for 30 sec - exiting
12:19:30 (4436): No heartbeat from core client for 30 sec - exiting
12:19:31 (4436): No heartbeat from core client for 30 sec - exiting
12:19:32 (4436): No heartbeat from core client for 30 sec - exiting
12:19:33 (4436): No heartbeat from core client for 30 sec - exiting
12:19:34 (4436): No heartbeat from core client for 30 sec - exiting
12:19:35 (4436): No heartbeat from core client for 30 sec - exiting
12:19:36 (4436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=2
Model crash detected, will try to restart...
20:57:52 (4448): No heartbeat from core client for 30 sec - exiting
20:57:54 (4448): No heartbeat from core client for 30 sec - exiting
20:57:55 (4448): No heartbeat from core client for 30 sec - exiting
20:57:56 (4448): No heartbeat from core client for 30 sec - exiting
20:57:57 (4448): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5724, iMonCtr=2
Model crash detected, will try to restart...
04:44:36 (5724): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:53:58 (5228): No heartbeat from core client for 30 sec - exiting
10:53:59 (5228): No heartbeat from core client for 30 sec - exiting
10:54:00 (5228): No heartbeat from core client for 30 sec - exiting
10:54:01 (5228): No heartbeat from core client for 30 sec - exiting
10:54:02 (5228): No heartbeat from core client for 30 sec - exiting
10:54:03 (5228): No heartbeat from core client for 30 sec - exiting
10:54:04 (5228): No heartbeat from core client for 30 sec - exiting
10:54:05 (5228): No heartbeat from core client for 30 sec - exiting
10:54:06 (5228): No heartbeat from core client for 30 sec - exiting
10:54:07 (5228): No heartbeat from core client for 30 sec - exiting
10:54:08 (5228): No heartbeat from core client for 30 sec - exiting
10:54:09 (5228): No heartbeat from core client for 30 sec - exiting
10:54:10 (5228): No heartbeat from core client for 30 sec - exiting
10:54:11 (5228): No heartbeat from core client for 30 sec - exiting
10:54:12 (5228): No heartbeat from core client for 30 sec - exiting
10:54:13 (5228): No heartbeat from core client for 30 sec - exiting
10:54:14 (5228): No heartbeat from core client for 30 sec - exiting
10:54:15 (5228): No heartbeat from core client for 30 sec - exiting
10:54:16 (5228): No heartbeat from core client for 30 sec - exiting
10:54:17 (5228): No heartbeat from core client for 30 sec - exiting
10:54:18 (5228): No heartbeat from core client for 30 sec - exiting
10:54:19 (5228): No heartbeat from core client for 30 sec - exiting
10:54:20 (5228): No heartbeat from core client for 30 sec - exiting
10:54:21 (5228): No heartbeat from core client for 30 sec - exiting
10:54:22 (5228): No heartbeat from core client for 30 sec - exiting
10:54:23 (5228): No heartbeat from core client for 30 sec - exiting
10:54:24 (5228): No heartbeat from core client for 30 sec - exiting
10:54:25 (5228): No heartbeat from core client for 30 sec - exiting
10:54:26 (5228): No heartbeat from core client for 30 sec - exiting
10:54:27 (5228): No heartbeat from core client for 30 sec - exiting
10:54:28 (5228): No heartbeat from core client for 30 sec - exiting
10:54:29 (5228): No heartbeat from core client for 30 sec - exiting
10:54:30 (5228): No heartbeat from core client for 30 sec - exiting
10:54:31 (5228): No heartbeat from core client for 30 sec - exiting
10:54:32 (5228): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:54:33 (4604): Can't acquire lockfile (32) - waiting 35s
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3196, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
14:52:28 (5272): No heartbeat from core client for 30 sec - exiting
14:52:29 (5272): No heartbeat from core client for 30 sec - exiting
14:52:30 (5272): No heartbeat from core client for 30 sec - exiting
14:52:31 (5272): No heartbeat from core client for 30 sec - exiting
14:52:32 (5272): No heartbeat from core client for 30 sec - exiting
14:52:33 (5272): No heartbeat from core client for 30 sec - exiting
14:52:34 (5272): No heartbeat from core client for 30 sec - exiting
14:52:35 (5272): No heartbeat from core client for 30 sec - exiting
14:52:36 (5272): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=2
Model crash detected, will try to restart...
19:11:55 (6076): No heartbeat from core client for 30 sec - exiting
19:11:57 (6076): No heartbeat from core client for 30 sec - exiting
19:11:58 (6076): No heartbeat from core client for 30 sec - exiting
19:11:59 (6076): No heartbeat from core client for 30 sec - exiting
19:12:00 (6076): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
09:41:53 (5708): No heartbeat from core client for 30 sec - exiting
09:41:54 (5708): No heartbeat from core client for 30 sec - exiting
09:41:55 (5708): No heartbeat from core client for 30 sec - exiting
09:41:56 (5708): No heartbeat from core client for 30 sec - exiting
09:41:57 (5708): No heartbeat from core client for 30 sec - exiting
09:41:58 (5708): No heartbeat from core client for 30 sec - exiting
09:42:00 (5708): No heartbeat from core client for 30 sec - exiting
09:42:01 (5708): No heartbeat from core client for 30 sec - exiting
09:42:02 (5708): No heartbeat from core client for 30 sec - exiting
09:42:03 (5708): No heartbeat from core client for 30 sec - exiting
09:42:04 (5708): No heartbeat from core client for 30 sec - exiting
09:42:05 (5708): No heartbeat from core client for 30 sec - exiting
09:42:06 (5708): No heartbeat from core client for 30 sec - exiting
09:42:07 (5708): No heartbeat from core client for 30 sec - exiting
09:42:08 (5708): No heartbeat from core client for 30 sec - exiting
09:42:09 (5708): No heartbeat from core client for 30 sec - exiting
09:42:10 (5708): No heartbeat from core client for 30 sec - exiting
09:42:12 (5708): No heartbeat from core client for 30 sec - exiting
09:42:13 (5708): No heartbeat from core client for 30 sec - exiting
09:42:14 (5708): No heartbeat from core client for 30 sec - exiting
09:42:15 (5708): No heartbeat from core client for 30 sec - exiting
09:42:16 (5708): No heartbeat from core client for 30 sec - exiting
09:42:17 (5708): No heartbeat from core client for 30 sec - exiting
09:42:18 (5708): No heartbeat from core client for 30 sec - exiting
09:42:19 (5708): No heartbeat from core client for 30 sec - exiting
09:42:20 (5708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2912, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3912, selfPID=5912, iMonCtr=1
Model crash detected, will try to restart...
09:05:25 (4172): No heartbeat from core client for 30 sec - exiting
09:05:26 (4172): No heartbeat from core client for 30 sec - exiting
09:05:27 (4172): No heartbeat from core client for 30 sec - exiting
09:05:28 (4172): No heartbeat from core client for 30 sec - exiting
09:05:29 (4172): No heartbeat from core client for 30 sec - exiting
09:05:30 (4172): No heartbeat from core client for 30 sec - exiting
09:05:31 (4172): No heartbeat from core client for 30 sec - exiting
09:05:32 (4172): No heartbeat from core client for 30 sec - exiting
09:05:33 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
07:14:11 (5020): No heartbeat from core client for 30 sec - exiting
07:14:13 (5020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=2
Model crash detected, will try to restart...
12:01:35 (5356): No heartbeat from core client for 30 sec - exiting
12:01:37 (5356): No heartbeat from core client for 30 sec - exiting
12:01:38 (5356): No heartbeat from core client for 30 sec - exiting
12:01:39 (5356): No heartbeat from core client for 30 sec - exiting
12:01:40 (5356): No heartbeat from core client for 30 sec - exiting
12:01:41 (5356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4744, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5124, selfPID=5868, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5468, iMonCtr=2
Model crash detected, will try to restart...
09:09:52 (5408): No heartbeat from core client for 30 sec - exiting
09:09:53 (5408): No heartbeat from core client for 30 sec - exiting
09:09:54 (5408): No heartbeat from core client for 30 sec - exiting
09:09:55 (5408): No heartbeat from core client for 30 sec - exiting
09:09:56 (5408): No heartbeat from core client for 30 sec - exiting
09:09:57 (5408): No heartbeat from core client for 30 sec - exiting
09:09:58 (5408): No heartbeat from core client for 30 sec - exiting
09:09:59 (5408): No heartbeat from core client for 30 sec - exiting
09:10:01 (5408): No heartbeat from core client for 30 sec - exiting
09:10:02 (5408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1712, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=2
Model crash detected, will try to restart...
21:17:41 (2852): No heartbeat from core client for 30 sec - exiting
21:17:42 (2852): No heartbeat from core client for 30 sec - exiting
21:17:43 (2852): No heartbeat from core client for 30 sec - exiting
21:17:44 (2852): No heartbeat from core client for 30 sec - exiting
21:17:45 (2852): No heartbeat from core client for 30 sec - exiting
21:17:46 (2852): No heartbeat from core client for 30 sec - exiting
21:17:47 (2852): No heartbeat from core client for 30 sec - exiting
21:17:48 (2852): No heartbeat from core client for 30 sec - exiting
21:17:49 (2852): No heartbeat from core client for 30 sec - exiting
21:17:50 (2852): No heartbeat from core client for 30 sec - exiting
21:17:51 (2852): No heartbeat from core client for 30 sec - exiting
21:17:52 (2852): No heartbeat from core client for 30 sec - exiting
21:17:53 (2852): No heartbeat from core client for 30 sec - exiting
21:17:54 (2852): No heartbeat from core client for 30 sec - exiting
21:17:55 (2852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5992, selfPID=2008, iMonCtr=1
Model crash detected, will try to restart...
08:47:24 (5900): No heartbeat from core client for 30 sec - exiting
08:47:25 (5900): No heartbeat from core client for 30 sec - exiting
08:47:26 (5900): No heartbeat from core client for 30 sec - exiting
08:47:27 (5900): No heartbeat from core client for 30 sec - exiting
08:47:28 (5900): No heartbeat from core client for 30 sec - exiting
08:47:29 (5900): No heartbeat from core client for 30 sec - exiting
08:47:30 (5900): No heartbeat from core client for 30 sec - exiting
08:47:31 (5900): No heartbeat from core client for 30 sec - exiting
08:47:33 (5900): No heartbeat from core client for 30 sec - exiting
08:47:34 (5900): No heartbeat from core client for 30 sec - exiting
08:47:35 (5900): No heartbeat from core client for 30 sec - exiting
08:47:36 (5900): No heartbeat from core client for 30 sec - exiting
08:47:37 (5900): No heartbeat from core client for 30 sec - exiting
08:47:38 (5900): No heartbeat from core client for 30 sec - exiting
08:47:39 (5900): No heartbeat from core client for 30 sec - exiting
08:47:40 (5900): No heartbeat from core client for 30 sec - exiting
08:47:41 (5900): No heartbeat from core client for 30 sec - exiting
08:47:42 (5900): No heartbeat from core client for 30 sec - exiting
08:47:43 (5900): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3056, selfPID=4032, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5956, selfPID=5356, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Called boinc_finish

</stderr_txt><message>
upload failure: <file_xfer_error>
  <file_name>hadam3p_eu_qimb_2008_1_008397411_0_9.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_qimb_2008_1_008397411_0_10.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_qimb_2008_1_008397411_0_11.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>
<file_xfer_error>
  <file_name>hadam3p_eu_qimb_2008_1_008397411_0_12.zip</file_name>
  <error_code>-161</error_code>
</file_xfer_error>

</message>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
09 Jul 2013 15:24:20 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 92,256 495,177 5.3674
07 Jul 2013 09:56:19 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 80,736 431,859 5.3490
06 Jul 2013 05:42:26 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 69,216 367,354 5.3074
04 Jul 2013 14:21:44 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 57,696 306,733 5.3164
03 Jul 2013 02:33:59 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 46,176 247,470 5.3593
02 Jul 2013 10:54:41 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 34,656 182,359 5.2620
02 Jul 2013 10:07:18 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 23,136 117,778 5.0907
28 Jun 2013 03:46:18 1218845 15867137 hadam3p_eu_qimb_2008_1_008397411_0 11,616 62,725 5.3999


©2024 cpdn.org