Name | hadam3p_eu_qimb_2008_1_008397411_0 |
Workunit | 8548270 |
Created | 26 Jun 2013, 10:25:16 UTC |
Sent | 26 Jun 2013, 13:07:38 UTC |
Report deadline | 8 Jun 2014, 18:27:38 UTC |
Received | 11 Jul 2013, 5:01:05 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1218845 |
Run time | 6 days 12 hours 36 min 22 sec |
CPU time | 6 days 7 hours 49 min 41 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 2.66 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <stderr_txt> 06:29:26 (4388): No heartbeat from core client for 30 sec - exiting 06:29:28 (4388): No heartbeat from core client for 30 sec - exiting 06:29:29 (4388): No heartbeat from core client for 30 sec - exiting 06:29:30 (4388): No heartbeat from core client for 30 sec - exiting 06:29:31 (4388): No heartbeat from core client for 30 sec - exiting 06:29:32 (4388): No heartbeat from core client for 30 sec - exiting 06:29:33 (4388): No heartbeat from core client for 30 sec - exiting 06:29:34 (4388): No heartbeat from core client for 30 sec - exiting 06:29:35 (4388): No heartbeat from core client for 30 sec - exiting 06:29:36 (4388): No heartbeat from core client for 30 sec - exiting 06:29:37 (4388): No heartbeat from core client for 30 sec - exiting 06:29:38 (4388): No heartbeat from core client for 30 sec - exiting 06:29:40 (4388): No heartbeat from core client for 30 sec - exiting 06:29:41 (4388): No heartbeat from core client for 30 sec - exiting 06:29:42 (4388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=748, selfPID=5460, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... ColobalnWorkller :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, self=2456, iMonCtr=2 =2 Model crash detected, will try to restart... 09:53:12 (5800): No heartbeat from core client for 30 sec - exiting 09:53:13 (5800): No heartbeat from core client for 30 sec - exiting 09:53:14 (5800): No heartbeat from core client for 30 sec - exiting 09:53:15 (5800): No heartbeat from core client for 30 sec - exiting 09:53:16 (5800): No heartbeat from core client for 30 sec - exiting 09:53:17 (5800): No heartbeat from core client for 30 sec - exiting 09:53:19 (5800): No heartbeat from core client for 30 sec - exiting 09:53:20 (5800): No heartbeat from core client for 30 sec - exiting 09:53:21 (5800): No heartbeat from core client for 30 sec - exiting 09:53:22 (5800): No heartbeat from core client for 30 sec - exiting 09:53:23 (5800): No heartbeat from core client for 30 sec - exiting 09:53:24 (5800): No heartbeat from core client for 30 sec - exiting 09:53:25 (5800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Colobal Worroelrr:: : CPDN processis not running, exiting, bRetVal = 1, checkPID=0, sselfPD=3084, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:42:50 (4392): No heartbeat from core client for 30 sec - exiting 16:42:51 (4392): No heartbeat from core client for 30 sec - exiting 16:42:52 (4392): No heartbeat from core client for 30 sec - exiting 16:42:54 (4392): No heartbeat from core client for 30 sec - exiting 16:42:55 (4392): No heartbeat from core client for 30 sec - exiting 16:42:56 (4392): No heartbeat from core client for 30 sec - exiting 16:42:57 (4392): No heartbeat from core client for 30 sec - exiting 16:42:58 (4392): No heartbeat from core client for 30 sec - exiting 16:42:59 (4392): No heartbeat from core client for 30 sec - exiting 16:43:00 (4392): No heartbeat from core client for 30 sec - exiting 16:43:01 (4392): No heartbeat from core client for 30 sec - exiting 16:43:02 (4392): No heartbeat from core client for 30 sec - exiting 16:43:03 (4392): No heartbeat from core client for 30 sec - exiting 16:43:04 (4392): No heartbeat from core client for 30 sec - exiting 16:43:05 (4392): No heartbeat from core client for 30 sec - exiting 16:43:07 (4392): No heartbeat from core client for 30 sec - exiting 16:43:08 (4392): No heartbeat from core client for 30 sec - exiting 16:43:09 (4392): No heartbeat from core client for 30 sec - exiting 16:43:10 (4392): No heartbeat from core client for 30 sec - exiting 16:43:11 (4392): No heartbeat from core client for 30 sec - exiting 16:43:12 (4392): No heartbeat from core client for 30 sec - exiting 16:43:13 (4392): No heartbeat from core client for 30 sec - exiting 16:43:14 (4392): No heartbeat from core client for 30 sec - exiting 16:43:15 (4392): No heartbeat from core client for 30 sec - exiting 16:43:16 (4392): No heartbeat from core client for 30 sec - exiting 16:43:18 (4392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2584, selfPID=3292, iMonCtr=1 Model crash detected, will try to restart... 12:19:15 (4436): No heartbeat from core client for 30 sec - exiting 12:19:16 (4436): No heartbeat from core client for 30 sec - exiting 12:19:18 (4436): No heartbeat from core client for 30 sec - exiting 12:19:19 (4436): No heartbeat from core client for 30 sec - exiting 12:19:20 (4436): No heartbeat from core client for 30 sec - exiting 12:19:21 (4436): No heartbeat from core client for 30 sec - exiting 12:19:22 (4436): No heartbeat from core client for 30 sec - exiting 12:19:23 (4436): No heartbeat from core client for 30 sec - exiting 12:19:24 (4436): No heartbeat from core client for 30 sec - exiting 12:19:25 (4436): No heartbeat from core client for 30 sec - exiting 12:19:26 (4436): No heartbeat from core client for 30 sec - exiting 12:19:27 (4436): No heartbeat from core client for 30 sec - exiting 12:19:28 (4436): No heartbeat from core client for 30 sec - exiting 12:19:30 (4436): No heartbeat from core client for 30 sec - exiting 12:19:31 (4436): No heartbeat from core client for 30 sec - exiting 12:19:32 (4436): No heartbeat from core client for 30 sec - exiting 12:19:33 (4436): No heartbeat from core client for 30 sec - exiting 12:19:34 (4436): No heartbeat from core client for 30 sec - exiting 12:19:35 (4436): No heartbeat from core client for 30 sec - exiting 12:19:36 (4436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=2 Model crash detected, will try to restart... 20:57:52 (4448): No heartbeat from core client for 30 sec - exiting 20:57:54 (4448): No heartbeat from core client for 30 sec - exiting 20:57:55 (4448): No heartbeat from core client for 30 sec - exiting 20:57:56 (4448): No heartbeat from core client for 30 sec - exiting 20:57:57 (4448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5724, iMonCtr=2 Model crash detected, will try to restart... 04:44:36 (5724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:53:58 (5228): No heartbeat from core client for 30 sec - exiting 10:53:59 (5228): No heartbeat from core client for 30 sec - exiting 10:54:00 (5228): No heartbeat from core client for 30 sec - exiting 10:54:01 (5228): No heartbeat from core client for 30 sec - exiting 10:54:02 (5228): No heartbeat from core client for 30 sec - exiting 10:54:03 (5228): No heartbeat from core client for 30 sec - exiting 10:54:04 (5228): No heartbeat from core client for 30 sec - exiting 10:54:05 (5228): No heartbeat from core client for 30 sec - exiting 10:54:06 (5228): No heartbeat from core client for 30 sec - exiting 10:54:07 (5228): No heartbeat from core client for 30 sec - exiting 10:54:08 (5228): No heartbeat from core client for 30 sec - exiting 10:54:09 (5228): No heartbeat from core client for 30 sec - exiting 10:54:10 (5228): No heartbeat from core client for 30 sec - exiting 10:54:11 (5228): No heartbeat from core client for 30 sec - exiting 10:54:12 (5228): No heartbeat from core client for 30 sec - exiting 10:54:13 (5228): No heartbeat from core client for 30 sec - exiting 10:54:14 (5228): No heartbeat from core client for 30 sec - exiting 10:54:15 (5228): No heartbeat from core client for 30 sec - exiting 10:54:16 (5228): No heartbeat from core client for 30 sec - exiting 10:54:17 (5228): No heartbeat from core client for 30 sec - exiting 10:54:18 (5228): No heartbeat from core client for 30 sec - exiting 10:54:19 (5228): No heartbeat from core client for 30 sec - exiting 10:54:20 (5228): No heartbeat from core client for 30 sec - exiting 10:54:21 (5228): No heartbeat from core client for 30 sec - exiting 10:54:22 (5228): No heartbeat from core client for 30 sec - exiting 10:54:23 (5228): No heartbeat from core client for 30 sec - exiting 10:54:24 (5228): No heartbeat from core client for 30 sec - exiting 10:54:25 (5228): No heartbeat from core client for 30 sec - exiting 10:54:26 (5228): No heartbeat from core client for 30 sec - exiting 10:54:27 (5228): No heartbeat from core client for 30 sec - exiting 10:54:28 (5228): No heartbeat from core client for 30 sec - exiting 10:54:29 (5228): No heartbeat from core client for 30 sec - exiting 10:54:30 (5228): No heartbeat from core client for 30 sec - exiting 10:54:31 (5228): No heartbeat from core client for 30 sec - exiting 10:54:32 (5228): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:33 (4604): Can't acquire lockfile (32) - waiting 35s Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3604, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3196, selfPID=4604, iMonCtr=1 Model crash detected, will try to restart... 14:52:28 (5272): No heartbeat from core client for 30 sec - exiting 14:52:29 (5272): No heartbeat from core client for 30 sec - exiting 14:52:30 (5272): No heartbeat from core client for 30 sec - exiting 14:52:31 (5272): No heartbeat from core client for 30 sec - exiting 14:52:32 (5272): No heartbeat from core client for 30 sec - exiting 14:52:33 (5272): No heartbeat from core client for 30 sec - exiting 14:52:34 (5272): No heartbeat from core client for 30 sec - exiting 14:52:35 (5272): No heartbeat from core client for 30 sec - exiting 14:52:36 (5272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4880, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=2 Model crash detected, will try to restart... 19:11:55 (6076): No heartbeat from core client for 30 sec - exiting 19:11:57 (6076): No heartbeat from core client for 30 sec - exiting 19:11:58 (6076): No heartbeat from core client for 30 sec - exiting 19:11:59 (6076): No heartbeat from core client for 30 sec - exiting 19:12:00 (6076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:41:53 (5708): No heartbeat from core client for 30 sec - exiting 09:41:54 (5708): No heartbeat from core client for 30 sec - exiting 09:41:55 (5708): No heartbeat from core client for 30 sec - exiting 09:41:56 (5708): No heartbeat from core client for 30 sec - exiting 09:41:57 (5708): No heartbeat from core client for 30 sec - exiting 09:41:58 (5708): No heartbeat from core client for 30 sec - exiting 09:42:00 (5708): No heartbeat from core client for 30 sec - exiting 09:42:01 (5708): No heartbeat from core client for 30 sec - exiting 09:42:02 (5708): No heartbeat from core client for 30 sec - exiting 09:42:03 (5708): No heartbeat from core client for 30 sec - exiting 09:42:04 (5708): No heartbeat from core client for 30 sec - exiting 09:42:05 (5708): No heartbeat from core client for 30 sec - exiting 09:42:06 (5708): No heartbeat from core client for 30 sec - exiting 09:42:07 (5708): No heartbeat from core client for 30 sec - exiting 09:42:08 (5708): No heartbeat from core client for 30 sec - exiting 09:42:09 (5708): No heartbeat from core client for 30 sec - exiting 09:42:10 (5708): No heartbeat from core client for 30 sec - exiting 09:42:12 (5708): No heartbeat from core client for 30 sec - exiting 09:42:13 (5708): No heartbeat from core client for 30 sec - exiting 09:42:14 (5708): No heartbeat from core client for 30 sec - exiting 09:42:15 (5708): No heartbeat from core client for 30 sec - exiting 09:42:16 (5708): No heartbeat from core client for 30 sec - exiting 09:42:17 (5708): No heartbeat from core client for 30 sec - exiting 09:42:18 (5708): No heartbeat from core client for 30 sec - exiting 09:42:19 (5708): No heartbeat from core client for 30 sec - exiting 09:42:20 (5708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2912, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3912, selfPID=5912, iMonCtr=1 Model crash detected, will try to restart... 09:05:25 (4172): No heartbeat from core client for 30 sec - exiting 09:05:26 (4172): No heartbeat from core client for 30 sec - exiting 09:05:27 (4172): No heartbeat from core client for 30 sec - exiting 09:05:28 (4172): No heartbeat from core client for 30 sec - exiting 09:05:29 (4172): No heartbeat from core client for 30 sec - exiting 09:05:30 (4172): No heartbeat from core client for 30 sec - exiting 09:05:31 (4172): No heartbeat from core client for 30 sec - exiting 09:05:32 (4172): No heartbeat from core client for 30 sec - exiting 09:05:33 (4172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2732, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3220, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 07:14:11 (5020): No heartbeat from core client for 30 sec - exiting 07:14:13 (5020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3996, iMonCtr=2 Model crash detected, will try to restart... 12:01:35 (5356): No heartbeat from core client for 30 sec - exiting 12:01:37 (5356): No heartbeat from core client for 30 sec - exiting 12:01:38 (5356): No heartbeat from core client for 30 sec - exiting 12:01:39 (5356): No heartbeat from core client for 30 sec - exiting 12:01:40 (5356): No heartbeat from core client for 30 sec - exiting 12:01:41 (5356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4744, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5124, selfPID=5868, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5468, iMonCtr=2 Model crash detected, will try to restart... 09:09:52 (5408): No heartbeat from core client for 30 sec - exiting 09:09:53 (5408): No heartbeat from core client for 30 sec - exiting 09:09:54 (5408): No heartbeat from core client for 30 sec - exiting 09:09:55 (5408): No heartbeat from core client for 30 sec - exiting 09:09:56 (5408): No heartbeat from core client for 30 sec - exiting 09:09:57 (5408): No heartbeat from core client for 30 sec - exiting 09:09:58 (5408): No heartbeat from core client for 30 sec - exiting 09:09:59 (5408): No heartbeat from core client for 30 sec - exiting 09:10:01 (5408): No heartbeat from core client for 30 sec - exiting 09:10:02 (5408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1712, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=2 Model crash detected, will try to restart... 21:17:41 (2852): No heartbeat from core client for 30 sec - exiting 21:17:42 (2852): No heartbeat from core client for 30 sec - exiting 21:17:43 (2852): No heartbeat from core client for 30 sec - exiting 21:17:44 (2852): No heartbeat from core client for 30 sec - exiting 21:17:45 (2852): No heartbeat from core client for 30 sec - exiting 21:17:46 (2852): No heartbeat from core client for 30 sec - exiting 21:17:47 (2852): No heartbeat from core client for 30 sec - exiting 21:17:48 (2852): No heartbeat from core client for 30 sec - exiting 21:17:49 (2852): No heartbeat from core client for 30 sec - exiting 21:17:50 (2852): No heartbeat from core client for 30 sec - exiting 21:17:51 (2852): No heartbeat from core client for 30 sec - exiting 21:17:52 (2852): No heartbeat from core client for 30 sec - exiting 21:17:53 (2852): No heartbeat from core client for 30 sec - exiting 21:17:54 (2852): No heartbeat from core client for 30 sec - exiting 21:17:55 (2852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5992, selfPID=2008, iMonCtr=1 Model crash detected, will try to restart... 08:47:24 (5900): No heartbeat from core client for 30 sec - exiting 08:47:25 (5900): No heartbeat from core client for 30 sec - exiting 08:47:26 (5900): No heartbeat from core client for 30 sec - exiting 08:47:27 (5900): No heartbeat from core client for 30 sec - exiting 08:47:28 (5900): No heartbeat from core client for 30 sec - exiting 08:47:29 (5900): No heartbeat from core client for 30 sec - exiting 08:47:30 (5900): No heartbeat from core client for 30 sec - exiting 08:47:31 (5900): No heartbeat from core client for 30 sec - exiting 08:47:33 (5900): No heartbeat from core client for 30 sec - exiting 08:47:34 (5900): No heartbeat from core client for 30 sec - exiting 08:47:35 (5900): No heartbeat from core client for 30 sec - exiting 08:47:36 (5900): No heartbeat from core client for 30 sec - exiting 08:47:37 (5900): No heartbeat from core client for 30 sec - exiting 08:47:38 (5900): No heartbeat from core client for 30 sec - exiting 08:47:39 (5900): No heartbeat from core client for 30 sec - exiting 08:47:40 (5900): No heartbeat from core client for 30 sec - exiting 08:47:41 (5900): No heartbeat from core client for 30 sec - exiting 08:47:42 (5900): No heartbeat from core client for 30 sec - exiting 08:47:43 (5900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3056, selfPID=4032, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5956, selfPID=5356, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt><message> upload failure: <file_xfer_error> <file_name>hadam3p_eu_qimb_2008_1_008397411_0_9.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qimb_2008_1_008397411_0_10.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qimb_2008_1_008397411_0_11.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_eu_qimb_2008_1_008397411_0_12.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Jul 2013 15:24:20 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 92,256 | 495,177 | 5.3674 |
07 Jul 2013 09:56:19 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 80,736 | 431,859 | 5.3490 |
06 Jul 2013 05:42:26 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 69,216 | 367,354 | 5.3074 |
04 Jul 2013 14:21:44 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 57,696 | 306,733 | 5.3164 |
03 Jul 2013 02:33:59 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 46,176 | 247,470 | 5.3593 |
02 Jul 2013 10:54:41 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 34,656 | 182,359 | 5.2620 |
02 Jul 2013 10:07:18 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 23,136 | 117,778 | 5.0907 |
28 Jun 2013 03:46:18 | 1218845 | 15867137 | hadam3p_eu_qimb_2008_1_008397411_0 | 11,616 | 62,725 | 5.3999 |
©2024 cpdn.org