Name | hadam3p_saf_v7bw_2003_1_006683130_0 |
Workunit | 6886383 |
Created | 26 Aug 2010, 11:56:47 UTC |
Sent | 27 Aug 2010, 21:04:48 UTC |
Report deadline | 10 Aug 2011, 2:24:48 UTC |
Received | 14 Nov 2010, 18:39:20 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 979988 |
Run time | 21 days 2 hours 47 min 12 sec |
CPU time | 10 days 10 hours 20 min 57 sec |
Validate state | Workunit error - check skipped |
Credit | 2,244.09 |
Device peak FLOPS | 0.93 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.05 windows_intelx86 |
Stderr | <core_client_version>6.6.31</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5268, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3956, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4860, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 08:13:20 (4920): No heartbeat from core client for 30 sec - exiting 08:13:21 (4920): No heartbeat from core client for 30 sec - exiting 08:13:22 (4920): No heartbeat from core client for 30 sec - exiting 08:13:23 (4920): No heartbeat from core client for 30 sec - exiting 08:13:24 (4920): No heartbeat from core client for 30 sec - exiting 08:13:25 (4920): No heartbeat from core client for 30 sec - exiting 08:13:26 (4920): No heartbeat from core client for 30 sec - exiting 08:13:27 (4920): No heartbeat from core client for 30 sec - exiting 08:13:28 (4920): No heartbeat from core client for 30 sec - exiting 08:13:29 (4920): No heartbeat from core client for 30 sec - exiting 08:13:30 (4920): No heartbeat from core client for 30 sec - exiting 08:13:31 (4920): No heartbeat from core client for 30 sec - exiting 08:13:32 (4920): No heartbeat from core client for 30 sec - exiting 08:13:33 (4920): No heartbeat from core client for 30 sec - exiting 08:13:34 (4920): No heartbeat from core client for 30 sec - exiting 08:13:35 (4920): No heartbeat from core client for 30 sec - exiting 08:13:36 (4920): No heartbeat from core client for 30 sec - exiting 08:13:37 (4920): No heartbeat from core client for 30 sec - exiting 08:13:38 (4920): No 
heartbeat from core client for 30 sec - exiting 08:13:39 (4920): No heartbeat from core client for 30 sec - exiting 08:13:40 (4920): No heartbeat from core client for 30 sec - exiting 08:13:41 (4920): No heartbeat from core client for 30 sec - exiting 08:13:42 (4920): No heartbeat from core client for 30 sec - exiting 08:13:43 (4920): No heartbeat from core client for 30 sec - exiting 08:13:44 (4920): No heartbeat from core client for 30 sec - exiting 08:13:45 (4920): No heartbeat from core client for 30 sec - exiting 08:13:46 (4920): No heartbeat from core client for 30 sec - exiting 08:13:47 (4920): No heartbeat from core client for 30 sec - exiting 08:13:48 (4920): No heartbeat from core client for 30 sec - exiting 08:13:49 (4920): No heartbeat from core client for 30 sec - exiting 08:13:50 (4920): No heartbeat from core client for 30 sec - exiting 08:13:51 (4920): No heartbeat from core client for 30 sec - exiting 08:13:52 (4920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3972, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3336, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... 10:32:55 (4968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5444, iMonCtr=2 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1804, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3872, selfPID=4880, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5884, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5608, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2136, selfPID=2136, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6384, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5460, selfPID=2168, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5120, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6032, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5064, selfPID=2068, iMonCtr=1 Model crash detected, will try to restart... 12:13:19 (4868): No heartbeat from core client for 30 sec - exiting 12:13:20 (4868): No heartbeat from core client for 30 sec - exiting 12:13:21 (4868): No heartbeat from core client for 30 sec - exiting 12:13:22 (4868): No heartbeat from core client for 30 sec - exiting 12:13:23 (4868): No heartbeat from core client for 30 sec - exiting 12:13:24 (4868): No heartbeat from core client for 30 sec - exiting 12:13:25 (4868): No heartbeat from core client for 30 sec - exiting 12:13:26 (4868): No heartbeat from core client for 30 sec - exiting 12:13:27 (4868): No heartbeat from core client for 30 sec - exiting 12:13:28 (4868): No heartbeat from core client for 30 sec - exiting 12:13:29 (4868): No heartbeat from core client for 30 sec - exiting 12:13:30 (4868): No heartbeat from core client for 30 sec - exiting 12:13:31 (4868): No heartbeat from core client for 30 sec - exiting 12:13:32 (4868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5864, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5356, selfPID=2764, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6008, selfPID=4144, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5580, selfPID=5580, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4668, iMonCtr=2 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2288, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3048, selfPID=1632, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=572, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5052, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1960, selfPID=3972, iMonCtr=1 Model crash detected, will try to restart... 23:57:16 (4488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:57:17 (4488): No heartbeat from core client for 30 sec - exiting 23:57:18 (4488): No heartbeat from core client for 30 sec - exiting 23:57:19 (4488): No heartbeat from core client for 30 sec - exiting 23:57:25 (4488): No heartbeat from core client for 30 sec - exiting 23:57:26 (4488): No heartbeat from core client for 30 sec - exiting 23:57:27 (4488): No heartbeat from core client for 30 sec - exiting 23:57:28 (4488): No heartbeat from core client for 30 sec - exiting 23:57:29 (4488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7520, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7560, selfPID=8104, iMonCtr=1 Model crash detected, will try to restart... 
09:34:19 (3580): No heartbeat from core client for 30 sec - exiting 09:34:20 (3580): No heartbeat from core client for 30 sec - exiting 09:34:21 (3580): No heartbeat from core client for 30 sec - exiting 09:34:22 (3580): No heartbeat from core client for 30 sec - exiting 09:34:23 (3580): No heartbeat from core client for 30 sec - exiting 09:34:24 (3580): No heartbeat from core client for 30 sec - exiting 09:34:25 (3580): No heartbeat from core client for 30 sec - exiting 09:34:26 (3580): No heartbeat from core client for 30 sec - exiting 09:34:27 (3580): No heartbeat from core client for 30 sec - exiting 09:34:28 (3580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3024, selfPID=3024, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2476, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4640, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4900, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
08:11:28 (5752): No heartbeat from core client for 30 sec - exiting 08:11:29 (5752): No heartbeat from core client for 30 sec - exiting 08:11:30 (5752): No heartbeat from core client for 30 sec - exiting 08:11:31 (5752): No heartbeat from core client for 30 sec - exiting 08:11:32 (5752): No heartbeat from core client for 30 sec - exiting 08:11:33 (5752): No heartbeat from core client for 30 sec - exiting 08:11:34 (5752): No heartbeat from core client for 30 sec - exiting 08:11:35 (5752): No heartbeat from core client for 30 sec - exiting 08:11:36 (5752): No heartbeat from core client for 30 sec - exiting 08:11:37 (5752): No heartbeat from core client for 30 sec - exiting 08:11:38 (5752): No heartbeat from core client for 30 sec - exiting 08:11:39 (5752): No heartbeat from core client for 30 sec - exiting 08:11:40 (5752): No heartbeat from core client for 30 sec - exiting 08:11:41 (5752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5964, selfPID=5056, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3880, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4812, selfPID=4684, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
06:06:16 (5600): No heartbeat from core client for 30 sec - exiting 06:06:17 (5600): No heartbeat from core client for 30 sec - exiting 06:06:18 (5600): No heartbeat from core client for 30 sec - exiting 06:06:19 (5600): No heartbeat from core client for 30 sec - exiting 06:06:20 (5600): No heartbeat from core client for 30 sec - exiting 06:06:21 (5600): No heartbeat from core client for 30 sec - exiting 06:06:22 (5600): No heartbeat from core client for 30 sec - exiting 06:06:24 (5600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6988, selfPID=3464, iMonCtr=1 Model crash detected, will try to restart... 09:59:43 (4988): No heartbeat from core client for 30 sec - exiting 10:01:35 (4988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: 08:43:25 (4516): No heartbeat from core client for 30 sec - exiting 08:43:26 (4516): No heartbeat from core client for 30 sec - exiting 08:43:27 (4516): No heartbeat from core client for 30 sec - exiting 08:43:28 (4516): No heartbeat from core client for 30 sec - exiting 08:43:29 (4516): No heartbeat from core client for 30 sec - exiting 08:43:30 (4516): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5872, selfPID=2736, iMonCtr=1 Model crash detected, will try to restart... 08:10:03 (5192): No heartbeat from core client for 30 sec - exiting 08:10:04 (5192): No heartbeat from core client for 30 sec - exiting 08:10:05 (5192): No heartbeat from core client for 30 sec - exiting 08:10:06 (5192): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
G08:54:45 (6068): No heartbeat from core client for 30 sec - exiting 08:54:46 (6068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4792, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4752, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 08:13:58 (2828): No heartbeat from core client for 30 sec - exiting 08:13:59 (2828): No heartbeat from core client for 30 sec - exiting 08:14:00 (2828): No heartbeat from core client for 30 sec - exiting 08:14:01 (2828): No heartbeat from core client for 30 sec - exiting 08:14:02 (2828): No heartbeat from core client for 30 sec - exiting 08:14:04 (2828): No heartbeat from core client for 30 sec - exiting 08:14:05 (2828): No heartbeat from core client for 30 sec - exiting 08:14:06 (2828): No heartbeat from core client for 30 sec - exiting 08:14:07 (2828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7060, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 08:21:06 (3336): No heartbeat from core client for 30 sec - exiting 08:21:07 (3336): No heartbeat from core client for 30 sec - exiting 08:21:08 (3336): No heartbeat from core client for 30 sec - exiting 08:21:09 (3336): No heartbeat from core client for 30 sec - exiting 08:21:10 (3336): No heartbeat from core client for 30 sec - exiting 08:21:11 (3336): No heartbeat from core client for 30 sec - exiting 08:21:12 (3336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
08:10:07 (4984): No heartbeat from core client for 30 sec - exiting 08:10:08 (4984): No heartbeat from core client for 30 sec - exiting 08:10:09 (4984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:03 (3632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5428, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 09:27:12 (276): No heartbeat from core client for 30 sec - exiting 09:27:13 (276): No heartbeat from core client for 30 sec - exiting 09:27:14 (276): No heartbeat from core client for 30 sec - exiting 09:27:15 (276): No heartbeat from core client for 30 sec - exiting 09:27:16 (276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:35:09 (4256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... 20:57:08 (4644): called boinc_finish </stderr_txt> ]]> |
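The stderr text above repeats a small set of messages many times, so a tally by message type is easier to read than the raw dump. The following is a minimal sketch of such a tally. The label names and the `tally_stderr` helper are hypothetical and not part of BOINC or CPDN; the matched substrings are copied from the log above, and the sample is a short excerpt rather than the full output.

```python
import re
from collections import Counter

# Hypothetical labels mapped to message fragments copied from the stderr text.
PATTERNS = {
    "no_heartbeat": re.compile(r"No heartbeat from core client for 30 sec - exiting"),
    "crash_restart": re.compile(r"Model crash detected, will try to restart"),
    "quit_request": re.compile(r"CPDN Monitor - Quit request from BOINC"),
    "finished": re.compile(r"called boinc_finish"),
}


def tally_stderr(text: str) -> Counter:
    """Count how often each known message fragment occurs in the text."""
    counts = Counter()
    for label, pattern in PATTERNS.items():
        counts[label] = len(pattern.findall(text))
    return counts


# Short excerpt taken from the end of the stderr output above.
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=5428, iMonCtr=2 Model crash detected, will try to restart... "
    "09:27:12 (276): No heartbeat from core client for 30 sec - exiting "
    "09:27:13 (276): No heartbeat from core client for 30 sec - exiting "
    "20:57:08 (4644): called boinc_finish"
)

for label, count in tally_stderr(sample).items():
    print(f"{label}: {count}")
```

Running the same function over the complete stderr text would give the total number of lost heartbeats and restart attempts for this task.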
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
14 Nov 2010 18:42:26 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 138,336 | 899,606 | 6.5031 |
10 Nov 2010 00:02:51 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 126,823 | 820,624 | 6.4706 |
09 Nov 2010 22:04:18 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 126,816 | 819,546 | 6.4625 |
02 Nov 2010 15:12:59 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 115,296 | 741,284 | 6.4294 |
30 Oct 2010 21:13:22 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 103,776 | 671,576 | 6.4714 |
28 Oct 2010 17:07:31 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 92,256 | 605,113 | 6.5591 |
15 Oct 2010 16:05:34 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 80,736 | 537,886 | 6.6623 |
10 Oct 2010 04:01:16 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 69,216 | 472,611 | 6.8281 |
28 Sep 2010 19:13:47 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 57,696 | 398,135 | 6.9006 |
24 Sep 2010 19:10:52 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 46,176 | 320,498 | 6.9408 |
21 Sep 2010 14:51:13 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 34,656 | 248,657 | 7.1750 |
13 Sep 2010 22:54:24 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 23,136 | 175,027 | 7.5651 |
05 Sep 2010 23:50:26 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 11,622 | 93,500 | 8.0451 |
05 Sep 2010 22:20:39 | 979988 | 11685902 | hadam3p_saf_v7bw_2003_1_006683130_0 | 11,616 | 91,995 | 7.9197 |
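The Average (sec/TS) column appears to be CPU Time (sec) divided by Timestep. That relationship is inferred from the numbers and is not stated on the page. The sketch below checks it against four rows copied from the table and converts the reported CPU time of 10 days 10 hours 20 min 57 sec into seconds for comparison with the final trickle. The helper names are hypothetical.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table.
ROWS = [
    (138_336, 899_606, 6.5031),
    (126_823, 820_624, 6.4706),
    (11_622, 93_500, 8.0451),
    (11_616, 91_995, 7.9197),
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds spent per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


def to_seconds(days: int, hours: int, minutes: int, seconds: int) -> int:
    """Convert a days/hours/minutes/seconds duration to whole seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


for timestep, cpu_time, reported in ROWS:
    computed = seconds_per_timestep(cpu_time, timestep)
    # The table shows four decimal places, so allow a matching tolerance.
    matches = abs(computed - reported) < 1e-4
    print(f"{timestep:>7} {cpu_time:>7} {computed:.4f} {reported:.4f} {matches}")

# CPU time reported in the result summary: 10 days 10 hours 20 min 57 sec.
total_cpu_sec = to_seconds(10, 10, 20, 57)
print(total_cpu_sec, total_cpu_sec - ROWS[0][1])
```

The final trickle reports 899,606 CPU seconds, slightly less than the 901,257 seconds in the result summary, which would be consistent with some CPU time accruing after the last trickle was sent.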
©2024 cpdn.org