Name | hadam3p_saf_0sgc_1971_1_006860452_1 |
Workunit | 7063768 |
Created | 23 Apr 2011, 10:05:16 UTC |
Sent | 23 Apr 2011, 10:20:41 UTC |
Report deadline | 4 Apr 2012, 15:40:41 UTC |
Received | 30 May 2011, 16:40:20 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 922918 |
Run time | 7 days 20 hours 27 min 3 sec |
CPU time | 5 days 16 hours 27 min 32 sec |
Validate state | Workunit error - check skipped |
Credit | 2,244.09 |
Device peak FLOPS | 2.32 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6096, selfPID=6096, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exitCPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1928, selfPID=1928, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2160, selfPID=2160, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3256, selfPID=3256, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3768, selfPID=3768, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4236, selfPID=4236, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:21:25 (4208): Can't acquire lockfile (32) - waiting 35s Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4972, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:10:03 (4788): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5756, selfPID=5756, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2788, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5576, selfPID=3140, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4896, selfPID=4092, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:02:25 (5220): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=2 16:19:47 (5116): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1396, selfPID=5116, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1476, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=2 Leaving CPDN_Main::Monitor... 16:11:53 (4508): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4124, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5976, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5580, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 16:20:20 (6140): No heartbeat from core client for 30 sec - exiting 16:20:21 (6140): No heartbeat from core client for 30 sec - exiting 16:20:22 (6140): No heartbeat from core client for 30 sec - exiting 16:20:23 (6140): No heartbeat from core client for 30 sec - exiting 16:20:24 (6140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5176, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4436, selfPID=3816, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5856, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=668, iMonCtr=2 Model crash detected, will try to restart... 19:25:45 (4728): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4988, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3268, selfPID=4780, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4420, selfPID=3608, iMonCtr=1 Model crash detected, will try to restart... 16:17:53 (5224): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5856, iMonCtr=2 Model crash detected, will try to restart... 16:20:11 (4576): No heartbeat from core client for 30 sec - exiting 16:20:12 (4576): No heartbeat from core client for 30 sec - exiting 16:20:13 (4576): No heartbeat from core client for 30 sec - exiting 16:20:14 (4576): No heartbeat from core client for 30 sec - exiting 16:20:15 (4576): No heartbeat from core client for 30 sec - exiting 16:20:16 (4576): No heartbeat from core client for 30 sec - exiting 16:20:17 (4576): No heartbeat from core client for 30 sec - exiting 16:20:18 (4576): No heartbeat from core client for 30 sec - exiting 16:20:19 (4576): No heartbeat from core client for 30 sec - exiting 16:20:20 (4576): No heartbeat from core client for 30 sec - exiting 16:20:22 (4576): No heartbeat from core client for 30 sec - exiting 16:20:23 (4576): No heartbeat from core client for 30 sec - exiting 16:20:24 (4576): No heartbeat from core client for 30 sec - exiting 16:20:25 (4576): No heartbeat from core client for 30 sec - exiting 16:20:26 (4576): No heartbeat from core client for 30 sec - exiting 16:20:27 (4576): No heartbeat from core client for 30 sec - exiting 16:20:28 (4576): No heartbeat from core client for 30 sec - exiting 16:20:29 (4576): No heartbeat from core client for 30 sec - exiting 16:20:30 (4576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:20:42 (5784): Can't acquire lockfile (32) - waiting 35s 16:38:06 (4888): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=2 Model crash detected, will try to restart... 16:19:21 (5712): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4676, iMonCtr=2 Model crash detected, will try to restart... 11:01:34 (4796): Can't acquire lockfile (32) - waiting 35s Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5512, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 10:26:09 (5344): No heartbeat from core client for 30 sec - exiting 10:26:10 (5344): No heartbeat from core client for 30 sec - exiting 10:26:12 (5344): No heartbeat from core client for 30 sec - exiting 10:26:13 (5344): No heartbeat from core client for 30 sec - exiting 10:26:14 (5344): No heartbeat from core client for 30 sec - exiting 10:26:15 (5344): No heartbeat from core client for 30 sec - exiting 10:26:16 (5344): No heartbeat from core client for 30 sec - exiting 10:26:17 (5344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3716, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4908, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:20:03 (4304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:42:19 (1164): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4564, selfPID=5292, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5600, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5612, selfPID=2884, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5548, selfPID=3224, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 11:52:02 (4892): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2528, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:15:25 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4136, selfPID=4820, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2928, selfPID=1448, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5672, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... 16:17:03 (4424): Can't acquire lockfile (32) - waiting 35s Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5128, selfPID=2200, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:21:30 (4264): No heartbeat from core client for 30 sec - exiting 16:21:31 (4264): No heartbeat from core client for 30 sec - exiting 16:21:32 (4264): No heartbeat from core client for 30 sec - exiting 16:21:33 (4264): No heartbeat from core client for 30 sec - exiting 16:21:35 (4264): No heartbeat from core client for 30 sec - exiting 16:21:36 (4264): No heartbeat from core client for 30 sec - exiting 16:21:37 (4264): No heartbeat from core client for 30 sec - exiting 16:21:38 (4264): No heartbeat from core client for 30 sec - exiting 16:21:39 (4264): No heartbeat from core client for 30 sec - exiting 16:21:40 (4264): No heartbeat from core client for 30 sec - exiting 16:21:41 (4264): No heartbeat from core client for 30 sec - exiting 16:21:42 (4264): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6100, selfPID=3728, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
30 May 2011 17:07:35 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 138,336 | 490,497 | 3.5457 |
29 May 2011 11:53:51 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 126,816 | 449,781 | 3.5467 |
28 May 2011 08:19:27 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 115,296 | 409,685 | 3.5533 |
24 May 2011 20:21:56 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 103,776 | 369,721 | 3.5627 |
23 May 2011 10:18:07 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 92,256 | 329,112 | 3.5674 |
22 May 2011 07:16:29 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 80,752 | 288,847 | 3.5770 |
21 May 2011 19:14:17 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 80,736 | 288,126 | 3.5687 |
19 May 2011 15:34:12 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 69,216 | 248,129 | 3.5849 |
16 May 2011 12:19:38 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 57,696 | 207,513 | 3.5967 |
15 May 2011 11:18:45 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 46,176 | 166,482 | 3.6054 |
29 Apr 2011 10:29:12 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 34,656 | 125,236 | 3.6137 |
25 Apr 2011 17:28:39 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 23,136 | 83,494 | 3.6088 |
24 Apr 2011 11:45:11 | 922918 | 12809380 | hadam3p_saf_0sgc_1971_1_006860452_1 | 11,616 | 41,476 | 3.5706 |
©2024 climateprediction.net