Name | hadam3p_saf_2gze_1984_1_007151693_0 |
Workunit | 7336473 |
Created | 26 Jan 2011, 21:43:52 UTC |
Sent | 28 Jan 2011, 22:24:27 UTC |
Report deadline | 11 Jan 2012, 3:44:27 UTC |
Received | 11 Mar 2011, 18:09:04 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 983307 |
Run time | 10 days 15 hours 53 min 28 sec |
CPU time | 9 days 1 hours 47 min 37 sec |
Validate state | Workunit error - check skipped |
Credit | 2,244.09 |
Device peak FLOPS | 1.74 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Southern Africa v6.08 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4080, selfPID=936, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=5832, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5456, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1232, selfPID=5244, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1768, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5292, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5852, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1744, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CGntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5972, ilonCtr=2 rkerel crash detected, will try to restart... :: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=2 Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5428, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3624, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5260, selfPID=5716, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2320, selfPID=4740, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4044, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5260, selfPID=3316, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=5496, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 14:16:20 (5604): No heartbeat from core client for 30 sec - exiting 14:16:21 (5604): No heartbeat from core client for 30 sec - exiting 14:16:22 (5604): No heartbeat from core client for 30 sec - exiting 14:16:23 (5604): No heartbeat from core client for 30 sec - exiting 14:16:24 (5604): No heartbeat from core client for 30 sec - exiting 14:16:25 (5604): No heartbeat from core client for 30 sec - exiting 14:16:26 (5604): No heartbeat from core client for 30 sec - exiting 14:16:28 (5604): No heartbeat from core client for 30 sec - exiting 14:16:29 (5604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5628, selfPID=4520, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5888, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1056, selfPID=6048, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5532, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5148, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5944, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5772, selfPID=6088, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4760, selfPID=5612, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1540, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4696, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5984, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5656, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4456, iMonCtr=2 Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5400, selfPID=5756, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=5340, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2620, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6076, selfPID=5780, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5748, selfPID=5304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5540, selfPID=5636, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5240, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5796, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4224, selfPID=4224, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5360, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4868, selfPID=5884, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5724, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1692, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5548, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5560, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4112, iMonCtr=2 Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2932, selfPID=5436, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1460, iMonCtr=2 Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5604, selfPID=5768, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5332, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=836, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5140, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5428, iMonCtr=2 Leaving CPDN_Main::Monitor... Leaving CPDN_Main::Monitor... 17:27:49 (5704): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Feb 2011 22:03:21 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 138,336 | 782,786 | 5.6586 |
23 Feb 2011 21:38:45 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 126,816 | 713,159 | 5.6236 |
20 Feb 2011 01:33:24 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 115,296 | 645,225 | 5.5962 |
18 Feb 2011 00:29:22 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 103,776 | 578,355 | 5.5731 |
14 Feb 2011 22:30:08 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 92,264 | 513,663 | 5.5673 |
14 Feb 2011 03:36:14 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 92,256 | 512,821 | 5.5587 |
11 Feb 2011 21:42:00 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 80,736 | 448,214 | 5.5516 |
10 Feb 2011 15:28:13 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 69,216 | 384,405 | 5.5537 |
09 Feb 2011 13:03:52 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 57,723 | 323,151 | 5.5983 |
09 Feb 2011 12:54:28 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 57,704 | 322,194 | 5.5836 |
09 Feb 2011 12:54:28 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 57,696 | 321,389 | 5.5704 |
06 Feb 2011 03:39:53 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 46,176 | 255,679 | 5.5371 |
04 Feb 2011 18:54:31 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 34,656 | 189,046 | 5.4549 |
02 Feb 2011 18:47:41 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 23,136 | 128,406 | 5.5501 |
30 Jan 2011 18:40:22 | 983307 | 12536490 | hadam3p_saf_2gze_1984_1_007151693_0 | 11,616 | 66,011 | 5.6828 |
©2024 cpdn.org