Name | hadam3p_eu_2tvr_1960_1_007305769_1 |
Workunit | 7503193 |
Created | 30 Jun 2011, 15:26:08 UTC |
Sent | 30 Jun 2011, 15:26:17 UTC |
Report deadline | 11 Jun 2012, 20:46:17 UTC |
Received | 28 Feb 2012, 12:44:17 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 953840 |
Run time | 7 days 21 hours 3 min 15 sec |
CPU time | 7 days 21 hours 3 min 15 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.50 |
Device peak FLOPS | 1.98 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.4.5</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4216, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5968, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2268, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4964, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6132, selfPID=820, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5252, selfPID=5252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2040, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5888, selfPID=4992, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... ContrController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4496, selfPID=5880, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5792, selfPID=4884, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5488, selfPID=4356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6056, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2316, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=4024, iMonCtr=1 Model crash detected, will try to restart... 18:40:14 (1120): No heartbeat from core client for 30 sec - exiting 18:40:15 (1120): No heartbeat from core client for 30 sec - exiting 18:40:16 (1120): No heartbeat from core client for 30 sec - exiting 18:40:17 (1120): No heartbeat from core client for 30 sec - exiting 18:40:18 (1120): No heartbeat from core client for 30 sec - exiting 18:40:19 (1120): No heartbeat from core client for 30 sec - exiting 18:40:20 (1120): No heartbeat from core client for 30 sec - exiting 18:40:21 (1120): No heartbeat from core client for 30 sec - exiting 18:40:22 (1120): No heartbeat from core client for 30 sec - exiting 18:40:23 (1120): No heartbeat from core client for 30 sec - exiting 18:40:24 (1120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2152, selfPID=4872, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2816, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5616, iMonCtr=2 CPDN Monitor - Quit request fRegional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4212, selfPID=4460, iMonCtr=1 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Atmos Restart file copy failed on atmos_restart.day Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2316, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3544, selfPID=5848, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4840, selfPID=4424, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3112, iMonCtr=2 Model crash detected, will try to restart... 18:13:11 (4116): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4048, selfPID=2440, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4424, iMonCtr=2 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2148, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4692, selfPID=2652, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3836, iMonCtr=2 Model crash detected, will try to restart... CCPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5472, selfPID=5472, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4612, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5940, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5960, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4128, selfPID=2340, iMonCtr=1 Model crash detected, will try to restart... 20:11:16 (156): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3812, iMonCtr=2 Model crash detected, will try to restart... 20:19:02 (4224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:37:04 (4408): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5652, selfPID=5652, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=940, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4576, selfPID=4268, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Feb 2012 08:26:13 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 138,342 | 679,604 | 4.9125 |
27 Feb 2012 13:07:25 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 138,336 | 678,796 | 4.9069 |
02 Jan 2012 13:20:57 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 126,816 | 623,161 | 4.9139 |
26 Dec 2011 19:43:16 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 115,296 | 564,854 | 4.8992 |
22 Dec 2011 10:24:02 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 103,776 | 506,469 | 4.8804 |
18 Dec 2011 14:13:36 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 92,256 | 452,158 | 4.9011 |
05 Dec 2011 19:09:08 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 80,736 | 395,245 | 4.8955 |
08 Nov 2011 18:52:55 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 69,216 | 339,051 | 4.8984 |
31 Oct 2011 17:32:47 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 57,703 | 283,397 | 4.9113 |
31 Oct 2011 17:15:32 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 57,696 | 282,547 | 4.8972 |
19 Sep 2011 15:47:57 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 46,176 | 226,369 | 4.9023 |
01 Sep 2011 15:51:31 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 34,656 | 170,472 | 4.9190 |
22 Aug 2011 17:11:49 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 23,136 | 114,433 | 4.9461 |
25 Jul 2011 19:43:04 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 11,624 | 58,168 | 5.0041 |
25 Jul 2011 19:39:52 | 953840 | 13030504 | hadam3p_eu_2tvr_1960_1_007305769_1 | 11,616 | 57,381 | 4.9398 |
©2024 cpdn.org