Name | hadam3p_eu_2lqa_1965_1_007292556_0 |
Workunit | 7489830 |
Created | 14 Jun 2011, 21:14:13 UTC |
Sent | 14 Jun 2011, 21:14:18 UTC |
Report deadline | 27 May 2012, 2:34:18 UTC |
Received | 12 Aug 2011, 18:00:50 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1098643 |
Run time | 15 days 3 hours 42 min 14 sec |
CPU time | 11 days 23 hours 37 min 43 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 0.69 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2076, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5320, iMonCtr=2 Model crash detected, will try to restart... 18:34:19 (4544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1960, selfPID=1960, iMonCtr=2 18:34:20 (4544): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8916, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2160, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1292, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5084, selfPID=5688, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2016, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5892, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5900, selfPID=5308, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2964, selfPID=5208, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4428, selfPID=5620, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5668, selfPID=5456, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3548, selfPID=4972, iMonCtr=1 Model crash detected, will try to restart... CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5656, iMonCtr=2 ontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4776, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3572, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3428, selfPID=5360, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6052, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5056, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4976, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3368, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5412, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4540, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=2 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5280, selfPID=4608, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5468, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4932, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4440, selfPID=5232, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2480, selfPID=3176, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5588, selfPID=5336, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5684, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4920, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5448, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3308, selfPID=5260, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5460, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5296, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4416, selfPID=4632, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4744, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3300, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5720, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4192, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4636, iMonCtr=2 Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=900, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5864, iMonCtr=2 Model crash detected, will try to restart... GControWler:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=2 Model crash detected, will try to restart... orker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3044, iMonCtr=2 Leaving CPDN_Main::Monitor... 07:56:31 (4764): No heartbeat from core client for 30 sec - exiting 07:56:40 (4764): No heartbeat from core client for 30 sec - exiting 07:56:41 (4764): No heartbeat from core client for 30 sec - exiting 07:56:42 (4764): No heartbeat from core client for 30 sec - exiting 07:56:43 (4764): No heartbeat from core client for 30 sec - exiting 07:56:44 (4764): No heartbeat from core client for 30 sec - exiting 07:56:45 (4764): No heartbeat from core client for 30 sec - exiting 07:56:46 (4764): No heartbeat from core client for 30 sec - exiting 07:56:47 (4764): No heartbeat from core client for 30 sec - exiting 07:56:48 (4764): No heartbeat from core client for 30 sec - exiting 07:56:49 (4764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5456, selfPID=5816, iMonCtr=1 Model crash detected, will try to restart... 23:39:48 (5292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5104, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exitinControtler:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5648, iMonCtr=2 Model crash detected, will try to restart... Val = 1, checkPID=0, selfPID=4192, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1160, selfPID=5960, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1936, selfPID=6120, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4508, selfPID=5948, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1928, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
11 Aug 2011 17:57:53 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 138,336 | 1,034,582 | 7.4788 |
05 Aug 2011 19:18:35 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 126,816 | 957,539 | 7.5506 |
02 Aug 2011 18:12:05 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 115,296 | 852,294 | 7.3922 |
25 Jul 2011 22:21:51 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 103,776 | 757,061 | 7.2951 |
25 Jul 2011 19:47:50 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 92,256 | 662,041 | 7.1761 |
25 Jul 2011 16:31:00 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 80,736 | 568,005 | 7.0353 |
25 Jul 2011 14:47:44 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 69,242 | 476,863 | 6.8869 |
25 Jul 2011 14:45:36 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 69,231 | 475,528 | 6.8687 |
25 Jul 2011 14:43:01 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 69,227 | 474,833 | 6.8591 |
25 Jul 2011 14:43:01 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 69,218 | 473,628 | 6.8426 |
25 Jul 2011 14:43:01 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 69,216 | 472,919 | 6.8325 |
02 Jul 2011 22:14:15 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 57,696 | 377,515 | 6.5432 |
27 Jun 2011 21:14:39 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 46,176 | 295,704 | 6.4038 |
26 Jun 2011 05:00:08 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 34,656 | 198,774 | 5.7356 |
22 Jun 2011 13:01:34 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 23,136 | 122,967 | 5.3150 |
20 Jun 2011 00:01:17 | 1098643 | 12976803 | hadam3p_eu_2lqa_1965_1_007292556_0 | 11,616 | 40,238 | 3.4640 |
©2024 cpdn.org