Name | hadam3p_eu_2jn1_1965_1_007371759_0 |
Workunit | 7569189 |
Created | 29 Jul 2011, 13:21:19 UTC |
Sent | 29 Jul 2011, 13:21:32 UTC |
Report deadline | 10 Jul 2012, 18:41:32 UTC |
Received | 24 Aug 2011, 23:31:56 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1038991 |
Run time | 6 days 9 hours 17 min |
CPU time | 5 days 6 hours 28 min 24 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 2.14 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3196, selfPID=4316, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4640, selfPID=4388, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3708, selfPID=4668, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4804, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3064, selfPID=4100, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3456, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5968, selfPID=224, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3340, selfPID=4056, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1040, selfPID=4936, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1492, selfPID=4924, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4720, selfPID=4720, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2784, selfPID=4736, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1164, selfPID=2520, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=712, selfPID=4204, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=816, selfPID=4368, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3408, selfPID=4184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4792, selfPID=4524, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5784, selfPID=4764, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 
19:38:45 (4848): No heartbeat from core client for 30 sec - exiting 19:38:47 (4848): No heartbeat from core client for 30 sec - exiting 19:38:48 (4848): No heartbeat from core client for 30 sec - exiting 19:38:49 (4848): No heartbeat from core client for 30 sec - exiting 19:38:50 (4848): No heartbeat from core client for 30 sec - exiting 19:38:51 (4848): No heartbeat from core client for 30 sec - exiting 19:38:52 (4848): No heartbeat from core client for 30 sec - exiting 19:38:53 (4848): No heartbeat from core client for 30 sec - exiting 19:38:54 (4848): No heartbeat from core client for 30 sec - exiting 19:38:55 (4848): No heartbeat from core client for 30 sec - exiting 19:38:56 (4848): No heartbeat from core client for 30 sec - exiting 19:38:57 (4848): No heartbeat from core client for 30 sec - exiting 19:38:58 (4848): No heartbeat from core client for 30 sec - exiting 19:38:59 (4848): No heartbeat from core client for 30 sec - exiting 19:39:00 (4848): No heartbeat from core client for 30 sec - exiting 19:39:01 (4848): No heartbeat from core client for 30 sec - exiting 19:39:02 (4848): No heartbeat from core client for 30 sec - exiting 19:39:03 (4848): No heartbeat from core client for 30 sec - exiting 19:39:04 (4848): No heartbeat from core client for 30 sec - exiting 19:39:05 (4848): No heartbeat from core client for 30 sec - exiting 19:39:06 (4848): No heartbeat from core client for 30 sec - exiting 19:39:07 (4848): No heartbeat from core client for 30 sec - exiting 19:39:08 (4848): No heartbeat from core client for 30 sec - exiting 19:39:09 (4848): No heartbeat from core client for 30 sec - exiting 19:39:10 (4848): No heartbeat from core client for 30 sec - exiting 19:39:11 (4848): No heartbeat from core client for 30 sec - exiting 19:39:12 (4848): No heartbeat from core client for 30 sec - exiting 19:39:13 (4848): No heartbeat from core client for 30 sec - exiting 19:39:14 (4848): No heartbeat from core client for 30 sec - exiting 
19:39:15 (4848): No heartbeat from core client for 30 sec - exiting 19:39:16 (4848): No heartbeat from core client for 30 sec - exiting 19:39:17 (4848): No heartbeat from core client for 30 sec - exiting 19:39:18 (4848): No heartbeat from core client for 30 sec - exiting 19:39:19 (4848): No heartbeat from core client for 30 sec - exiting 19:39:20 (4848): No heartbeat from core client for 30 sec - exiting 19:39:21 (4848): No heartbeat from core client for 30 sec - exiting 19:39:23 (4848): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4912, selfPID=4912, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3156, selfPID=3156, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4404, selfPID=4404, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5788, selfPID=5788, iMonCtr=2 01:16:24 (3284): No heartbeat from core client for 30 sec - exiting 01:16:25 (3284): No heartbeat from core client for 30 sec - exiting 01:16:26 (3284): No heartbeat from core client for 30 sec - exiting 01:16:27 (3284): No heartbeat from core client for 30 sec - exiting 01:16:28 (3284): No heartbeat from core client for 30 sec - exiting 01:16:29 (3284): No heartbeat from core client for 30 sec - exiting 01:16:31 (3284): No heartbeat from core client for 30 sec - exiting 01:16:32 (3284): No heartbeat from core client for 30 sec - exiting 01:16:33 (3284): No heartbeat from core client for 30 sec - exiting 01:16:34 (3284): No heartbeat from core client for 30 sec - exiting 01:16:35 (3284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2164, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3704, selfPID=3704, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5032, selfPID=4928, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=5212, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4220, selfPID=4780, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2616, selfPID=2616, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2436, selfPID=4180, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1776, selfPID=4964, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
24 Aug 2011 01:21:26 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 138,336 | 454,579 | 3.2860 |
22 Aug 2011 03:32:26 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 126,816 | 417,671 | 3.2935 |
19 Aug 2011 09:45:18 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 115,296 | 380,715 | 3.3021 |
17 Aug 2011 21:51:49 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 103,780 | 344,092 | 3.3156 |
17 Aug 2011 09:34:31 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 103,776 | 343,595 | 3.3109 |
15 Aug 2011 00:28:49 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 92,256 | 306,697 | 3.3244 |
10 Aug 2011 04:12:13 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 80,736 | 270,086 | 3.3453 |
08 Aug 2011 06:54:06 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 69,216 | 231,554 | 3.3454 |
07 Aug 2011 07:40:37 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 57,696 | 194,368 | 3.3688 |
06 Aug 2011 07:48:33 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 46,177 | 157,372 | 3.4080 |
05 Aug 2011 22:49:38 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 46,176 | 156,878 | 3.3974 |
04 Aug 2011 06:56:04 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 34,656 | 118,653 | 3.4237 |
02 Aug 2011 04:34:19 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 23,136 | 79,584 | 3.4398 |
31 Jul 2011 06:40:09 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 11,617 | 39,906 | 3.4351 |
30 Jul 2011 09:29:05 | 1038991 | 13165418 | hadam3p_eu_2jn1_1965_1_007371759_0 | 11,616 | 39,306 | 3.3838 |
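
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle. That relationship is an inference from the numbers in the table, not something the page states. The sketch below checks it against three rows copied from the table; the function name `average_sec_per_timestep` and the `TRICKLES` list are hypothetical names used only for illustration.

```python
# Rows copied from the trickle table:
# (timestep, cumulative CPU time in seconds, reported average in sec/TS)
TRICKLES = [
    (138_336, 454_579, 3.2860),
    (126_816, 417_671, 3.2935),
    (11_616, 39_306, 3.3838),
]


def average_sec_per_timestep(cpu_seconds, timestep):
    """Assumed definition: cumulative CPU seconds per completed timestep."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in TRICKLES:
    computed = average_sec_per_timestep(cpu_seconds, timestep)
    # The page shows four decimal places, so compare within that precision.
    matches = abs(computed - reported) < 1e-4
    print(f"TS={timestep:>7,}  computed={computed:.4f}  reported={reported:.4f}  match={matches}")
```

Under this assumption, the falling average (from about 3.44 to 3.29 sec/TS over the run) would mean the later timesteps completed slightly faster than the early ones.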
©2024 cpdn.org