Name | hadam3p_eu_972c_1965_1_008058270_0 |
Workunit | 8213384 |
Created | 17 Jul 2012, 23:34:07 UTC |
Sent | 17 Jul 2012, 23:36:08 UTC |
Report deadline | 30 Jun 2013, 4:56:08 UTC |
Received | 29 Jul 2012, 13:17:52 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1176562 |
Run time | 5 days 22 hours 18 min 30 sec |
CPU time | 5 days 3 hours 3 min 54 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 2.67 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version>
<![CDATA[
<stderr_txt>
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8144, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7580, selfPID=8180, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4396, iMonCtr=2
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3636, selfPID=4272, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4748, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1888, selfPID=6092, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5676, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2672, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1896, selfPID=2740, iMonCtr=1
Model crash detected, will try to restart...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4756, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6000, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=396, iMonCtr=2
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4900, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4960, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5644, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4564, selfPID=4512, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4700, iMonCtr=2
Model crash detected, will try to restart...
14:36:24 (3240): No heartbeat from core client for 30 sec - exiting
14:36:25 (3240): No heartbeat from core client for 30 sec - exiting
14:36:26 (3240): No heartbeat from core client for 30 sec - exiting
14:36:27 (3240): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
GSuspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6488, selfPID=4724, iMonCtr=1
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3364, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=520, selfPID=5712, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4724, selfPID=5992, iMonCtr=1
Model crash detected, will try to restart...
GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5764, iMonCtr=2
Model crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
19:05:45 (5688): No heartbeat from core client for 30 sec - exiting
19:05:46 (5688): No heartbeat from core client for 30 sec - exiting
19:05:47 (5688): No heartbeat from core client for 30 sec - exiting
19:05:48 (5688): No heartbeat from core client for 30 sec - exiting
19:05:49 (5688): No heartbeat from core client for 30 sec - exiting
19:05:50 (5688): No heartbeat from core client for 30 sec - exiting
19:05:51 (5688): No heartbeat from core client for 30 sec - exiting
19:05:52 (5688): No heartbeat from core client for 30 sec - exiting
19:05:53 (5688): No heartbeat from core client for 30 sec - exiting
19:05:54 (5688): No heartbeat from core client for 30 sec - exiting
19:06:26 (5688): No heartbeat from core client for 30 sec - exiting
19:06:28 (5688): No heartbeat from core client for 30 sec - exiting
19:06:29 (5688): No heartbeat from core client for 30 sec - exiting
19:06:30 (5688): No heartbeat from core client for 30 sec - exiting
19:06:31 (5688): No heartbeat from core client for 30 sec - exiting
19:06:32 (5688): No heartbeat from core client for 30 sec - exiting
19:06:33 (5688): No heartbeat from core client for 30 sec - exiting
19:06:34 (5688): No heartbeat from core client for 30 sec - exiting
19:06:35 (5688): No heartbeat from core client for 30 sec - exiting
19:06:36 (5688): No heartbeat from core client for 30 sec - exiting
19:06:37 (5688): No heartbeat from core client for 30 sec - exiting
19:06:38 (5688): No heartbeat from core client for 30 sec - exiting
19:06:39 (5688): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:07:33 (4432): Can't acquire lockfile (32) - waiting 35s
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5292, selfPID=4432, iMonCtr=1
Model crash detected, will try to restart...
Glontroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4200, iMonCtr=2
Model crash detected, will try to restart...
obal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=2
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2996, iMonCtr=2
Mode l crash detected, will try to restart...
Leaving CPDN_Main::Monitor...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=2
Model crash detected, will try to restart...
04:20:30 (5276): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Leaving CPDN_Main::Monitor...
Called boinc_finish
</stderr_txt>
]]> |
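The stderr text above is mostly a handful of messages repeated many times, so a tally can be easier to read than the raw log. The following is a minimal sketch, assuming the log has been saved as a plain Python string. The function name `tally_stderr` and the category labels are hypothetical and not part of BOINC or CPDN; only the message substrings are taken from the log itself.

```python
from collections import Counter

# Substrings copied from the stderr text; the dictionary keys are
# illustrative labels chosen for this sketch, not official names.
PATTERNS = {
    "model_restart": "Model crash detected, will try to restart...",
    "no_heartbeat": "No heartbeat from core client for 30 sec - exiting",
    "suspend_request": "Suspend request from BOINC...",
    "lockfile_wait": "Can't acquire lockfile",
}


def tally_stderr(text: str) -> Counter:
    """Count non-overlapping occurrences of each known message in the log text."""
    return Counter({label: text.count(needle) for label, needle in PATTERNS.items()})


if __name__ == "__main__":
    # A short excerpt in the same shape as the log above.
    sample = (
        "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
        "checkPID=7580, selfPID=8180, iMonCtr=1 "
        "Model crash detected, will try to restart... "
        "Suspended CPDN Monitor - Suspend request from BOINC... "
        "14:36:24 (3240): No heartbeat from core client for 30 sec - exiting"
    )
    for label, count in tally_stderr(sample).items():
        print(label, count)
```

Because matching is by exact substring, messages that were garbled by interleaved writes (for example `Mode l crash detected`) are not counted, so the totals are a lower bound.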
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 Jul 2012 12:19:55 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 138,336 | 442,252 | 3.1969 |
28 Jul 2012 12:54:51 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 126,816 | 402,788 | 3.1762 |
27 Jul 2012 16:40:47 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 115,296 | 365,375 | 3.1690 |
26 Jul 2012 17:38:24 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 103,776 | 328,270 | 3.1633 |
25 Jul 2012 20:47:45 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 92,256 | 291,039 | 3.1547 |
25 Jul 2012 08:12:06 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 80,740 | 255,081 | 3.1593 |
25 Jul 2012 00:15:26 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 80,736 | 254,636 | 3.1539 |
24 Jul 2012 12:04:50 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 69,216 | 217,931 | 3.1486 |
22 Jul 2012 12:02:35 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 57,696 | 180,643 | 3.1309 |
21 Jul 2012 09:16:24 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 46,176 | 145,498 | 3.1509 |
20 Jul 2012 14:53:10 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 34,656 | 109,415 | 3.1572 |
19 Jul 2012 15:42:33 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 23,136 | 73,474 | 3.1757 |
18 Jul 2012 19:11:28 | 1176562 | 14931831 | hadam3p_eu_972c_1965_1_008058270_0 | 11,616 | 37,988 | 3.2703 |
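The figures in the Average (sec/TS) column match the cumulative CPU Time (sec) divided by the cumulative Timestep count on the same row. That relationship is inferred from the numbers, not stated on the page. A minimal sketch that reproduces three of the rows; the helper name `seconds_per_timestep` is hypothetical:

```python
# (timestep, cpu_seconds) pairs copied from three rows of the trickle table.
ROWS = [
    (138_336, 442_252),
    (126_816, 402_788),
    (11_616, 37_988),
]


def seconds_per_timestep(timestep: int, cpu_seconds: float) -> float:
    """Return cumulative CPU seconds divided by cumulative timesteps."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_seconds / timestep


if __name__ == "__main__":
    for ts, cpu in ROWS:
        print(f"{ts:>7,} timesteps: {seconds_per_timestep(ts, cpu):.4f} sec/TS")
```

Rounded to four decimal places, these rows give 3.1969, 3.1762 and 3.2703, the same values listed in the table.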