Name | hadam3p_eu_2rh6_1989_1_007169106_0 |
Workunit | 7353946 |
Created | 18 Feb 2011, 18:45:05 UTC |
Sent | 20 Feb 2011, 9:26:25 UTC |
Report deadline | 2 Feb 2012, 14:46:25 UTC |
Received | 20 Apr 2011, 17:48:02 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1131863 |
Run time | 6 days 3 hours 38 min 9 sec |
CPU time | 5 days 20 hours 12 min 35 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 2.53 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.08 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2412, selfPID=1628, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2304, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2708, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3220, selfPID=3304, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3188, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2556, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2384, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3636, iMonCtr=2 Leaving CPDN_Main::Monitor... Coltroller :: kerDN:p: CPDN procot running, exiting, bRetVal = RetVa1, checkPID=0, self selfPID=4024, iMo2 Ctr=2 crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1248, selfPID=904, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2612, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
07:20:41 (1588): No heartbeat from core client for 30 sec - exiting 07:20:42 (1588): No heartbeat from core client for 30 sec - exiting 07:20:44 (1588): No heartbeat from core client for 30 sec - exiting 07:20:45 (1588): No heartbeat from core client for 30 sec - exiting 07:20:46 (1588): No heartbeat from core client for 30 sec - exiting 07:20:47 (1588): No heartbeat from core client for 30 sec - exiting 07:20:48 (1588): No heartbeat from core client for 30 sec - exiting 07:20:49 (1588): No heartbeat from core client for 30 sec - exiting 07:20:50 (1588): No heartbeat from core client for 30 sec - exiting 07:20:51 (1588): No heartbeat from core client for 30 sec - exiting 07:20:52 (1588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2112, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=164, iMonCtr=2 Leaving CPDN_Main::Monitor... zip error: Could not create output file (was replacing the original zip file) Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2844, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2912, selfPID=3100, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2076, selfPID=212, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2868, selfPID=2880, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3684, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1484, selfPID=3084, iMonCtr=1 Model crash detected, will try to restart... 23:09:40 (880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:09:42 (880): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=520, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1116, selfPID=3940, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:42:55 (2236): No heartbeat from core client for 30 sec - exiting 21:42:56 (2236): No heartbeat from core client for 30 sec - exiting 21:42:57 (2236): No heartbeat from core client for 30 sec - exiting 21:42:58 (2236): No heartbeat from core client for 30 sec - exiting 21:42:59 (2236): No heartbeat from core client for 30 sec - exiting 21:43:00 (2236): No heartbeat from core client for 30 sec - exiting 21:43:01 (2236): No heartbeat from core client for 30 sec - exiting 21:43:03 (2236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=720, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1848, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2968, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=296, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1980, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2332, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2504, selfPID=2700, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... ContbalrWollerr::: CDN Nproccess iss not running, exxitingg, bReVtVa=l , chec= 1, checklPID==0,4 s elofPIrD=7 Mod il crCtrh=2e ected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2912, selfPID=988, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:35:41 (3504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3916, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2792, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2616, selfPID=3212, iMonCtr=1 Model crash detected, will try to restart... 18:55:28 (3240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3348, selfPID=3248, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3120, selfPID=2344, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 21:47:04 (2204): called boinc_finish </stderr_txt> ]]> |
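The stderr text above repeats a handful of message types (model restarts, missed heartbeats, zip errors, suspend and quit requests). A minimal sketch for tallying them follows, assuming the stderr text has been saved into a Python string. The category names and the `tally_events` helper are illustrative and not part of BOINC or CPDN. Lines that were garbled by interleaved output in the log will not match these patterns, so the counts are a lower bound.

```python
import re
from collections import Counter

# Message fragments copied from the stderr text; the keys are hypothetical labels.
PATTERNS = {
    "model_restart": r"Model crash detected, will try to restart",
    "no_heartbeat": r"No heartbeat from core client",
    "zip_error": r"zip error:",
    "suspend_request": r"Suspend request from BOINC",
    "quit_request": r"Quit request from BOINC",
}

def tally_events(stderr_text):
    """Count non-overlapping occurrences of each known message fragment."""
    counts = Counter()
    for name, pattern in PATTERNS.items():
        counts[name] = len(re.findall(pattern, stderr_text))
    return counts

# A short excerpt assembled from messages that appear in the log above.
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=2412, selfPID=1628, iMonCtr=1 "
    "Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... "
    "07:20:41 (1588): No heartbeat from core client for 30 sec - exiting "
    "07:20:42 (1588): No heartbeat from core client for 30 sec - exiting "
    "zip error: Could not create output file (was replacing the original zip file)"
)

for name, count in tally_events(sample).items():
    print(f"{name}: {count}")
```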
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 Apr 2011 19:07:44 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 138,336 | 503,945 | 3.6429 |
10 Apr 2011 07:35:29 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 126,816 | 462,853 | 3.6498 |
05 Apr 2011 19:04:46 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 115,296 | 423,301 | 3.6714 |
02 Apr 2011 15:50:53 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 103,780 | 383,683 | 3.6971 |
02 Apr 2011 11:06:37 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 103,776 | 383,076 | 3.6914 |
29 Mar 2011 06:15:49 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 92,256 | 342,371 | 3.7111 |
28 Mar 2011 18:26:08 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 80,736 | 301,583 | 3.7354 |
24 Mar 2011 11:05:39 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 69,216 | 261,238 | 3.7742 |
24 Mar 2011 00:03:35 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 57,696 | 221,435 | 3.8380 |
10 Mar 2011 02:06:48 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 46,176 | 181,255 | 3.9253 |
08 Mar 2011 22:02:31 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 34,656 | 137,767 | 3.9753 |
08 Mar 2011 22:02:31 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 23,137 | 92,989 | 4.0191 |
08 Mar 2011 22:02:31 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 23,136 | 92,329 | 3.9907 |
22 Feb 2011 09:18:12 | 1131863 | 12601305 | hadam3p_eu_2rh6_1989_1_007169106_0 | 11,616 | 46,707 | 4.0209 |
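
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is inferred from the figures in the table, not stated on the page. A minimal sketch that checks it against three of the rows; the helper name `average_sec_per_timestep` is illustrative:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average)
trickles = [
    (138336, 503945, 3.6429),
    (126816, 462853, 3.6498),
    (11616, 46707, 4.0209),
]

def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep (assumed definition)."""
    return cpu_time_sec / timestep

for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep {timestep:>7}: computed {computed:.4f}, reported {reported:.4f}")
```

The falling average across the table (about 4.02 down to 3.64 sec/TS) is a cumulative figure, so it smooths over any slower or faster stretches between individual trickles.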
©2024 cpdn.org