Name | hadam3p_eu_2rqe_1985_1_007229817_0 |
Workunit | 7428057 |
Created | 28 Apr 2011, 16:48:13 UTC |
Sent | 2 May 2011, 16:02:06 UTC |
Report deadline | 13 Apr 2012, 21:22:06 UTC |
Received | 20 Jun 2011, 14:25:39 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1143281 |
Run time | 7 days 19 hours 20 min 1 sec |
CPU time | 6 days 15 hours 47 min 22 sec |
Validate state | Workunit error - check skipped |
Credit | 2,386.39 |
Device peak FLOPS | 2.53 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>6.10.60</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4444, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:03:50 (5556): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5728, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5124, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6124, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3980, selfPID=1032, iMonCtr=1 Model crash detected, will try to restart... GlControllCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3096, selfPID=3096, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5704, selfPID=5320, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
10:20:10 (5088): No heartbeat from core client for 30 sec - exiting 10:20:11 (5088): No heartbeat from core client for 30 sec - exiting 10:20:12 (5088): No heartbeat from core client for 30 sec - exiting 10:20:13 (5088): No heartbeat from core client for 30 sec - exiting 10:20:14 (5088): No heartbeat from core client for 30 sec - exiting 10:20:15 (5088): No heartbeat from core client for 30 sec - exiting 10:20:16 (5088): No heartbeat from core client for 30 sec - exiting 10:20:17 (5088): No heartbeat from core client for 30 sec - exiting 10:20:18 (5088): No heartbeat from core client for 30 sec - exiting 10:20:19 (5088): No heartbeat from core client for 30 sec - exiting 10:20:20 (5088): No heartbeat from core client for 30 sec - exiting 10:20:21 (5088): No heartbeat from core client for 30 sec - exiting 10:20:22 (5088): No heartbeat from core client for 30 sec - exiting 10:20:23 (5088): No heartbeat from core client for 30 sec - exiting 10:20:24 (5088): No heartbeat from core client for 30 sec - exiting 10:21:58 (5088): No heartbeat from core client for 30 sec - exiting 10:21:59 (5088): No heartbeat from core client for 30 sec - exiting 10:22:00 (5088): No heartbeat from core client for 30 sec - exiting 10:22:01 (5088): No heartbeat from core client for 30 sec - exiting 10:22:02 (5088): No heartbeat from core client for 30 sec - exiting 10:22:03 (5088): No heartbeat from core client for 30 sec - exiting 10:22:04 (5088): No heartbeat from core client for 30 sec - exiting 10:22:05 (5088): No heartbeat from core client for 30 sec - exiting 10:22:06 (5088): No heartbeat from core client for 30 sec - exiting 10:22:07 (5088): No heartbeat from core client for 30 sec - exiting 10:22:08 (5088): No heartbeat from core client for 30 sec - exiting 10:22:09 (5088): No heartbeat from core client for 30 sec - exiting 10:22:10 (5088): No heartbeat from core client for 30 sec - exiting 10:22:11 (5088): No heartbeat from core client for 30 sec - exiting 10:22:12 (5088): No 
heartbeat from core client for 30 sec - exiting 10:22:13 (5088): No heartbeat from core client for 30 sec - exiting 10:22:14 (5088): No heartbeat from core client for 30 sec - exiting 10:22:15 (5088): No heartbeat from core client for 30 sec - exiting 10:22:16 (5088): No heartbeat from core client for 30 sec - exiting 10:22:17 (5088): No heartbeat from core client for 30 sec - exiting 10:22:18 (5088): No heartbeat from core client for 30 sec - exiting 10:22:19 (5088): No heartbeat from core client for 30 sec - exiting 10:22:20 (5088): No heartbeat from core client for 30 sec - exiting 10:22:21 (5088): No heartbeat from core client for 30 sec - exiting 10:22:22 (5088): No heartbeat from core client for 30 sec - exiting 10:22:23 (5088): No heartbeat from core client for 30 sec - exiting 10:22:24 (5088): No heartbeat from core client for 30 sec - exiting 10:23:46 (5088): No heartbeat from core client for 30 sec - exiting 10:23:47 (5088): No heartbeat from core client for 30 sec - exiting 10:23:48 (5088): No heartbeat from core client for 30 sec - exiting 10:23:49 (5088): No heartbeat from core client for 30 sec - exiting 10:23:50 (5088): No heartbeat from core client for 30 sec - exiting 10:23:51 (5088): No heartbeat from core client for 30 sec - exiting 10:23:52 (5088): No heartbeat from core client for 30 sec - exiting 10:23:54 (5088): No heartbeat from core client for 30 sec - exiting 10:23:55 (5088): No heartbeat from core client for 30 sec - exiting 10:23:56 (5088): No heartbeat from core client for 30 sec - exiting 10:23:57 (5088): No heartbeat from core client for 30 sec - exiting 10:23:58 (5088): No heartbeat from core client for 30 sec - exiting 10:23:59 (5088): No heartbeat from core client for 30 sec - exiting 10:24:00 (5088): No heartbeat from core client for 30 sec - exiting 10:24:01 (5088): No heartbeat from core client for 30 sec - exiting 10:24:02 (5088): No heartbeat from core client for 30 sec - exiting 10:24:03 (5088): No heartbeat from core client 
for 30 sec - exiting 10:24:04 (5088): No heartbeat from core client for 30 sec - exiting 10:24:05 (5088): No heartbeat from core client for 30 sec - exiting 10:24:06 (5088): No heartbeat from core client for 30 sec - exiting 10:24:07 (5088): No heartbeat from core client for 30 sec - exiting 10:24:08 (5088): No heartbeat from core client for 30 sec - exiting 10:24:09 (5088): No heartbeat from core client for 30 sec - exiting 10:24:10 (5088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5804, selfPID=3600, iMonCtr=1 Model crash detected, will try to restart... 10:43:04 (5268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:44:09 (4684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2944, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4168, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3900, iMonCtr=2 Model crash detected, will try to restart... 10:10:19 (5996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3008, selfPID=5560, iMonCtr=1 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4116, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7948, selfPID=7948, iMonCtr=2 CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5400, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=968, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5296, selfPID=4724, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5348, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6032, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6116, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CGntroller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5844, iMonCtr=2 Model crash detected, will try to restart... lobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5664, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5244, selfPID=4896, iMonCtr=1 Model crash detected, will try to restart... 08:20:54 (4840): No heartbeat from core client for 30 sec - exiting 08:20:55 (4840): No heartbeat from core client for 30 sec - exiting 08:20:56 (4840): No heartbeat from core client for 30 sec - exiting 08:20:57 (4840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2476, iMonCtr=2 Model crash detected, will try to restart... 08:17:22 (4956): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5516, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2204, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 09:16:45 (5024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5736, selfPID=5736, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4620, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
15:52:18 (4900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... zip error: Could not create output file (was replacing the original zip file) CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6088, selfPID=6088, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3636, selfPID=3636, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5212, iMonCtr= 2 del crash detected, will try to restart... Leaving CPDN_Main::Monitor... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4836, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 13:24:41 (4864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
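The stderr text above is one long run of a few recurring messages. A minimal sketch for tallying them is given here, assuming the log is available as a plain string. The category labels and the function name `tally_stderr` are illustrative and not part of BOINC or CPDN; the substrings are copied from the log itself, and the sample is the opening of that log.

```python
from collections import Counter

# Literal substrings copied from the stderr text above.
# The dictionary keys are hypothetical labels chosen for this sketch.
MESSAGES = {
    "model_crash": "Model crash detected, will try to restart",
    "no_heartbeat": "No heartbeat from core client for 30 sec - exiting",
    "quit_request": "CPDN Monitor - Quit request from BOINC",
    "suspend_request": "CPDN Monitor - Suspend request from BOINC",
    "process_not_running": "CPDN process is not running, exiting",
}


def tally_stderr(text: str) -> Counter:
    """Count non-overlapping occurrences of each known message in the log text."""
    counts = Counter()
    for label, needle in MESSAGES.items():
        counts[label] = text.count(needle)
    return counts


# The first few messages of the stderr text above, verbatim.
sample = (
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, "
    "checkPID=0, selfPID=4444, iMonCtr=2 "
    "Model crash detected, will try to restart... "
    "CPDN Monitor - Quit request from BOINC... "
    "CPDN Monitor - Quit request from BOINC... "
    "10:03:50 (5556): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
)

if __name__ == "__main__":
    for label, count in tally_stderr(sample).items():
        print(f"{label}: {count}")
```

Because several processes appear to write to stderr at the same time, some messages in the log are interleaved (for example `GlControllCPDN Monitor`), so exact substring matching of this kind will likely undercount slightly.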
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 Jun 2011 14:28:07 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 138,336 | 574,415 | 4.1523 |
15 Jun 2011 15:50:33 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 126,816 | 528,366 | 4.1664 |
13 Jun 2011 14:46:38 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 115,300 | 481,790 | 4.1786 |
11 Jun 2011 22:43:49 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 115,296 | 480,969 | 4.1716 |
09 Jun 2011 21:27:15 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 103,776 | 430,990 | 4.1531 |
07 Jun 2011 19:39:25 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 92,256 | 382,042 | 4.1411 |
03 Jun 2011 23:42:00 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 80,736 | 333,690 | 4.1331 |
02 Jun 2011 16:13:06 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 69,216 | 283,100 | 4.0901 |
30 May 2011 17:07:55 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 57,696 | 235,843 | 4.0877 |
26 May 2011 20:04:05 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 46,176 | 186,641 | 4.0419 |
23 May 2011 22:54:47 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 34,656 | 140,850 | 4.0642 |
21 May 2011 17:13:09 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 23,136 | 94,726 | 4.0943 |
14 May 2011 19:58:58 | 1143281 | 12839608 | hadam3p_eu_2rqe_1985_1_007229817_0 | 11,616 | 48,232 | 4.1522 |
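The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table above. The function name `seconds_per_timestep` is illustrative, and whether the site rounds or truncates to four decimals is an assumption, so the comparison uses a small tolerance.

```python
# (timestep, cpu_seconds, reported_average) copied from the three most
# recent trickle rows in the table above.
TRICKLES = [
    (138_336, 574_415, 4.1523),
    (126_816, 528_366, 4.1664),
    (115_300, 481_790, 4.1786),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Average CPU seconds per model timestep (hypothetical helper)."""
    return cpu_seconds / timestep


def matches_reported(cpu_seconds: int, timestep: int, reported: float,
                     tolerance: float = 1e-4) -> bool:
    """True if the computed average agrees with the reported value
    to within the tolerance (the page shows four decimal places)."""
    return abs(seconds_per_timestep(cpu_seconds, timestep) - reported) < tolerance


if __name__ == "__main__":
    for timestep, cpu_seconds, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_seconds, timestep)
        ok = matches_reported(cpu_seconds, timestep, reported)
        print(f"TS={timestep}: computed={computed:.4f} reported={reported} match={ok}")
```

The same check can be run on the remaining rows by adding them to `TRICKLES`.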
©2024 cpdn.org