Name | hadam3p_pnw_bdbr_1991_1_008033091_1 |
Workunit | 8188205 |
Created | 8 Jul 2012, 20:22:27 UTC |
Sent | 8 Jul 2012, 20:22:37 UTC |
Report deadline | 21 Jun 2013, 1:42:37 UTC |
Received | 8 Sep 2012, 15:27:50 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1183189 |
Run time | 5 days 12 hours 22 min 29 sec |
CPU time | 5 days 4 hours 9 min 29 sec |
Validate state | Workunit error - check skipped |
Credit | 3,005.88 |
Device peak FLOPS | 2.31 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:20:01 (6808): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:18:53 (10148): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5272, selfPID=5156, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4664, selfPID=6080, iMonCtr=1 Model crash detected, will try to restart... 15:55:24 (5452): No heartbeat from core client for 30 sec - exiting 15:55:25 (5452): No heartbeat from core client for 30 sec - exiting 15:55:26 (5452): No heartbeat from core client for 30 sec - exiting 15:55:27 (5452): No heartbeat from core client for 30 sec - exiting 15:55:28 (5452): No heartbeat from core client for 30 sec - exiting 15:55:29 (5452): No heartbeat from core client for 30 sec - exiting 15:55:30 (5452): No heartbeat from core client for 30 sec - exiting 15:55:31 (5452): No heartbeat from core client for 30 sec - exiting 15:55:32 (5452): No heartbeat from core client for 30 sec - exiting 15:55:34 (5452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:55:35 (5452): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5432, selfPID=4784, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5392, selfPID=5048, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 22:26:23 (13352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:26:26 (13352): No heartbeat from core client for 30 sec - exiting 22:26:27 (13352): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=14452, selfPID=17236, iMonCtr=1 Model crash detected, will try to restart... 17:43:13 (8784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4260, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 17:10:04 (4016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=956, selfPID=956, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9176, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8552, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2168, selfPID=5584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7088, selfPID=3236, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=2 Model crash detected, will try to restart... 
Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3544, iMonCtr=2 07:46:52 (5716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=868, selfPID=5028, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4928, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4184, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5796, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4768, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... 21:05:27 (5004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=11968, selfPID=10400, iMonCtr=1 Model crash detected, will try to restart... 
20:26:00 (6064): No heartbeat from core client for 30 sec - exiting 20:26:01 (6064): No heartbeat from core client for 30 sec - exiting 20:26:02 (6064): No heartbeat from core client for 30 sec - exiting 20:26:03 (6064): No heartbeat from core client for 30 sec - exiting 20:26:04 (6064): No heartbeat from core client for 30 sec - exiting 20:26:05 (6064): No heartbeat from core client for 30 sec - exiting 20:26:07 (6064): No heartbeat from core client for 30 sec - exiting 20:26:08 (6064): No heartbeat from core client for 30 sec - exiting 20:26:09 (6064): No heartbeat from core client for 30 sec - exiting 20:26:10 (6064): No heartbeat from core client for 30 sec - exiting 20:26:11 (6064): No heartbeat from core client for 30 sec - exiting 20:26:12 (6064): No heartbeat from core client for 30 sec - exiting 20:26:13 (6064): No heartbeat from core client for 30 sec - exiting 20:26:14 (6064): No heartbeat from core client for 30 sec - exiting 20:26:15 (6064): No heartbeat from core client for 30 sec - exiting 20:26:16 (6064): No heartbeat from core client for 30 sec - exiting 20:26:17 (6064): No heartbeat from core client for 30 sec - exiting 20:26:19 (6064): No heartbeat from core client for 30 sec - exiting 20:26:20 (6064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:26:21 (6064): No heartbeat from core client for 30 sec - exiting 23:42:39 (2704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:41:37 (8432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12636, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Regional yearly means requires 12 input files got 8 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4784, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3936, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7792, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12256, iMonCtr=2 10:59:05 (5352): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:17:22 (3740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1060, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8596, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4896, iMonCtr=2 Model crash detected, will try to restart... 15:28:55 (5860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:27:47 (3100): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:13:26 (7620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GLeaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Sep 2012 15:28:18 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 138,336 | 446,152 | 3.2251 |
06 Sep 2012 17:30:40 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 126,816 | 410,793 | 3.2393 |
03 Sep 2012 00:24:19 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 115,296 | 374,443 | 3.2477 |
30 Aug 2012 01:04:07 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 103,776 | 337,382 | 3.2511 |
29 Aug 2012 00:48:27 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 92,256 | 300,826 | 3.2608 |
26 Aug 2012 00:03:30 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 80,736 | 263,928 | 3.2690 |
22 Aug 2012 00:34:23 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 69,219 | 225,969 | 3.2646 |
21 Aug 2012 01:53:18 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 69,216 | 225,381 | 3.2562 |
17 Aug 2012 14:00:55 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 57,696 | 188,421 | 3.2658 |
12 Aug 2012 19:29:51 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 46,176 | 150,730 | 3.2642 |
09 Aug 2012 02:10:35 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 34,656 | 111,895 | 3.2287 |
06 Aug 2012 23:58:10 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 23,136 | 74,922 | 3.2383 |
14 Jul 2012 21:29:40 | 1183189 | 14878974 | hadam3p_pnw_bdbr_1991_1_008033091_1 | 11,616 | 37,751 | 3.2499 |
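
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That relationship is an assumption inferred from the rows above, not something the page states. A minimal Python sketch that recomputes the value for two of the rows (the function name `seconds_per_timestep` is hypothetical):

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimals.

    Assumes the trickle table's Average column is simply
    cumulative CPU time divided by the timestep count.
    """
    if timesteps <= 0:
        raise ValueError("timesteps must be positive")
    return round(cpu_seconds / timesteps, 4)


# (timestep, CPU time in seconds) copied from the latest and earliest trickles
rows = [(138336, 446152), (11616, 37751)]

for timesteps, cpu_seconds in rows:
    print(timesteps, seconds_per_timestep(cpu_seconds, timesteps))
```

If the assumption holds, the printed values should match the Average column for those rows (3.2251 and 3.2499).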