Name | hadam3p_pnw_32ag_1997_1_008236917_0 |
Workunit | 8392041 |
Created | 24 Oct 2012, 8:28:05 UTC |
Sent | 24 Oct 2012, 8:28:07 UTC |
Report deadline | 6 Oct 2013, 13:48:07 UTC |
Received | 14 Nov 2012, 15:20:07 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1206904 |
Run time | 8 days 22 hours 13 min 2 sec |
CPU time | 6 days 4 hours 49 min 23 sec |
Validate state | Workunit error - check skipped |
Credit | 3,005.88 |
Device peak FLOPS | 2.39 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1576, selfPID=7212, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2712, selfPID=7052, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4356, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8188, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6372, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7848, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4856, iMonCtr=2 Model crash detected, will try to restart... CGntroller:: CPDN process is not running, exiting, bRetVal =Val = 1, checkPID=0, selfPID=972, iMonCtr =2 el crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7468, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4984, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3704, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 6 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4576, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 16:41:43 (2876): No heartbeat from core client for 30 sec - exiting 16:41:44 (2876): No heartbeat from core client for 30 sec - exiting 16:41:45 (2876): No heartbeat from core client for 30 sec - exiting 16:41:46 (2876): No heartbeat from core client for 30 sec - exiting 16:41:47 (2876): No heartbeat from core client for 30 sec - exiting 16:41:48 (2876): No heartbeat from core client for 30 sec - exiting 16:41:49 (2876): No heartbeat from core client for 30 sec - exiting 16:41:50 (2876): No heartbeat from core client for 30 sec - exiting 16:41:51 (2876): No heartbeat from core client for 30 sec - exiting 16:41:52 (2876): No heartbeat from core client for 30 sec - exiting 16:41:53 (2876): No heartbeat from core client for 30 sec - exiting 16:41:54 (2876): No heartbeat from core client for 30 sec - exiting 16:41:55 (2876): No heartbeat from core client for 30 sec - exiting 16:41:56 (2876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1264, iMonCtr=2 Model crash detected, will try to restart... 14:11:15 (5860): No heartbeat from core client for 30 sec - exiting 14:11:17 (5860): No heartbeat from core client for 30 sec - exiting 14:11:18 (5860): No heartbeat from core client for 30 sec - exiting 14:11:19 (5860): No heartbeat from core client for 30 sec - exiting 14:11:20 (5860): No heartbeat from core client for 30 sec - exiting 14:11:21 (5860): No heartbeat from core client for 30 sec - exiting 14:11:22 (5860): No heartbeat from core client for 30 sec - exiting 14:11:23 (5860): No heartbeat from core client for 30 sec - exiting 14:11:24 (5860): No heartbeat from core client for 30 sec - exiting 14:11:25 (5860): No heartbeat from core client for 30 sec - exiting 14:11:26 (5860): No heartbeat from core client for 30 sec - exiting 14:11:27 (5860): No heartbeat from core client for 30 sec - exiting 14:11:29 (5860): No heartbeat from core client for 30 sec - exiting 14:11:30 (5860): No heartbeat from core client for 30 sec - exiting 14:11:31 (5860): No heartbeat from core client for 30 sec - exiting 14:11:32 (5860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2008, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4944, selfPID=5940, iMonCtr=1 Model crash detected, will try to restart... 11:23:54 (5676): No heartbeat from core client for 30 sec - exiting 11:23:55 (5676): No heartbeat from core client for 30 sec - exiting 11:23:56 (5676): No heartbeat from core client for 30 sec - exiting 11:23:57 (5676): No heartbeat from core client for 30 sec - exiting 11:23:58 (5676): No heartbeat from core client for 30 sec - exiting 11:23:59 (5676): No heartbeat from core client for 30 sec - exiting 11:24:00 (5676): No heartbeat from core client for 30 sec - exiting 11:24:01 (5676): No heartbeat from core client for 30 sec - exiting 11:24:03 (5676): No heartbeat from core client for 30 sec - exiting 11:24:04 (5676): No heartbeat from core client for 30 sec - exiting 11:24:05 (5676): No heartbeat from core client for 30 sec - exiting 11:24:06 (5676): No heartbeat from core client for 30 sec - exiting 11:24:07 (5676): No heartbeat from core client for 30 sec - exiting 11:24:08 (5676): No heartbeat from core client for 30 sec - exiting 11:24:09 (5676): No heartbeat from core client for 30 sec - exiting 11:24:10 (5676): No heartbeat from core client for 30 sec - exiting 11:24:11 (5676): No heartbeat from core client for 30 sec - exiting 11:24:12 (5676): No heartbeat from core client for 30 sec - exiting 11:24:13 (5676): No heartbeat from core client for 30 sec - exiting 11:24:15 (5676): No heartbeat from core client for 30 sec - exiting 11:24:16 (5676): No heartbeat from core client for 30 sec - exiting 11:24:17 (5676): No heartbeat from core client for 30 sec - exiting 11:24:18 (5676): No heartbeat from core client for 30 sec - exiting 11:24:19 (5676): No heartbeat from core client for 30 sec - exiting 11:24:20 (5676): No heartbeat from core client for 30 sec - exiting 11:24:21 (5676): No heartbeat from core client for 30 sec - exiting 11:24:22 (5676): No heartbeat from core client for 30 sec - exiting 11:24:23 (5676): No heartbeat from core client for 30 sec - exiting 11:24:24 (5676): No heartbeat from core client for 30 sec - exiting 11:24:25 (5676): No heartbeat from core client for 30 sec - exiting 11:24:27 (5676): No heartbeat from core client for 30 sec - exiting 11:24:28 (5676): No heartbeat from core client for 30 sec - exiting 11:24:29 (5676): No heartbeat from core client for 30 sec - exiting 11:24:30 (5676): No heartbeat from core client for 30 sec - exiting 11:24:31 (5676): No heartbeat from core client for 30 sec - exiting 11:24:32 (5676): No heartbeat from core client for 30 sec - exiting 11:24:33 (5676): No heartbeat from core client for 30 sec - exiting 11:24:34 (5676): No heartbeat from core client for 30 sec - exiting 11:24:35 (5676): No heartbeat from core client for 30 sec - exiting 11:24:36 (5676): No heartbeat from core client for 30 sec - exiting 11:24:38 (5676): No heartbeat from core client for 30 sec - exiting 11:24:39 (5676): No heartbeat from core client for 30 sec - exiting 11:24:40 (5676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5520, iMonCtr=2 13:56:12 (1092): No heartbeat from core client for 30 sec - exiting 13:56:14 (1092): No heartbeat from core client for 30 sec - exiting 13:56:15 (1092): No heartbeat from core client for 30 sec - exiting 13:56:16 (1092): No heartbeat from core client for 30 sec - exiting 13:56:17 (1092): No heartbeat from core client for 30 sec - exiting 13:56:18 (1092): No heartbeat from core client for 30 sec - exiting 13:56:19 (1092): No heartbeat from core client for 30 sec - exiting 13:56:20 (1092): No heartbeat from core client for 30 sec - exiting 13:56:21 (1092): No heartbeat from core client for 30 sec - exiting 13:56:22 (1092): No heartbeat from core client for 30 sec - exiting 13:56:23 (1092): No heartbeat from core client for 30 sec - exiting 13:56:25 (1092): No heartbeat from core client for 30 sec - exiting 13:56:26 (1092): No heartbeat from core client for 30 sec - exiting 13:56:27 (1092): No heartbeat from core client for 30 sec - exiting 13:56:28 (1092): No heartbeat from core client for 30 sec - exiting 13:56:29 (1092): No heartbeat from core client for 30 sec - exiting 13:56:30 (1092): No heartbeat from core client for 30 sec - exiting 13:56:31 (1092): No heartbeat from core client for 30 sec - exiting 13:56:32 (1092): No heartbeat from core client for 30 sec - exiting 13:56:33 (1092): No heartbeat from core client for 30 sec - exiting 13:56:34 (1092): No heartbeat from core client for 30 sec - exiting 13:56:35 (1092): No heartbeat from core client for 30 sec - exiting 13:56:37 (1092): No heartbeat from core client for 30 sec - exiting 13:56:38 (1092): No heartbeat from core client for 30 sec - exiting 13:56:39 (1092): No heartbeat from core client for 30 sec - exiting 13:56:40 (1092): No heartbeat from core client for 30 sec - exiting 13:56:41 (1092): No heartbeat from core client for 30 sec - exiting 13:56:42 (1092): No heartbeat from core client for 30 sec - exiting 13:56:43 (1092): No heartbeat from core client for 30 sec - exiting 13:56:44 (1092): No heartbeat from core client for 30 sec - exiting 13:56:45 (1092): No heartbeat from core client for 30 sec - exiting 13:56:46 (1092): No heartbeat from core client for 30 sec - exiting 13:56:47 (1092): No heartbeat from core client for 30 sec - exiting 13:56:49 (1092): No heartbeat from core client for 30 sec - exiting 13:56:50 (1092): No heartbeat from core client for 30 sec - exiting 13:56:51 (1092): No heartbeat from core client for 30 sec - exiting 13:56:52 (1092): No heartbeat from core client for 30 sec - exiting 13:56:53 (1092): No heartbeat from core client for 30 sec - exiting 13:56:54 (1092): No heartbeat from core client for 30 sec - exiting 13:56:55 (1092): No heartbeat from core client for 30 sec - exiting 13:56:56 (1092): No heartbeat from core client for 30 sec - exiting 13:56:57 (1092): No heartbeat from core client for 30 sec - exiting 13:56:58 (1092): No heartbeat from core client for 30 sec - exiting 13:56:59 (1092): No heartbeat from core client for 30 sec - exiting 13:57:01 (1092): No heartbeat from core client for 30 sec - exiting 13:57:02 (1092): No heartbeat from core client for 30 sec - exiting 13:57:03 (1092): No heartbeat from core client for 30 sec - exiting 13:57:04 (1092): No heartbeat from core client for 30 sec - exiting 13:57:05 (1092): No heartbeat from core client for 30 sec - exiting 13:57:06 (1092): No heartbeat from core client for 30 sec - exiting 13:57:07 (1092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2636, selfPID=3712, iMonCtr=1 Model crash detected, will try to restart... 10:38:52 (3752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CGlobal Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6416, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6480, selfPID=4128, iMonCtr=1 Model crash detected, will try to restart... 08:51:17 (5300): No heartbeat from core client for 30 sec - exiting 08:51:18 (5300): No heartbeat from core client for 30 sec - exiting 08:51:19 (5300): No heartbeat from core client for 30 sec - exiting 08:51:20 (5300): No heartbeat from core client for 30 sec - exiting 08:51:21 (5300): No heartbeat from core client for 30 sec - exiting 08:51:22 (5300): No heartbeat from core client for 30 sec - exiting 08:51:23 (5300): No heartbeat from core client for 30 sec - exiting 08:51:24 (5300): No heartbeat from core client for 30 sec - exiting 08:51:26 (5300): No heartbeat from core client for 30 sec - exiting 08:51:27 (5300): No heartbeat from core client for 30 sec - exiting 08:51:28 (5300): No heartbeat from core client for 30 sec - exiting 08:51:29 (5300): No heartbeat from core client for 30 sec - exiting 08:51:30 (5300): No heartbeat from core client for 30 sec - exiting 08:51:31 (5300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1464, selfPID=4572, iMonCtr=1 Model crash detected, will try to restart... 14:49:28 (2196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
14 Nov 2012 14:19:38 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 138,336 | 534,854 | 3.8663 |
12 Nov 2012 17:28:40 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 126,816 | 491,405 | 3.8749 |
11 Nov 2012 13:38:09 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 115,296 | 447,393 | 3.8804 |
09 Nov 2012 15:25:05 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 103,776 | 403,775 | 3.8908 |
06 Nov 2012 17:37:10 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 92,256 | 360,260 | 3.9050 |
04 Nov 2012 21:56:07 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 80,736 | 317,084 | 3.9274 |
03 Nov 2012 14:10:30 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 69,237 | 272,995 | 3.9429 |
03 Nov 2012 13:10:23 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 69,216 | 272,261 | 3.9335 |
31 Oct 2012 22:09:14 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 57,696 | 227,159 | 3.9372 |
30 Oct 2012 10:55:14 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 46,176 | 182,032 | 3.9421 |
28 Oct 2012 16:44:47 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 34,656 | 136,743 | 3.9457 |
27 Oct 2012 08:54:07 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 23,136 | 91,836 | 3.9694 |
25 Oct 2012 13:30:36 | 1206904 | 15394728 | hadam3p_pnw_32ag_1997_1_008236917_0 | 11,616 | 46,319 | 3.9875 |
©2024 cpdn.org