Name | hadam3p_pnw_2xxw_1984_1_007177484_0 |
Workunit | 7375766 |
Created | 22 Feb 2011, 11:42:46 UTC |
Sent | 9 Mar 2011, 3:12:03 UTC |
Report deadline | 19 Feb 2012, 8:32:03 UTC |
Received | 21 Apr 2011, 2:37:07 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1025193 |
Run time | |
CPU time | 5 days 3 hours 6 min 5 sec |
Validate state | Workunit error - check skipped |
Credit | 3,005.88 |
Device peak FLOPS | 2.77 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.08 windows_intelx86 |
Stderr | <core_client_version>6.2.28</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3556, iMonCtr=2 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 14:40:13 (3400): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4964, selfPID=6116, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2180, selfPID=4720, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2436, selfPID=5332, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1320, selfPID=4536, iMonCtr=1 Model crash detected, will try to restart... GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2464, selfPID=3868, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5580, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1996, selfPID=2664, iMonCtr=1 Model crash detected, will try to restart... 10:22:56 (6104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5096, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3276, selfPID=744, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5812, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 16:32:18 (5176): No heartbeat from core client for 30 sec - exiting 16:32:19 (5176): No heartbeat from core client for 30 sec - exiting 16:32:20 (5176): No heartbeat from core client for 30 sec - exiting 16:32:21 (5176): No heartbeat from core client for 30 sec - exiting 16:32:23 (5176): No heartbeat from core client for 30 sec - exiting 16:32:24 (5176): No heartbeat from core client for 30 sec - exiting 16:32:25 (5176): No heartbeat from core client for 30 sec - exiting 16:32:26 (5176): No heartbeat from core client for 30 sec - exiting 16:32:27 (5176): No heartbeat from core client for 30 sec - exiting 16:32:28 (5176): No heartbeat from core client for 30 sec - exiting 16:32:29 (5176): No heartbeat from core client for 30 sec - exiting 16:32:30 (5176): No heartbeat from core client for 30 sec - exiting 16:32:31 (5176): No heartbeat from core client for 30 sec - exiting 16:32:32 (5176): No heartbeat from core client for 30 sec - exiting 16:32:33 (5176): No heartbeat from core client for 30 sec - exiting 16:32:35 (5176): No heartbeat from core client for 30 sec - exiting 16:32:36 (5176): No heartbeat from core client for 30 sec - exiting 16:32:37 (5176): No heartbeat from core client for 30 sec - exiting 16:32:38 (5176): No heartbeat from core client for 30 sec - exiting 16:32:39 (5176): No heartbeat from core client for 30 sec - exiting 16:32:40 (5176): No heartbeat from core client for 30 sec - exiting 16:32:41 (5176): No heartbeat from core client for 30 sec - exiting 16:32:42 (5176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=288, selfPID=5828, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2824, selfPID=6024, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5312, selfPID=4672, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5412, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=480, selfPID=4812, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4180, selfPID=5148, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 7 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4972, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... 15:46:37 (364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:58:58 (1520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3480, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5408, selfPID=2724, iMonCtr=1 Model crash detected, will try to restart... 16:37:33 (5000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4336, iMonCtr=2 Model crash detected, will try to restart... 17:08:55 (4440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2156, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 9 09:34:11 (5920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4108, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=2 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5612, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=872, selfPID=5176, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 10 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4368, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4380, selfPID=4232, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4520, iMonCtr=2 Leaving CPDN_Main::Monitor... 12:35:17 (5016): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
21 Apr 2011 03:43:53 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 138,336 | 442,320 | 3.1974 |
13 Apr 2011 01:02:08 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 126,816 | 405,887 | 3.2006 |
11 Apr 2011 06:27:00 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 115,296 | 369,956 | 3.2087 |
08 Apr 2011 04:46:17 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 103,776 | 333,798 | 3.2165 |
07 Apr 2011 01:54:37 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 92,256 | 297,897 | 3.2290 |
05 Apr 2011 07:10:24 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 80,741 | 261,430 | 3.2379 |
05 Apr 2011 06:54:55 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 80,736 | 261,061 | 3.2335 |
04 Apr 2011 04:09:09 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 69,216 | 224,842 | 3.2484 |
01 Apr 2011 02:30:53 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 57,696 | 187,760 | 3.2543 |
30 Mar 2011 08:28:15 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 46,176 | 150,782 | 3.2654 |
29 Mar 2011 05:45:04 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 34,656 | 114,033 | 3.2904 |
28 Mar 2011 05:05:22 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 23,136 | 76,914 | 3.3244 |
10 Mar 2011 08:07:38 | 1025193 | 12616806 | hadam3p_pnw_2xxw_1984_1_007177484_0 | 11,616 | 39,087 | 3.3649 |
©2024 cpdn.org