Name | hadam3p_pnw_blqz_1987_1_008032406_1 |
Workunit | 8187520 |
Created | 8 Jul 2012, 19:38:09 UTC |
Sent | 8 Jul 2012, 19:39:30 UTC |
Report deadline | 21 Jun 2013, 0:59:30 UTC |
Received | 9 Sep 2012, 17:16:43 UTC |
Server state | Over |
Outcome | Success |
Client state | Done |
Exit status | 0 (0x00000000) |
Computer ID | 1164195 |
Run time | 5 days 21 hours 49 min 3 sec |
CPU time | 4 days 14 hours 56 min 25 sec |
Validate state | Workunit error - check skipped |
Credit | 3,005.88 |
Device peak FLOPS | 2.48 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <stderr_txt> Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5020, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3960, selfPID=4620, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7764, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4500, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4244, selfPID=4280, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5212, selfPID=4372, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4220, selfPID=4644, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 1 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1232, selfPID=4916, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 
Regional yearly means requires 12 input files got 1 07:49:16 (5040): No heartbeat from core client for 30 sec - exiting 07:49:17 (5040): No heartbeat from core client for 30 sec - exiting 07:49:18 (5040): No heartbeat from core client for 30 sec - exiting 07:49:19 (5040): No heartbeat from core client for 30 sec - exiting 07:49:20 (5040): No heartbeat from core client for 30 sec - exiting 07:49:21 (5040): No heartbeat from core client for 30 sec - exiting 07:49:22 (5040): No heartbeat from core client for 30 sec - exiting 07:49:23 (5040): No heartbeat from core client for 30 sec - exiting 07:49:24 (5040): No heartbeat from core client for 30 sec - exiting 07:49:25 (5040): No heartbeat from core client for 30 sec - exiting 07:49:26 (5040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2444, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4864, selfPID=4812, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2448, selfPID=4904, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1560, selfPID=3924, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=348, selfPID=4716, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 3 CPDN Monitor - Quit request from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4200, selfPID=3432, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1560, selfPID=4864, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4752, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 4 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7736, iMonCtr=2 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1316, selfPID=3016, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1460, selfPID=3272, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9028, selfPID=9028, iMonCtr=2 20:58:49 (7064): start_timer_thread(): CreateThread() failed, errno 0 20:58:49 (8856): start_timer_thread(): CreateThread() failed, errno 0 CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8856, selfPID=8856, iMonCtr=2 Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10212, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4216, selfPID=4940, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 7 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4180, selfPID=4180, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:31:09 (4220): No heartbeat from core client for 30 sec - exiting 06:31:10 (4220): No heartbeat from core client for 30 sec - exiting 06:31:11 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7316, selfPID=7316, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5008, iMonCtr=2 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
21:58:05 (1440): start_timer_thread(): CreateThread() failed, errno 0 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5072, selfPID=3184, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 8 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4608, selfPID=1888, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4252, selfPID=4252, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 07:17:42 (4588): No heartbeat from core client for 30 sec - exiting 07:17:43 (4588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4176, selfPID=4144, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4564, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1456, selfPID=1456, iMonCtr=2 GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4852, selfPID=5008, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4696, selfPID=2272, iMonCtr=1 Model crash detected, will try to restart... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2016, selfPID=4776, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Regional yearly means requires 12 input files got 11 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2796, selfPID=2796, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3380, selfPID=3380, iMonCtr=2 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3668, selfPID=3668, iMonCtr=2 CPDN Monitor - Quit request from BOINC... 07:04:13 (2764): No heartbeat from core client for 30 sec - exiting 07:04:14 (2764): No heartbeat from core client for 30 sec - exiting 07:04:15 (2764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7516, selfPID=7516, iMonCtr=2 Leaving CPDN_Main::Monitor... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Sep 2012 19:21:16 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 138,336 | 398,686 | 2.8820 |
28 Aug 2012 20:01:29 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 126,816 | 365,170 | 2.8795 |
24 Aug 2012 06:30:40 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 115,303 | 330,777 | 2.8688 |
23 Aug 2012 22:18:47 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 115,296 | 330,324 | 2.8650 |
16 Aug 2012 21:22:16 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 103,776 | 295,495 | 2.8474 |
10 Aug 2012 06:16:21 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 92,256 | 259,520 | 2.8130 |
07 Aug 2012 06:24:03 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 80,738 | 226,018 | 2.7994 |
06 Aug 2012 22:37:56 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 80,736 | 225,578 | 2.7940 |
02 Aug 2012 20:08:27 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 69,216 | 191,946 | 2.7731 |
29 Jul 2012 21:04:19 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 57,697 | 158,901 | 2.7541 |
29 Jul 2012 20:03:38 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 57,696 | 158,552 | 2.7481 |
22 Jul 2012 21:50:01 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 46,176 | 127,203 | 2.7547 |
18 Jul 2012 21:38:47 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 34,656 | 96,236 | 2.7769 |
16 Jul 2012 18:34:53 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 23,143 | 64,462 | 2.7854 |
16 Jul 2012 17:34:00 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 23,136 | 64,050 | 2.7684 |
11 Jul 2012 19:42:30 | 1164195 | 14878708 | hadam3p_pnw_blqz_1987_1_008032406_1 | 11,616 | 32,681 | 2.8134 |
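
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. The short sketch below checks that reading against three rows copied from the table. The function name `seconds_per_timestep` and the tuple layout are illustrative assumptions, not part of the CPDN or BOINC software.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
# Assumption: "Average (sec/TS)" = cumulative CPU seconds / cumulative timesteps.
TRICKLES = [
    (138336, 398686, 2.8820),  # 07 Sep 2012
    (126816, 365170, 2.8795),  # 28 Aug 2012
    (11616, 32681, 2.8134),    # 11 Jul 2012
]


def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return the average CPU seconds spent per model timestep (hypothetical helper)."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return cpu_time_sec / timestep


def matches_reported(cpu_time_sec: int, timestep: int, reported: float) -> bool:
    """True when the computed average agrees with the table to four decimals."""
    return round(seconds_per_timestep(cpu_time_sec, timestep), 4) == round(reported, 4)


if __name__ == "__main__":
    for timestep, cpu_sec, reported in TRICKLES:
        computed = seconds_per_timestep(cpu_sec, timestep)
        print(f"timestep={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

Under that assumption the three sampled rows agree with the reported values to four decimal places, which suggests the column is a running average rather than a per-interval rate.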