Name | hadam3p_pnw_psn9_2013_1_009984494_0 |
Workunit | 9990852 |
Created | 29 Jun 2015, 19:01:05 UTC |
Sent | 30 Jun 2015, 11:19:03 UTC |
Report deadline | 11 Jun 2016, 16:39:03 UTC |
Received | 29 Aug 2015, 12:28:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 0 (0x00000000) |
Computer ID | 1129146 |
Run time | 6 days 10 hours 50 min 35 sec |
CPU time | 5 days 20 hours 45 min |
Validate state | Invalid |
Credit | 4,011.55 |
Device peak FLOPS | 3.25 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Pacific North West v7.27 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8056, iMonCtr=2 Model crash detected, will try to restart... 17:02:06 (7524): No heartbeat from client for 30 sec - exiting 17:02:06 (7524): timer handler: client dead, exiting 17:02:07 (7524): No heartbeat from client for 30 sec - exiting 17:02:07 (7524): timer handler: client dead, exiting 17:02:08 (7524): No heartbeat from client for 30 sec - exiting 17:02:08 (7524): timer handler: client dead, exiting 17:02:09 (7524): No heartbeat from client for 30 sec - exiting 17:02:09 (7524): timer handler: client dead, exiting 17:02:10 (7524): No heartbeat from client for 30 sec - exiting 17:02:10 (7524): timer handler: client dead, exiting 17:02:11 (7524): No heartbeat from client for 30 sec - exiting 17:02:11 (7524): timer handler: client dead, exiting 17:02:12 (7524): No heartbeat from client for 30 sec - exiting 17:02:12 (7524): timer handler: client dead, exiting 17:02:13 (7524): No heartbeat from client for 30 sec - exiting 17:02:13 (7524): timer handler: client dead, exiting 17:02:14 (7524): No heartbeat from client for 30 sec - exiting 17:02:14 (7524): timer handler: client dead, exiting 17:02:15 (7524): No heartbeat from client for 30 sec - exiting 17:02:15 (7524): timer handler: client dead, exiting 17:02:16 (7524): No heartbeat from client for 30 sec - exiting 17:02:16 (7524): timer handler: client dead, exiting 17:02:18 (7524): No heartbeat from client for 30 sec - exiting 17:02:18 (7524): timer handler: client dead, exiting 17:02:19 (7524): No heartbeat from client for 30 sec - exiting 17:02:19 (7524): timer handler: client dead, exiting 17:02:20 (7524): No heartbeat from client for 30 sec - exiting 17:02:20 (7524): timer handler: client dead, exiting 17:02:21 (7524): No heartbeat from client for 30 sec - exiting 17:02:21 (7524): timer handler: client dead, exiting 17:02:22 (7524): No heartbeat from client for 30 sec - exiting 17:02:22 (7524): timer handler: client dead, exiting 17:02:23 (7524): No heartbeat from client for 30 sec - exiting 17:02:23 (7524): timer handler: client dead, exiting 17:02:24 (7524): No heartbeat from client for 30 sec - exiting 17:02:24 (7524): timer handler: client dead, exiting 17:02:25 (7524): No heartbeat from client for 30 sec - exiting 17:02:25 (7524): timer handler: client dead, exiting 17:02:26 (7524): No heartbeat from client for 30 sec - exiting 17:02:26 (7524): timer handler: client dead, exiting 17:02:27 (7524): No heartbeat from client for 30 sec - exiting 17:02:27 (7524): timer handler: client dead, exiting 17:02:29 (7524): No heartbeat from client for 30 sec - exiting 17:02:29 (7524): timer handler: client dead, exiting 17:02:30 (7524): No heartbeat from client for 30 sec - exiting 17:02:30 (7524): timer handler: client dead, exiting 17:02:31 (7524): No heartbeat from client for 30 sec - exiting 17:02:31 (7524): timer handler: client dead, exiting 17:02:32 (7524): No heartbeat from client for 30 sec - exiting 17:02:32 (7524): timer handler: client dead, exiting 17:02:33 (7524): No heartbeat from client for 30 sec - exiting 17:02:33 (7524): timer handler: client dead, exiting 17:02:34 (7524): No heartbeat from client for 30 sec - exiting 17:02:34 (7524): timer handler: client dead, exiting 17:02:35 (7524): No heartbeat from client for 30 sec - exiting 17:02:35 (7524): timer handler: client dead, exiting 17:02:36 (7524): No heartbeat from client for 30 sec - exiting 17:02:36 (7524): timer handler: client dead, exiting 17:02:37 (7524): No heartbeat from client for 30 sec - exiting 17:02:37 (7524): timer handler: client dead, exiting 17:02:38 (7524): No heartbeat from client for 30 sec - exiting 17:02:38 (7524): timer handler: client dead, exiting 17:02:39 (7524): No heartbeat from client for 30 sec - exiting 17:02:39 (7524): timer handler: client dead, exiting 17:02:41 (7524): No heartbeat from client for 30 sec - exiting 17:02:41 (7524): timer handler: client dead, exiting 17:02:42 (7524): No heartbeat from client for 30 sec - exiting 17:02:42 (7524): timer handler: client dead, exiting 17:02:43 (7524): No heartbeat from client for 30 sec - exiting 17:02:43 (7524): timer handler: client dead, exiting 17:02:44 (7524): No heartbeat from client for 30 sec - exiting 17:02:44 (7524): timer handler: client dead, exiting 17:02:45 (7524): No heartbeat from client for 30 sec - exiting 17:02:45 (7524): timer handler: client dead, exiting 17:02:46 (7524): No heartbeat from client for 30 sec - exiting 17:02:46 (7524): timer handler: client dead, exiting 17:02:47 (7524): No heartbeat from client for 30 sec - exiting 17:02:47 (7524): timer handler: client dead, exiting 17:02:48 (7524): No heartbeat from client for 30 sec - exiting 17:02:48 (7524): timer handler: client dead, exiting 17:02:49 (7524): No heartbeat from client for 30 sec - exiting 17:02:49 (7524): timer handler: client dead, exiting 17:02:50 (7524): No heartbeat from client for 30 sec - exiting 17:02:50 (7524): timer handler: client dead, exiting 17:02:51 (7524): No heartbeat from client for 30 sec - exiting 17:02:51 (7524): timer handler: client dead, exiting 17:02:53 (7524): No heartbeat from client for 30 sec - exiting 17:02:53 (7524): timer handler: client dead, exiting 17:02:54 (7524): No heartbeat from client for 30 sec - exiting 17:02:54 (7524): timer handler: client dead, exiting 17:02:55 (7524): No heartbeat from client for 30 sec - exiting 17:02:55 (7524): timer handler: client dead, exiting 17:02:56 (7524): No heartbeat from client for 30 sec - exiting 17:02:56 (7524): timer handler: client dead, exiting GController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3724, selfPID=6992, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7772, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7876, selfPID=7008, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7216, selfPID=5428, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7828, selfPID=7064, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7776, selfPID=7120, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7296, selfPID=7116, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7720, selfPID=7036, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=8056, selfPID=6628, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7728, selfPID=7120, iMonCtr=1 Model crash detected, will try to restart... cpdnmonitor: error reading file D:\05_Climateprediction\02_Data/projects/climateprediction.net/hadam3p_pnw_psn9_2013_1_009984494/datain/ancil/ic19611116_16_N96 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3612, selfPID=8172, iMonCtr=1 Model crash detected, will try to restart... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7432, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7612, selfPID=2448, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7040, selfPID=6432, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7312, iMonCtr=2 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CGController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2876, selfPID=6256, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6756, iMonCtr=2 Model crash detected, will try to restart... 22:09:35 (1948): start_timer_thread(): CreateThread() failed, errno 0 17:59:56 (5292): start_timer_thread(): CreateThread() failed, errno 0 17:03:45 (9208): start_timer_thread(): CreateThread() failed, errno 0 17:37:32 (8568): start_timer_thread(): CreateThread() failed, errno 0 15:15:56 (7052): start_timer_thread(): CreateThread() failed, errno 0 15:15:58 (8788): start_timer_thread(): CreateThread() failed, errno 0 09:50:29 (6884): start_timer_thread(): CreateThread() failed, errno 0 09:50:31 (3120): start_timer_thread(): CreateThread() failed, errno 0 09:48:33 (8336): start_timer_thread(): CreateThread() failed, errno 0 Global Worker:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=6388, iMonCtr=1 Regional Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=0, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=9716, selfPID=4660, iMonCtr=1 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... 10:23:29 (4660): called boinc_finish(0) </stderr_txt> <message> <file_xfer_error> <file_name>hadam3p_pnw_psn9_2013_1_009984494_0_17.zip</file_name> <error_code>-161</error_code> </file_xfer_error> <file_xfer_error> <file_name>hadam3p_pnw_psn9_2013_1_009984494_0_18.zip</file_name> <error_code>-161</error_code> </file_xfer_error> </message> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Aug 2015 15:23:40 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 184,619 | 483,164 | 2.6171 |
22 Aug 2015 17:27:33 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 173,099 | 452,217 | 2.6125 |
18 Aug 2015 17:48:36 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 161,579 | 422,002 | 2.6117 |
16 Aug 2015 12:42:47 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 150,059 | 391,987 | 2.6122 |
06 Aug 2015 15:04:18 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 138,539 | 362,055 | 2.6134 |
02 Aug 2015 07:38:52 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 127,019 | 333,026 | 2.6219 |
27 Jul 2015 20:30:44 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 115,499 | 303,646 | 2.6290 |
25 Jul 2015 20:13:21 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 103,979 | 273,851 | 2.6337 |
23 Jul 2015 12:31:50 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 92,459 | 244,050 | 2.6395 |
21 Jul 2015 15:03:38 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 80,939 | 213,969 | 2.6436 |
18 Jul 2015 17:14:53 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 69,419 | 182,838 | 2.6338 |
16 Jul 2015 19:37:29 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 57,899 | 152,863 | 2.6402 |
12 Jul 2015 18:15:27 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 46,379 | 122,580 | 2.6430 |
11 Jul 2015 16:45:43 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 34,859 | 92,424 | 2.6514 |
08 Jul 2015 19:00:46 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 23,339 | 62,447 | 2.6757 |
07 Jul 2015 14:17:24 | 1129146 | 18649483 | hadam3p_pnw_psn9_2013_1_009984494_0 | 11,819 | 31,577 | 2.6717 |
©2024 climateprediction.net