Name | hadcm3n_p2yq_1940_40_007420224_0 |
Workunit | 7617859 |
Created | 24 Aug 2011, 21:46:55 UTC |
Sent | 24 Aug 2011, 21:48:16 UTC |
Report deadline | 24 Nov 2011, 5:15:27 UTC |
Received | 30 Sep 2011, 11:05:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1165442 |
Run time | 4 days 15 hours 57 min 7 sec |
CPU time | 4 days 14 hours 16 min |
Validate state | Invalid |
Credit | 4,043.52 |
Device peak FLOPS | 2.88 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.33</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:15:04 (3120): No heartbeat from core client for 30 sec - exiting 13:15:05 (3120): No heartbeat from core client for 30 sec - exiting 13:15:06 (3120): No heartbeat from core client for 30 sec - exiting 13:15:07 (3120): No heartbeat from core client for 30 sec - exiting 13:15:08 (3120): No heartbeat from core client for 30 sec - exiting 13:15:09 (3120): No heartbeat from core client for 30 sec - exiting 13:15:10 (3120): No heartbeat from core client for 30 sec - exiting 13:15:11 (3120): No heartbeat from core client for 30 sec - exiting 13:15:12 (3120): No heartbeat from core client for 30 sec - exiting 13:15:13 (3120): No heartbeat from core client for 30 sec - exiting 13:15:14 (3120): No heartbeat from core client for 30 sec - exiting 13:15:15 (3120): No heartbeat from core client for 30 sec - exiting 13:15:16 (3120): No heartbeat from core client for 30 sec - exiting 13:15:17 (3120): No heartbeat from core client for 30 sec - exiting 13:15:18 (3120): No heartbeat from core client for 30 sec - exiting 13:15:19 (3120): No heartbeat from core client for 30 sec - exiting 13:15:20 (3120): No heartbeat from core client for 30 sec - exiting 13:15:21 (3120): No heartbeat from core client for 30 sec - exiting 13:15:22 (3120): No heartbeat from core client for 30 sec - exiting 13:15:23 (3120): No heartbeat from core client for 30 sec - exiting 13:15:24 (3120): No heartbeat from core client for 30 sec - exiting 13:15:25 (3120): No heartbeat from core client for 30 sec - exiting 13:15:26 (3120): No heartbeat from core client for 30 sec - exiting 13:15:27 (3120): No heartbeat from core client for 30 sec - exiting 13:15:28 (3120): No heartbeat from core client for 30 sec - exiting 13:15:29 (3120): No heartbeat from core client for 30 sec - exiting 13:15:30 (3120): No heartbeat from core client for 30 sec - exiting 13:15:31 (3120): No heartbeat from core client for 30 sec - exiting 13:15:32 (3120): No heartbeat from core client for 30 sec - exiting 13:15:33 (3120): No heartbeat from core client for 30 sec - exiting 13:15:34 (3120): No heartbeat from core client for 30 sec - exiting 13:15:35 (3120): No heartbeat from core client for 30 sec - exiting 13:15:36 (3120): No heartbeat from core client for 30 sec - exiting 13:15:37 (3120): No heartbeat from core client for 30 sec - exiting 13:15:38 (3120): No heartbeat from core client for 30 sec - exiting 13:15:39 (3120): No heartbeat from core client for 30 sec - exiting 13:15:40 (3120): No heartbeat from core client for 30 sec - exiting 13:15:41 (3120): No heartbeat from core client for 30 sec - exiting 13:15:42 (3120): No heartbeat from core client for 30 sec - exiting 13:15:43 (3120): No heartbeat from core client for 30 sec - exiting 13:15:44 (3120): No heartbeat from core client for 30 sec - exiting 13:15:45 (3120): No heartbeat from core client for 30 sec - exiting 13:15:46 (3120): No heartbeat from core client for 30 sec - exiting 13:15:47 (3120): No heartbeat from core client for 30 sec - exiting 13:15:48 (3120): No heartbeat from core client for 30 sec - exiting 13:15:49 (3120): No heartbeat from core client for 30 sec - exiting 13:15:50 (3120): No heartbeat from core client for 30 sec - exiting 13:15:51 (3120): No heartbeat from core client for 30 sec - exiting 13:15:52 (3120): No heartbeat from core client for 30 sec - exiting 13:15:53 (3120): No heartbeat from core client for 30 sec - exiting 13:15:54 (3120): No heartbeat from core client for 30 sec - exiting 13:15:55 (3120): No heartbeat from core client for 30 sec - exiting 13:15:56 (3120): No heartbeat from core client for 30 sec - exiting 13:15:57 (3120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:15:07 (3484): No heartbeat from core client for 30 sec - exiting 17:15:09 (3484): No heartbeat from core client for 30 sec - exiting 17:15:10 (3484): No heartbeat from core client for 30 sec - exiting 17:15:11 (3484): No heartbeat from core client for 30 sec - exiting 17:15:12 (3484): No heartbeat from core client for 30 sec - exiting 17:15:13 (3484): No heartbeat from core client for 30 sec - exiting 17:15:14 (3484): No heartbeat from core client for 30 sec - exiting 17:15:15 (3484): No heartbeat from core client for 30 sec - exiting 17:15:16 (3484): No heartbeat from core client for 30 sec - exiting 17:15:17 (3484): No heartbeat from core client for 30 sec - exiting 17:15:18 (3484): No heartbeat from core client for 30 sec - exiting 17:15:20 (3484): No heartbeat from core client for 30 sec - exiting 17:15:21 (3484): No heartbeat from core client for 30 sec - exiting 17:15:22 (3484): No heartbeat from core client for 30 sec - exiting 17:15:23 (3484): No heartbeat from core client for 30 sec - exiting 17:15:24 (3484): No heartbeat from core client for 30 sec - exiting 17:15:25 (3484): No heartbeat from core client for 30 sec - exiting 17:15:26 (3484): No heartbeat from core client for 30 sec - exiting 17:15:27 (3484): No heartbeat from core client for 30 sec - exiting 17:15:28 (3484): No heartbeat from core client for 30 sec - exiting 17:15:29 (3484): No heartbeat from core client for 30 sec - exiting 17:15:30 (3484): No heartbeat from core client for 30 sec - exiting 17:15:31 (3484): No heartbeat from core client for 30 sec - exiting 17:15:32 (3484): No heartbeat from core client for 30 sec - exiting 17:15:33 (3484): No heartbeat from core client for 30 sec - exiting 17:15:34 (3484): No heartbeat from core client for 30 sec - exiting 17:15:35 (3484): No heartbeat from core client for 30 sec - exiting 17:15:36 (3484): No heartbeat from core client for 30 sec - exiting 17:15:37 (3484): No heartbeat from core client for 30 sec - exiting 17:15:38 (3484): No heartbeat from core client for 30 sec - exiting 17:15:39 (3484): No heartbeat from core client for 30 sec - exiting 17:15:40 (3484): No heartbeat from core client for 30 sec - exiting 17:15:41 (3484): No heartbeat from core client for 30 sec - exiting 17:15:42 (3484): No heartbeat from core client for 30 sec - exiting 17:15:43 (3484): No heartbeat from core client for 30 sec - exiting 17:15:44 (3484): No heartbeat from core client for 30 sec - exiting 17:15:45 (3484): No heartbeat from core client for 30 sec - exiting 17:15:46 (3484): No heartbeat from core client for 30 sec - exiting 17:15:47 (3484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2360, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
29 Sep 2011 09:28:36 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 336,960 | 391,196 | 1.1610 |
29 Sep 2011 03:23:36 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 311,040 | 369,379 | 1.1876 |
28 Sep 2011 18:40:25 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 285,120 | 346,894 | 1.2167 |
28 Sep 2011 12:17:29 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 259,200 | 324,631 | 1.2524 |
27 Sep 2011 19:07:13 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 233,280 | 299,296 | 1.2830 |
27 Sep 2011 11:55:11 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 207,360 | 273,348 | 1.3182 |
26 Sep 2011 18:30:41 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 181,440 | 247,135 | 1.3621 |
26 Sep 2011 11:16:19 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 155,520 | 221,100 | 1.4217 |
25 Sep 2011 18:08:47 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 129,600 | 195,310 | 1.5070 |
24 Sep 2011 23:18:15 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 103,680 | 161,986 | 1.5624 |
24 Sep 2011 00:30:59 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 77,760 | 125,898 | 1.6191 |
26 Aug 2011 15:19:30 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 51,840 | 87,135 | 1.6808 |
25 Aug 2011 10:39:48 | 1165442 | 13288209 | hadcm3n_p2yq_1940_40_007420224_0 | 25,920 | 43,243 | 1.6683 |
©2024 cpdn.org