Name | hadcm3n_o02f_2140_40_008270350_3 |
Workunit | 8425474 |
Created | 15 May 2013, 14:09:25 UTC |
Sent | 15 May 2013, 14:09:43 UTC |
Report deadline | 14 Aug 2013, 21:36:54 UTC |
Received | 20 Jul 2013, 21:36:14 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1274202 |
Run time | 46 days 8 hours 5 min 3 sec |
CPU time | 37 days 3 hours 39 min 7 sec |
Validate state | Invalid |
Credit | 8,087.04 |
Device peak FLOPS | 1.35 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 04:55:41 (2856): No heartbeat from core client for 30 sec - exiting 04:55:42 (2856): No heartbeat from core client for 30 sec - exiting 04:55:43 (2856): No heartbeat from core client for 30 sec - exiting 04:55:44 (2856): No heartbeat from core client for 30 sec - exiting 04:55:45 (2856): No heartbeat from core client for 30 sec - exiting 04:55:47 (2856): No heartbeat from core client for 30 sec - exiting 04:55:48 (2856): No heartbeat from core client for 30 sec - exiting 04:55:49 (2856): No heartbeat from core client for 30 sec - exiting 04:55:50 (2856): No heartbeat from core client for 30 sec - exiting 04:55:51 (2856): No heartbeat from core client for 30 sec - exiting 04:55:52 (2856): No heartbeat from core client for 30 sec - exiting 04:55:53 (2856): No heartbeat from core client for 30 sec - exiting 04:55:54 (2856): No heartbeat from core client for 30 sec - exiting 04:55:55 (2856): No heartbeat from core client for 30 sec - exiting 04:55:56 (2856): No heartbeat from core client for 30 sec - exiting 04:55:57 (2856): No heartbeat from core client for 30 sec - exiting 04:55:58 (2856): No heartbeat from core client for 30 sec - exiting 04:55:59 (2856): No heartbeat from core client for 30 sec - exiting 04:56:00 (2856): No heartbeat from core client for 30 sec - exiting 04:56:01 (2856): No heartbeat from core client for 30 sec - exiting 04:56:02 (2856): No heartbeat from core client for 30 sec - exiting 04:56:03 (2856): No heartbeat from core client for 30 sec - exiting 04:56:04 (2856): No heartbeat from core client for 30 sec - exiting 04:56:05 (2856): No heartbeat from core client for 30 sec - exiting 04:56:06 (2856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:38:18 (6180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from CPDN Monitor - Quit requesCPDN Monitor - Quit request fCPDN Monitor - Quit request froCPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:43:30 (4644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:51:36 (6068): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:34:21 (4320): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:17:29 (6152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... MainError: 01:26:44 AM No files match the supplied pattern. MainError: 01:26:44 AM No files match the supplied pattern. MainError: 11:29:24 AM No files match the supplied pattern. MainError: 11:29:24 AM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... MainError: 12:56:47 AM No files match the supplied pattern. MainError: 12:56:48 AM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... MainError: 05:29:03 PM No files match the supplied pattern. MainError: 05:29:03 PM No files match the supplied pattern. MainError: 07:50:32 AM No files match the supplied pattern. MainError: 07:50:32 AM No files match the supplied pattern. 17:03:57 (5100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 09:39:57 PM No files match the supplied pattern. MainError: 09:39:57 PM No files match the supplied pattern. MainError: 01:03:18 PM No files match the supplied pattern. MainError: 01:03:18 PM No files match the supplied pattern. 15:53:23 (1564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4672, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3228, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
23 Jul 2013 16:44:30 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 673,920 | 3,164,490 | 4.6956 |
23 Jul 2013 16:44:30 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 648,000 | 3,041,716 | 4.6940 |
23 Jul 2013 16:44:30 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 622,080 | 2,915,192 | 4.6862 |
23 Jul 2013 16:44:30 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 596,160 | 2,787,901 | 4.6764 |
11 Jul 2013 00:57:51 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 570,240 | 2,665,904 | 4.6751 |
09 Jul 2013 11:33:15 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 544,320 | 2,548,073 | 4.6812 |
08 Jul 2013 01:30:46 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 518,400 | 2,434,587 | 4.6963 |
06 Jul 2013 07:16:53 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 492,480 | 2,311,491 | 4.6936 |
04 Jul 2013 14:28:41 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 466,560 | 2,192,436 | 4.6992 |
02 Jul 2013 16:57:13 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 440,640 | 2,065,155 | 4.6867 |
02 Jul 2013 09:55:06 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 414,720 | 1,945,083 | 4.6901 |
27 Jun 2013 07:28:24 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 388,800 | 1,824,935 | 4.6938 |
23 Jun 2013 06:30:47 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 362,880 | 1,704,518 | 4.6972 |
21 Jun 2013 13:51:21 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 336,960 | 1,589,368 | 4.7168 |
19 Jun 2013 15:49:36 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 311,040 | 1,466,984 | 4.7164 |
18 Jun 2013 02:10:53 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 285,120 | 1,351,792 | 4.7411 |
12 Jun 2013 05:34:41 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 259,200 | 1,233,224 | 4.7578 |
10 Jun 2013 08:47:47 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 233,280 | 1,101,676 | 4.7225 |
08 Jun 2013 05:22:17 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 207,360 | 973,228 | 4.6934 |
06 Jun 2013 11:12:22 | 1274202 | 15785283 | hadcm3n_o02f_2140_40_008270350_3 | 181,440 | 846,542 | 4.6657 |
©2024 cpdn.org