Name | hadcm3n_o3rx_2020_40_008403655_2 |
Workunit | 8554511 |
Created | 1 Nov 2013, 18:50:52 UTC |
Sent | 1 Nov 2013, 18:50:56 UTC |
Report deadline | 1 Feb 2014, 2:18:07 UTC |
Received | 9 Dec 2013, 21:30:41 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1113142 |
Run time | 23 days 15 hours 52 min 39 sec |
CPU time | 14 days 14 hours 17 min 23 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.18 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:06:28 (7056): No heartbeat from core client for 30 sec - exiting 17:06:29 (7056): No heartbeat from core client for 30 sec - exiting 17:06:30 (7056): No heartbeat from core client for 30 sec - exiting 17:06:31 (7056): No heartbeat from core client for 30 sec - exiting 17:06:33 (7056): No heartbeat from core client for 30 sec - exiting 17:06:34 (7056): No heartbeat from core client for 30 sec - exiting 17:06:35 (7056): No heartbeat from core client for 30 sec - exiting 17:06:36 (7056): No heartbeat from core client for 30 sec - exiting 17:06:37 (7056): No heartbeat from core client for 30 sec - exiting 17:06:38 (7056): No heartbeat from core client for 30 sec - exiting 17:06:39 (7056): No heartbeat from core client for 30 sec - exiting 17:06:40 (7056): No heartbeat from core client for 30 sec - exiting 17:06:41 (7056): No heartbeat from core client for 30 sec - exiting 17:06:42 (7056): No heartbeat from core client for 30 sec - exiting 17:06:43 (7056): No heartbeat from core client for 30 sec - exiting 17:06:45 (7056): No heartbeat from core client for 30 sec - exiting 17:06:46 (7056): No heartbeat from core client for 30 sec - exiting 17:06:47 (7056): No heartbeat from core client for 30 sec - exiting 17:06:48 (7056): No heartbeat from core client for 30 sec - exiting 17:06:49 (7056): No heartbeat from core client for 30 sec - exiting 17:06:50 (7056): No heartbeat from core client for 30 sec - exiting 17:06:51 (7056): 
No heartbeat from core client for 30 sec - exiting 17:06:52 (7056): No heartbeat from core client for 30 sec - exiting 17:06:53 (7056): No heartbeat from core client for 30 sec - exiting 17:06:54 (7056): No heartbeat from core client for 30 sec - exiting 17:06:55 (7056): No heartbeat from core client for 30 sec - exiting 17:06:57 (7056): No heartbeat from core client for 30 sec - exiting 17:06:58 (7056): No heartbeat from core client for 30 sec - exiting 17:06:59 (7056): No heartbeat from core client for 30 sec - exiting 17:07:00 (7056): No heartbeat from core client for 30 sec - exiting 17:07:01 (7056): No heartbeat from core client for 30 sec - exiting 17:07:02 (7056): No heartbeat from core client for 30 sec - exiting 17:07:03 (7056): No heartbeat from core client for 30 sec - exiting 17:07:04 (7056): No heartbeat from core client for 30 sec - exiting 17:07:05 (7056): No heartbeat from core client for 30 sec - exiting 17:07:06 (7056): No heartbeat from core client for 30 sec - exiting 17:07:07 (7056): No heartbeat from core client for 30 sec - exiting 17:07:09 (7056): No heartbeat from core client for 30 sec - exiting 17:07:10 (7056): No heartbeat from core client for 30 sec - exiting 17:07:11 (7056): No heartbeat from core client for 30 sec - exiting 17:07:12 (7056): No heartbeat from core client for 30 sec - exiting 17:07:13 (7056): No heartbeat from core client for 30 sec - exiting 17:07:14 (7056): No heartbeat from core client for 30 sec - exiting 17:07:15 (7056): No heartbeat from core client for 30 sec - exiting 17:07:16 (7056): No heartbeat from core client for 30 sec - exiting 17:07:17 (7056): No heartbeat from core client for 30 sec - exiting 17:07:18 (7056): No heartbeat from core client for 30 sec - exiting 17:07:19 (7056): No heartbeat from core client for 30 sec - exiting 17:07:21 (7056): No heartbeat from core client for 30 sec - exiting 17:07:22 (7056): No heartbeat from core client for 30 sec - exiting 17:07:23 (7056): No heartbeat from core 
client for 30 sec - exiting 17:07:24 (7056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:07:25 (7056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:21:40 (7388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:21:41 (7388): No heartbeat from core client for 30 sec - exiting 21:21:42 (7388): No heartbeat from core client for 30 sec - exiting 21:21:43 (7388): No heartbeat from core client for 30 sec - exiting 21:21:44 (7388): No heartbeat from core client for 30 sec - exiting 21:21:45 (7388): No heartbeat from core client for 30 sec - exiting 21:21:46 (7388): No heartbeat from core client for 30 sec - exiting 21:21:47 (7388): No heartbeat from core client for 30 sec - exiting 21:21:48 (7388): No heartbeat from core client for 30 sec - exiting 21:21:49 (7388): No heartbeat from core client for 30 sec - exiting 21:21:50 (7388): No heartbeat from core client for 30 sec - exiting forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7828, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 09:48:29 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Dec 2013 16:28:34 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 777,600 | 1,352,456 | 1.7393 |
08 Dec 2013 23:30:29 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 751,680 | 1,308,237 | 1.7404 |
08 Dec 2013 05:23:17 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 725,760 | 1,263,111 | 1.7404 |
07 Dec 2013 08:25:18 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 699,840 | 1,217,127 | 1.7392 |
06 Dec 2013 12:59:34 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 673,920 | 1,171,796 | 1.7388 |
05 Dec 2013 19:01:33 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 648,000 | 1,127,548 | 1.7400 |
04 Dec 2013 23:39:48 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 622,080 | 1,083,036 | 1.7410 |
04 Dec 2013 06:39:25 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 596,160 | 1,037,698 | 1.7406 |
03 Dec 2013 09:58:37 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 570,240 | 991,211 | 1.7382 |
02 Dec 2013 11:29:38 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 544,320 | 945,505 | 1.7370 |
01 Dec 2013 14:59:35 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 518,400 | 899,220 | 1.7346 |
30 Nov 2013 20:05:44 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 492,480 | 852,801 | 1.7316 |
30 Nov 2013 03:08:30 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 466,560 | 806,754 | 1.7292 |
29 Nov 2013 11:29:38 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 440,640 | 760,541 | 1.7260 |
28 Nov 2013 18:50:28 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 414,720 | 714,972 | 1.7240 |
28 Nov 2013 02:40:45 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 388,800 | 669,717 | 1.7225 |
27 Nov 2013 07:00:27 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 362,880 | 624,748 | 1.7216 |
26 Nov 2013 09:10:22 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 336,960 | 579,916 | 1.7210 |
24 Nov 2013 21:55:33 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 311,040 | 534,825 | 1.7195 |
24 Nov 2013 04:35:45 | 1113142 | 16076613 | hadcm3n_o3rx_2020_40_008403655_2 | 285,120 | 489,959 | 1.7184 |
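
The page does not say how the last column of the trickle table is derived. The listed values are consistent with cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below checks that assumption against three rows copied from the table; the function name `average_sec_per_timestep` is hypothetical and not part of BOINC or CPDN.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the "Average (sec/TS)" column is computed;
    the page itself does not document the formula.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, average listed on the page), copied from the table
rows = [
    (777_600, 1_352_456, 1.7393),
    (751_680, 1_308_237, 1.7404),
    (285_120, 489_959, 1.7184),
]

for timestep, cpu_time_sec, listed in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Compare with a small tolerance instead of exact float equality
    matches = abs(computed - listed) < 1e-9
    print(f"{timestep:>8,d}  computed={computed:.4f}  listed={listed:.4f}  match={matches}")
```

Because the column is a running average over the whole task, it changes slowly from row to row. Dividing the difference in CPU time between two consecutive trickles by the difference in timestep would give the rate for that interval alone.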
©2024 cpdn.org