Name | hadcm3n_77mm_1980_40_008421169_3 |
Workunit | 8572025 |
Created | 2 Sep 2013, 0:00:41 UTC |
Sent | 2 Sep 2013, 0:26:27 UTC |
Report deadline | 2 Dec 2013, 7:53:38 UTC |
Received | 27 Sep 2013, 1:55:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1290294 |
Run time | 24 days 13 hours 53 min 35 sec |
CPU time | 16 days 21 hours 29 min 50 sec |
Validate state | Invalid |
Credit | 11,508.48 |
Device peak FLOPS | 2.37 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:20:06 (3044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:48:21 (1168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:53:08 (5836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:15:45 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:13 (6000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:14 (6000): No heartbeat from core client for 30 sec - exiting 23:49:40 (6444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:49:41 (6444): No heartbeat from core client for 30 sec - exiting 23:49:42 (6444): No heartbeat from core client for 30 sec - exiting 23:49:43 (6444): No heartbeat from core client for 30 sec - exiting 23:49:44 (6444): No heartbeat from core client for 30 sec - exiting 23:49:45 (6444): No heartbeat from core client for 30 sec - exiting 23:49:46 (6444): No heartbeat from core client for 30 sec - exiting 23:49:47 (6444): No heartbeat from core client for 30 sec - exiting 23:49:48 (6444): No heartbeat from core client for 30 sec - exiting 23:49:49 (6444): No heartbeat from core client for 30 sec - exiting 23:49:50 (6444): No heartbeat from core client for 30 sec - exiting forrtl: Access is denied. 00:57:39 (5172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:19:32 (220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
01:19:33 (220): No heartbeat from core client for 30 sec - exiting 01:19:34 (220): No heartbeat from core client for 30 sec - exiting 01:19:35 (220): No heartbeat from core client for 30 sec - exiting 01:19:36 (220): No heartbeat from core client for 30 sec - exiting 01:19:37 (220): No heartbeat from core client for 30 sec - exiting 01:19:38 (220): No heartbeat from core client for 30 sec - exiting 01:19:39 (220): No heartbeat from core client for 30 sec - exiting 01:19:40 (220): No heartbeat from core client for 30 sec - exiting 01:19:41 (220): No heartbeat from core client for 30 sec - exiting 01:19:42 (220): No heartbeat from core client for 30 sec - exiting 03:12:04 (84): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:17:23 (84): No heartbeat from core client for 30 sec - exiting forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 06:53:19 (5608): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:26:11 (464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:00:23 (1600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... forrtl: Access is denied. 20:58:46 (5196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:01:46 (648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:09:20 (5644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
21:09:21 (5644): No heartbeat from core client for 30 sec - exiting 21:09:22 (5644): No heartbeat from core client for 30 sec - exiting 21:09:23 (5644): No heartbeat from core client for 30 sec - exiting 21:09:24 (5644): No heartbeat from core client for 30 sec - exiting 21:09:25 (5644): No heartbeat from core client for 30 sec - exiting 21:09:26 (5644): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
26 Sep 2013 20:58:21 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 959,040 | 1,708,777 | 1.7818 |
26 Sep 2013 06:41:17 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 933,120 | 1,662,096 | 1.7812 |
25 Sep 2013 15:13:05 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 907,200 | 1,616,287 | 1.7816 |
25 Sep 2013 09:26:13 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 881,280 | 1,570,176 | 1.7817 |
24 Sep 2013 08:44:10 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 855,360 | 1,524,635 | 1.7824 |
23 Sep 2013 16:30:25 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 829,440 | 1,478,710 | 1.7828 |
23 Sep 2013 10:03:40 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 803,520 | 1,432,740 | 1.7831 |
23 Sep 2013 10:03:40 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 777,600 | 1,385,929 | 1.7823 |
21 Sep 2013 18:45:40 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 751,680 | 1,338,729 | 1.7810 |
21 Sep 2013 02:33:24 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 725,760 | 1,292,074 | 1.7803 |
20 Sep 2013 10:44:47 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 699,840 | 1,246,340 | 1.7809 |
19 Sep 2013 18:43:38 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 673,920 | 1,200,593 | 1.7815 |
19 Sep 2013 03:46:06 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 648,000 | 1,154,045 | 1.7809 |
18 Sep 2013 12:19:48 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 622,080 | 1,107,159 | 1.7798 |
17 Sep 2013 20:54:14 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 596,160 | 1,061,076 | 1.7799 |
17 Sep 2013 05:32:08 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 570,240 | 1,015,090 | 1.7801 |
16 Sep 2013 13:48:24 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 544,320 | 968,802 | 1.7798 |
15 Sep 2013 21:27:41 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 518,400 | 922,814 | 1.7801 |
15 Sep 2013 06:03:13 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 492,480 | 876,066 | 1.7789 |
14 Sep 2013 13:49:18 | 1290294 | 15999727 | hadcm3n_77mm_1980_40_008421169_3 | 466,560 | 829,708 | 1.7784 |
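
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That reading is inferred from the figures in the table, not stated on the page. A minimal sketch that recomputes it for the two most recent trickles, with `average_sec_per_timestep` as a hypothetical helper name:

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, rounded to 4 places."""
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds) copied from the first two table rows.
trickles = [(959_040, 1_708_777), (933_120, 1_662_096)]

averages = [average_sec_per_timestep(cpu, ts) for ts, cpu in trickles]
print(averages)  # [1.7818, 1.7812]
```

Both values match the Average (sec/TS) column for those rows, which supports the assumed formula for this table.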
©2024 climateprediction.net