Name | hadcm3n_t6dm_1940_40_007445270_4 |
Workunit | 7642773 |
Created | 18 Sep 2011, 9:29:58 UTC |
Sent | 18 Sep 2011, 10:00:39 UTC |
Report deadline | 18 Dec 2011, 17:27:50 UTC |
Received | 27 Sep 2011, 22:04:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1122757 |
Run time | 9 days 9 hours 15 min 53 sec |
CPU time | 6 days 19 hours 17 min 47 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 16:59:20 (5672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:00:21 (5908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:00:26 (5908): No heartbeat from core client for 30 sec - exiting 17:00:27 (5908): No heartbeat from core client for 30 sec - exiting 17:00:28 (5908): No heartbeat from core client for 30 sec - exiting 17:00:29 (5908): No heartbeat from core client for 30 sec - exiting 17:00:30 (5908): No heartbeat from core client for 30 sec - exiting 17:00:31 (5908): No heartbeat from core client for 30 sec - exiting 17:00:32 (5908): No heartbeat from core client for 30 sec - exiting 17:00:34 (5908): No heartbeat from core client for 30 sec - exiting 17:00:35 (5908): No heartbeat from core client for 30 sec - exiting 17:00:36 (5908): No heartbeat from core client for 30 sec - exiting 17:09:18 (7052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:00:34 (2864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:34:44 (1708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:34:46 (1708): No heartbeat from core client for 30 sec - exiting 21:34:47 (1708): No heartbeat from core client for 30 sec - exiting 21:34:48 (1708): No heartbeat from core client for 30 sec - exiting 21:34:49 (1708): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:24:07 (6840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:24:20 (6840): No heartbeat from core client for 30 sec - exiting 02:24:21 (6840): No heartbeat from core client for 30 sec - exiting 02:24:23 (6840): No heartbeat from core client for 30 sec - exiting 02:24:24 (6840): No heartbeat from core client for 30 sec - exiting 02:24:25 (6840): No heartbeat from core client for 30 sec - exiting 02:24:26 (6840): No heartbeat from core client for 30 sec - exiting 02:24:27 (6840): No heartbeat from core client for 30 sec - exiting 02:24:28 (6840): No heartbeat from core client for 30 sec - exiting 02:24:29 (6840): No heartbeat from core client for 30 sec - exiting 02:24:30 (6840): No heartbeat from core client for 30 sec - exiting 02:24:31 (6840): No heartbeat from core client for 30 sec - exiting 02:24:32 (6840): No heartbeat from core client for 30 sec - exiting 02:24:34 (6840): No heartbeat from core client for 30 sec - exiting 02:24:35 (6840): No heartbeat from core client for 30 sec - exiting 02:24:36 (6840): No heartbeat from core client for 30 sec - exiting 02:24:37 (6840): No heartbeat from core client for 30 sec - exiting 02:24:38 (6840): No heartbeat from core client for 30 sec - exiting 02:24:39 (6840): No heartbeat from core client for 30 sec - exiting 02:24:40 (6840): No heartbeat from core client for 30 sec - exiting 02:24:41 (6840): No heartbeat from core client for 30 sec - exiting 02:24:42 (6840): No heartbeat from core client for 30 sec - exiting 02:24:43 (6840): No heartbeat from core client for 30 sec - exiting 02:24:44 (6840): No heartbeat from core client for 30 sec - exiting 02:24:46 (6840): No heartbeat from core client for 30 sec - exiting 02:24:47 (6840): No heartbeat from core client for 30 sec - exiting 02:24:48 (6840): No heartbeat from core client for 30 sec - exiting 02:24:49 (6840): No heartbeat from core client for 30 sec - exiting 02:24:50 (6840): No heartbeat from core client for 30 sec - exiting 02:24:51 (6840): No heartbeat from core client for 30 sec - exiting 02:24:52 (6840): No heartbeat from core client for 30 sec - exiting 02:24:53 (6840): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Ocean Restart file copy failed on t6dmko.daf0c20 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Sep 2011 12:05:16 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 259,200 | 767,632 | 2.9615 |
26 Sep 2011 14:03:19 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 233,280 | 689,254 | 2.9546 |
25 Sep 2011 15:55:02 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 207,360 | 610,177 | 2.9426 |
24 Sep 2011 17:42:16 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 181,440 | 531,892 | 2.9315 |
23 Sep 2011 19:41:16 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 155,520 | 453,223 | 2.9142 |
22 Sep 2011 21:41:03 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 129,600 | 374,534 | 2.8899 |
21 Sep 2011 23:36:54 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 103,680 | 299,251 | 2.8863 |
21 Sep 2011 06:40:08 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 77,760 | 235,170 | 3.0243 |
20 Sep 2011 06:57:13 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 51,840 | 158,051 | 3.0488 |
19 Sep 2011 09:23:20 | 1122757 | 13395773 | hadcm3n_t6dm_1940_40_007445270_4 | 25,920 | 78,575 | 3.0314 |
©2024 cpdn.org