Name | hadcm3n_4cho_2020_40_008365496_0 |
Workunit | 8516355 |
Created | 11 May 2013, 1:13:42 UTC |
Sent | 11 May 2013, 1:18:34 UTC |
Report deadline | 10 Aug 2013, 8:45:45 UTC |
Received | 9 Jun 2013, 5:46:09 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1036109 |
Run time | 10 days 12 hours 0 min 36 sec |
CPU time | 8 days 23 hours 49 min 36 sec |
Validate state | Invalid |
Credit | 9,020.16 |
Device peak FLOPS | 2.87 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 17:57:03 (3544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:43 (2860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:45:10 (5544): No heartbeat from core client for 30 sec - exiting 17:45:11 (5544): No heartbeat from core client for 30 sec - exiting 17:45:12 (5544): No heartbeat from core client for 30 sec - exiting 17:45:13 (5544): No heartbeat from core client for 30 sec - exiting 17:45:14 (5544): No heartbeat from core client for 30 sec - exiting 17:45:16 (5544): No heartbeat from core client for 30 sec - exiting 17:45:17 (5544): No heartbeat from core client for 30 sec - exiting 17:45:18 (5544): No heartbeat from core client for 30 sec - exiting 17:45:19 (5544): No heartbeat from core client for 30 sec - exiting 17:45:20 (5544): No heartbeat from core client for 30 sec - exiting 17:45:21 (5544): No heartbeat from core client for 30 sec - exiting 17:45:22 (5544): No heartbeat from core client for 30 sec - exiting 17:45:23 (5544): No heartbeat from core client for 30 sec - exiting 17:45:24 (5544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
18:10:36 (5432): No heartbeat from core client for 30 sec - exiting 18:10:37 (5432): No heartbeat from core client for 30 sec - exiting 18:10:38 (5432): No heartbeat from core client for 30 sec - exiting 18:10:39 (5432): No heartbeat from core client for 30 sec - exiting 18:10:40 (5432): No heartbeat from core client for 30 sec - exiting 18:10:41 (5432): No heartbeat from core client for 30 sec - exiting 18:10:42 (5432): No heartbeat from core client for 30 sec - exiting 18:10:44 (5432): No heartbeat from core client for 30 sec - exiting 18:10:45 (5432): No heartbeat from core client for 30 sec - exiting 18:10:46 (5432): No heartbeat from core client for 30 sec - exiting 18:10:47 (5432): No heartbeat from core client for 30 sec - exiting 18:10:48 (5432): No heartbeat from core client for 30 sec - exiting 18:10:49 (5432): No heartbeat from core client for 30 sec - exiting 18:10:50 (5432): No heartbeat from core client for 30 sec - exiting 18:10:51 (5432): No heartbeat from core client for 30 sec - exiting 18:10:52 (5432): No heartbeat from core client for 30 sec - exiting 18:10:53 (5432): No heartbeat from core client for 30 sec - exiting 18:10:54 (5432): No heartbeat from core client for 30 sec - exiting 18:10:56 (5432): No heartbeat from core client for 30 sec - exiting 18:10:57 (5432): No heartbeat from core client for 30 sec - exiting 18:10:58 (5432): No heartbeat from core client for 30 sec - exiting 18:10:59 (5432): No heartbeat from core client for 30 sec - exiting 18:11:00 (5432): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:40:34 (5900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:45:36 (3276): No heartbeat from core client for 30 sec - exiting 17:45:38 (3276): No heartbeat from core client for 30 sec - exiting 17:45:39 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:44:54 (5492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:50:28 (5460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:43:15 (5340): No heartbeat from core client for 30 sec - exiting 17:43:17 (5340): No heartbeat from core client for 30 sec - exiting 17:43:18 (5340): No heartbeat from core client for 30 sec - exiting 17:43:19 (5340): No heartbeat from core client for 30 sec - exiting 17:43:20 (5340): No heartbeat from core client for 30 sec - exiting 17:43:21 (5340): No heartbeat from core client for 30 sec - exiting 17:43:22 (5340): No heartbeat from core client for 30 sec - exiting 17:43:23 (5340): No heartbeat from core client for 30 sec - exiting 17:43:24 (5340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
17:38:56 (5324): No heartbeat from core client for 30 sec - exiting 17:38:58 (5324): No heartbeat from core client for 30 sec - exiting 17:38:59 (5324): No heartbeat from core client for 30 sec - exiting 17:39:00 (5324): No heartbeat from core client for 30 sec - exiting 17:39:01 (5324): No heartbeat from core client for 30 sec - exiting 17:39:02 (5324): No heartbeat from core client for 30 sec - exiting 17:39:03 (5324): No heartbeat from core client for 30 sec - exiting 17:39:04 (5324): No heartbeat from core client for 30 sec - exiting 17:39:05 (5324): No heartbeat from core client for 30 sec - exiting 17:39:06 (5324): No heartbeat from core client for 30 sec - exiting 17:39:07 (5324): No heartbeat from core client for 30 sec - exiting 17:39:09 (5324): No heartbeat from core client for 30 sec - exiting 17:39:10 (5324): No heartbeat from core client for 30 sec - exiting 17:39:11 (5324): No heartbeat from core client for 30 sec - exiting 17:39:12 (5324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
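The stderr text above mixes three kinds of events: lost heartbeats from the core client, suspend requests, and model crashes. A minimal sketch of tallying them with regular expressions follows. It runs on a short excerpt in the same shape as the log, not on the full text, and the names `PATTERNS` and `tally` are hypothetical helpers introduced only for this illustration.

```python
import re
from collections import Counter

# Short excerpt in the same shape as the stderr text above (not the full log).
excerpt = (
    "17:57:03 (3544): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 "
    "Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 "
    "Sorry, too many model crashes! :-( Called boinc_finish"
)

# Hypothetical event labels mapped to the patterns that identify them.
PATTERNS = {
    "heartbeat_lost": r"\d{2}:\d{2}:\d{2} \(\d+\): No heartbeat from core client",
    "suspend_request": r"Suspend request from BOINC",
    "model_crash": r"Model crashed:",
}

def tally(text):
    """Count how many times each event pattern occurs in the text."""
    return Counter({name: len(re.findall(p, text)) for name, p in PATTERNS.items()})

counts = tally(excerpt)
print(counts)
```

Pasting the full stderr text into `excerpt` would give the counts for the whole run.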
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
09 Jun 2013 01:46:56 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 751,680 | 896,622 | 1.1928 |
08 Jun 2013 17:07:34 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 725,760 | 865,588 | 1.1927 |
08 Jun 2013 08:58:44 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 699,840 | 836,307 | 1.1950 |
08 Jun 2013 00:44:58 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 673,920 | 806,648 | 1.1969 |
07 Jun 2013 16:27:02 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 648,000 | 777,000 | 1.1991 |
05 Jun 2013 19:53:11 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 622,080 | 745,557 | 1.1985 |
04 Jun 2013 17:08:54 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 596,160 | 714,726 | 1.1989 |
02 Jun 2013 16:31:49 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 570,240 | 684,162 | 1.1998 |
02 Jun 2013 08:23:37 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 544,320 | 654,959 | 1.2033 |
01 Jun 2013 23:10:03 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 518,400 | 621,788 | 1.1994 |
01 Jun 2013 14:21:35 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 492,480 | 590,424 | 1.1989 |
01 Jun 2013 06:17:04 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 466,560 | 561,265 | 1.2030 |
31 May 2013 23:05:33 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 440,640 | 531,790 | 1.2069 |
30 May 2013 23:07:29 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 414,720 | 501,410 | 1.2090 |
29 May 2013 19:01:33 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 388,800 | 470,903 | 1.2112 |
27 May 2013 19:16:20 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 362,880 | 440,272 | 1.2133 |
26 May 2013 14:46:55 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 336,960 | 407,630 | 1.2097 |
22 May 2013 18:44:39 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 311,040 | 375,778 | 1.2081 |
20 May 2013 19:15:36 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 285,120 | 344,340 | 1.2077 |
19 May 2013 16:40:11 | 1036109 | 15775406 | hadcm3n_4cho_2020_40_008365496_0 | 259,200 | 313,307 | 1.2087 |
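The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that assumption against three rows copied from the table. The function name `sec_per_timestep` is a hypothetical helper, and the relationship is inferred from the numbers, not stated by the page.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
rows = [
    (751_680, 896_622, 1.1928),
    (725_760, 865_588, 1.1927),
    (259_200, 313_307, 1.2087),
]

def sec_per_timestep(cpu_time_sec, timestep):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return cpu_time_sec / timestep

# Round to four decimals to match the precision shown in the table.
computed = [round(sec_per_timestep(cpu, ts), 4) for ts, cpu, _ in rows]

for (ts, cpu, reported), value in zip(rows, computed):
    print(f"{ts:>9,} {cpu:>9,} computed={value:.4f} reported={reported:.4f}")
```

For these rows the computed values agree with the reported column to four decimal places.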
©2024 cpdn.org