Name | hadcm3n_ygml_1940_40_007538622_4 |
Workunit | 7735854 |
Created | 15 Dec 2011, 20:07:24 UTC |
Sent | 15 Dec 2011, 20:11:41 UTC |
Report deadline | 16 Mar 2012, 3:38:52 UTC |
Received | 8 Jan 2012, 8:39:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1120190 |
Run time | 6 days 12 hours 55 min 28 sec |
CPU time | 4 days 5 hours 50 min 43 sec |
Validate state | Invalid |
Credit | 3,421.44 |
Device peak FLOPS | 1.94 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 CPDN Monitor - Quit request from BOINC... 09:25:41 (2222): No heartbeat from core client for 30 sec - exiting 09:25:42 (2222): No heartbeat from core client for 30 sec - exiting 09:25:43 (2222): No heartbeat from core client for 30 sec - exiting 09:25:44 (2222): No heartbeat from core client for 30 sec - exiting 09:25:45 (2222): No heartbeat from core client for 30 sec - exiting 09:25:46 (2222): No heartbeat from core client for 30 sec - exiting 09:25:47 (2222): No heartbeat from core client for 30 sec - exiting 09:25:48 (2222): No heartbeat from core client for 30 sec - exiting 09:25:49 (2222): No heartbeat from core client for 30 sec - exiting 09:25:50 (2222): No heartbeat from core client for 30 sec - exiting 09:25:51 (2222): No heartbeat from core client for 30 sec - exiting 09:25:52 (2222): No heartbeat from core client for 30 sec - exiting 09:25:53 (2222): No heartbeat from core client for 30 sec - exiting 09:25:54 (2222): No heartbeat from core client for 30 sec - exiting 09:25:55 (2222): No heartbeat from core client for 30 sec - exiting 09:25:56 (2222): No heartbeat from core client for 30 sec - exiting 09:25:57 (2222): No heartbeat from core client for 30 sec - exiting 09:25:58 (2222): No heartbeat from core client for 30 sec - exiting 09:25:59 (2222): No heartbeat from core client for 30 sec - exiting 09:26:01 (2222): No heartbeat from core client for 30 sec - exiting 09:26:02 (2222): No heartbeat from core client for 30 sec - exiting 09:26:03 (2222): No heartbeat from core client for 30 sec - exiting 09:26:04 (2222): No heartbeat from core client for 30 sec - exiting 09:26:05 (2222): No heartbeat from core client for 30 sec - exiting 09:26:06 (2222): No heartbeat from core client for 30 sec - exiting 09:26:07 (2222): No heartbeat from core 
client for 30 sec - exiting 09:26:08 (2222): No heartbeat from core client for 30 sec - exiting 09:26:09 (2222): No heartbeat from core client for 30 sec - exiting 09:26:10 (2222): No heartbeat from core client for 30 sec - exiting 09:26:11 (2222): No heartbeat from core client for 30 sec - exiting 09:26:12 (2222): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 11:59:42 (1761): No heartbeat from core client for 30 sec - exiting 11:59:43 (1761): No heartbeat from core client for 30 sec - exiting 11:59:44 (1761): No heartbeat from core client for 30 sec - exiting 11:59:45 (1761): No heartbeat from core client for 30 sec - exiting 11:59:46 (1761): No heartbeat from core client for 30 sec - exiting 11:59:47 (1761): No heartbeat from core client for 30 sec - exiting 11:59:48 (1761): No heartbeat from core client for 30 sec - exiting 11:59:49 (1761): No heartbeat from core client for 30 sec - exiting 11:59:50 (1761): No heartbeat from core client for 30 sec - exiting 11:59:51 (1761): No heartbeat from core client for 30 sec - exiting 11:59:52 (1761): No heartbeat from core client for 30 sec - exiting 11:59:53 (1761): No heartbeat from core client for 30 sec - exiting 11:59:54 (1761): No heartbeat from core client for 30 sec - exiting 11:59:55 (1761): No heartbeat from core client for 30 sec - exiting 11:59:56 (1761): No heartbeat from core client for 30 sec - exiting 11:59:57 (1761): No heartbeat from core client for 30 sec - exiting 11:59:58 (1761): No heartbeat from core client for 30 sec - exiting 11:59:59 (1761): No heartbeat from core client for 30 sec - exiting 12:00:00 (1761): No heartbeat from core client for 30 sec - exiting 12:00:01 (1761): No heartbeat from core client for 30 sec - exiting 12:00:02 (1761): No heartbeat from core client for 30 sec - exiting 12:00:03 (1761): No heartbeat from core client for 30 sec - exiting 12:00:04 (1761): No heartbeat from core client 
for 30 sec - exiting 12:00:05 (1761): No heartbeat from core client for 30 sec - exiting 12:00:06 (1761): No heartbeat from core client for 30 sec - exiting 12:00:07 (1761): No heartbeat from core client for 30 sec - exiting 12:00:08 (1761): No heartbeat from core client for 30 sec - exiting 12:00:09 (1761): No heartbeat from core client for 30 sec - exiting 12:00:11 (1761): No heartbeat from core client for 30 sec - exiting 12:00:12 (1761): No heartbeat from core client for 30 sec - exiting 12:00:14 (1761): No heartbeat from core client for 30 sec - exiting 12:00:15 (1761): No heartbeat from core client for 30 sec - exiting 12:00:16 (1761): No heartbeat from core client for 30 sec - exiting 12:00:17 (1761): No heartbeat from core client for 30 sec - exiting 12:00:18 (1761): No heartbeat from core client for 30 sec - exiting 12:00:19 (1761): No heartbeat from core client for 30 sec - exiting 12:00:20 (1761): No heartbeat from core client for 30 sec - exiting 12:00:21 (1761): No heartbeat from core client for 30 sec - exiting 12:00:22 (1761): No heartbeat from core client for 30 sec - exiting 12:00:23 (1761): No heartbeat from core client for 30 sec - exiting 12:00:24 (1761): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 
12:11:16 (1767): No heartbeat from core client for 30 sec - exiting 12:11:17 (1767): No heartbeat from core client for 30 sec - exiting 12:11:18 (1767): No heartbeat from core client for 30 sec - exiting 12:11:19 (1767): No heartbeat from core client for 30 sec - exiting 12:11:20 (1767): No heartbeat from core client for 30 sec - exiting 12:11:21 (1767): No heartbeat from core client for 30 sec - exiting 12:11:22 (1767): No heartbeat from core client for 30 sec - exiting 12:11:23 (1767): No heartbeat from core client for 30 sec - exiting 12:11:24 (1767): No heartbeat from core client for 30 sec - exiting 12:11:25 (1767): No heartbeat from core client for 30 sec - exiting 12:11:27 (1767): No heartbeat from core client for 30 sec - exiting 12:11:28 (1767): No heartbeat from core client for 30 sec - exiting 12:11:29 (1767): No heartbeat from core client for 30 sec - exiting 12:11:30 (1767): No heartbeat from core client for 30 sec - exiting 12:11:31 (1767): No heartbeat from core client for 30 sec - exiting 12:11:32 (1767): No heartbeat from core client for 30 sec - exiting 12:11:33 (1767): No heartbeat from core client for 30 sec - exiting 12:11:34 (1767): No heartbeat from core client for 30 sec - exiting 12:11:35 (1767): No heartbeat from core client for 30 sec - exiting 12:11:36 (1767): No heartbeat from core client for 30 sec - exiting 12:11:37 (1767): No heartbeat from core client for 30 sec - exiting 12:11:38 (1767): No heartbeat from core client for 30 sec - exiting 12:11:39 (1767): No heartbeat from core client for 30 sec - exiting 12:11:40 (1767): No heartbeat from core client for 30 sec - exiting 12:11:42 (1767): No heartbeat from core client for 30 sec - exiting 12:11:43 (1767): No heartbeat from core client for 30 sec - exiting 12:11:44 (1767): No heartbeat from core client for 30 sec - exiting 12:11:45 (1767): No heartbeat from core client for 30 sec - exiting 12:11:46 (1767): No heartbeat from core client for 30 sec - exiting 12:11:48 (1767): No 
heartbeat from core client for 30 sec - exiting 12:11:49 (1767): No heartbeat from core client for 30 sec - exiting 12:11:50 (1767): No heartbeat from core client for 30 sec - exiting 12:11:51 (1767): No heartbeat from core client for 30 sec - exiting 12:11:52 (1767): No heartbeat from core client for 30 sec - exiting 12:11:53 (1767): No heartbeat from core client for 30 sec - exiting 12:11:54 (1767): No heartbeat from core client for 30 sec - exiting 12:11:55 (1767): No heartbeat from core client for 30 sec - exiting 12:11:56 (1767): No heartbeat from core client for 30 sec - exiting 12:11:57 (1767): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. 
tmp/pipe_dummy 2048 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
08 Jan 2012 05:29:15 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 285,120 | 490,161 | 1.7191 |
07 Jan 2012 11:22:45 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 259,200 | 439,291 | 1.6948 |
06 Jan 2012 21:19:33 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 233,280 | 389,058 | 1.6678 |
06 Jan 2012 07:07:42 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 207,360 | 338,474 | 1.6323 |
05 Jan 2012 17:06:44 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 181,440 | 288,217 | 1.5885 |
05 Jan 2012 01:57:49 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 155,520 | 237,746 | 1.5287 |
04 Jan 2012 11:56:54 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 129,600 | 187,377 | 1.4458 |
03 Jan 2012 21:48:04 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 103,680 | 136,859 | 1.3200 |
03 Jan 2012 07:46:01 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 77,760 | 86,755 | 1.1157 |
02 Jan 2012 17:01:45 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 51,840 | 37,054 | 0.7148 |
16 Dec 2011 10:46:59 | 1120190 | 13783939 | hadcm3n_ygml_1940_40_007538622_4 | 25,920 | 50,468 | 1.9471 |
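
The "Average (sec/TS)" column appears to be the reported CPU time divided by the timestep count of the same trickle. The short Python sketch below checks that reading against the rows above. The variable and function names are illustrative, and the formula is an inference from the numbers, not something the page states.

```python
# (timestep, CPU time in seconds, reported average sec/TS), copied from the trickle table.
trickles = [
    (285_120, 490_161, 1.7191),
    (259_200, 439_291, 1.6948),
    (233_280, 389_058, 1.6678),
    (207_360, 338_474, 1.6323),
    (181_440, 288_217, 1.5885),
    (155_520, 237_746, 1.5287),
    (129_600, 187_377, 1.4458),
    (103_680, 136_859, 1.3200),
    (77_760, 86_755, 1.1157),
    (51_840, 37_054, 0.7148),
    (25_920, 50_468, 1.9471),
]


def seconds_per_timestep(cpu_seconds: int, timestep: int) -> float:
    """Assumed formula: CPU time divided by the timestep reached."""
    return cpu_seconds / timestep


for timestep, cpu_seconds, reported in trickles:
    computed = seconds_per_timestep(cpu_seconds, timestep)
    # Compare against the table value, which is printed to four decimals.
    status = "ok" if abs(computed - reported) < 1e-4 else "MISMATCH"
    print(f"{timestep:>8,} TS  {computed:.4f} vs {reported:.4f}  {status}")
```

Under this reading, the low average in the 02 Jan 2012 row (0.7148) follows directly from its CPU time (37,054 s) being lower than that of the earlier 16 Dec 2011 trickle (50,468 s). The page gives no reason for that drop.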