Name | hadcm3n_o2pz_2020_40_008409962_4 |
Workunit | 8560818 |
Created | 12 Mar 2014, 1:08:27 UTC |
Sent | 12 Mar 2014, 1:08:33 UTC |
Report deadline | 11 Jun 2014, 8:35:44 UTC |
Received | 29 Mar 2014, 1:36:07 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1172912 |
Run time | 8 days 13 hours 50 min 53 sec |
CPU time | 8 days 0 hours 39 min 36 sec |
Validate state | Invalid |
Credit | 7,153.92 |
Device peak FLOPS | 3.29 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> 08:35:29 (7620): No heartbeat from core client for 30 sec - exiting 08:35:30 (7620): No heartbeat from core client for 30 sec - exiting 08:35:31 (7620): No heartbeat from core client for 30 sec - exiting 08:35:32 (7620): No heartbeat from core client for 30 sec - exiting 08:35:33 (7620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:35:34 (7620): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:18:49 (5940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 16:07:06 (12920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
17:06:37 (9736): No heartbeat from core client for 30 sec - exiting 17:06:38 (9736): No heartbeat from core client for 30 sec - exiting 17:06:39 (9736): No heartbeat from core client for 30 sec - exiting 17:06:40 (9736): No heartbeat from core client for 30 sec - exiting 17:06:41 (9736): No heartbeat from core client for 30 sec - exiting 17:06:42 (9736): No heartbeat from core client for 30 sec - exiting 17:06:43 (9736): No heartbeat from core client for 30 sec - exiting 17:06:44 (9736): No heartbeat from core client for 30 sec - exiting 17:06:45 (9736): No heartbeat from core client for 30 sec - exiting 17:06:46 (9736): No heartbeat from core client for 30 sec - exiting 17:06:47 (9736): No heartbeat from core client for 30 sec - exiting 17:06:48 (9736): No heartbeat from core client for 30 sec - exiting 17:06:49 (9736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:38:51 (7216): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:41:17 (10164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:19:13 (7832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:55:05 (6992): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
16:55:07 (6992): No heartbeat from core client for 30 sec - exiting 16:55:08 (6992): No heartbeat from core client for 30 sec - exiting 16:55:09 (6992): No heartbeat from core client for 30 sec - exiting 16:55:10 (6992): No heartbeat from core client for 30 sec - exiting 16:55:11 (6992): No heartbeat from core client for 30 sec - exiting 16:55:12 (6992): No heartbeat from core client for 30 sec - exiting 16:55:13 (6992): No heartbeat from core client for 30 sec - exiting 16:55:14 (6992): No heartbeat from core client for 30 sec - exiting 16:55:15 (6992): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 16:56:19 (7040): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:56:20 (7040): No heartbeat from core client for 30 sec - exiting 16:56:21 (7040): No heartbeat from core client for 30 sec - exiting 16:56:22 (7040): No heartbeat from core client for 30 sec - exiting 16:56:23 (7040): No heartbeat from core client for 30 sec - exiting 16:56:24 (7040): No heartbeat from core client for 30 sec - exiting 16:56:25 (7040): No heartbeat from core client for 30 sec - exiting 16:56:26 (7040): No heartbeat from core client for 30 sec - exiting 16:56:27 (7040): No heartbeat from core client for 30 sec - exiting 16:56:28 (7040): No heartbeat from core client for 30 sec - exiting 16:56:29 (7040): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 17:27:33 (9492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:28:07 (10260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
08:30:30 (6936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:30:31 (6936): No heartbeat from core client for 30 sec - exiting 08:30:32 (6936): No heartbeat from core client for 30 sec - exiting 08:30:33 (6936): No heartbeat from core client for 30 sec - exiting 08:30:34 (6936): No heartbeat from core client for 30 sec - exiting 08:30:35 (6936): No heartbeat from core client for 30 sec - exiting 08:30:36 (6936): No heartbeat from core client for 30 sec - exiting 08:30:37 (6936): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:57:41 (4460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:58:23 (8784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:25:09 (464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:25:10 (464): No heartbeat from core client for 30 sec - exiting 19:25:11 (464): No heartbeat from core client for 30 sec - exiting 19:25:12 (464): No heartbeat from core client for 30 sec - exiting 19:26:00 (496): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 08:23:26 (5916): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 22:09:08 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:09:11 (4620): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 09:25:50 (6428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:06:59 (7436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 10:12:29 (6540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7092, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
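
The exit status above is shown in two forms, `-226` and `0xFFFFFF1E`. These are the same value, read as a signed or an unsigned 32-bit integer. A minimal sketch of the conversion follows; the helper names `to_unsigned32` and `to_signed32` are hypothetical and are not part of BOINC.

```python
# Hypothetical helpers (not BOINC APIs) showing how a negative exit status
# maps to its 32-bit two's-complement hexadecimal form and back.

def to_unsigned32(code: int) -> str:
    """Format a signed exit code as 8-digit unsigned 32-bit hex."""
    return f"0x{code & 0xFFFFFFFF:08X}"


def to_signed32(hex_text: str) -> int:
    """Interpret a 32-bit hex string as a signed integer."""
    value = int(hex_text, 16)
    # If the sign bit is set, subtract 2**32 to recover the negative value.
    return value - (1 << 32) if value & 0x80000000 else value


print(to_unsigned32(-226))
print(to_signed32("0xFFFFFF1E"))
```
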
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
28 Mar 2014 06:01:47 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 596,160 | 665,922 | 1.1170 |
27 Mar 2014 19:35:57 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 570,240 | 637,219 | 1.1175 |
26 Mar 2014 23:52:52 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 544,320 | 608,368 | 1.1177 |
26 Mar 2014 05:39:42 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 518,400 | 579,701 | 1.1183 |
25 Mar 2014 21:13:12 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 492,480 | 550,889 | 1.1186 |
25 Mar 2014 03:35:33 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 466,560 | 521,944 | 1.1187 |
24 Mar 2014 03:32:05 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 440,640 | 493,215 | 1.1193 |
23 Mar 2014 08:58:53 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 414,720 | 464,448 | 1.1199 |
23 Mar 2014 00:17:39 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 388,800 | 435,549 | 1.1202 |
22 Mar 2014 05:54:34 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 362,880 | 406,764 | 1.1209 |
21 Mar 2014 21:27:54 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 336,960 | 378,017 | 1.1218 |
21 Mar 2014 02:19:21 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 311,040 | 349,213 | 1.1227 |
20 Mar 2014 08:25:50 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 285,120 | 319,876 | 1.1219 |
19 Mar 2014 23:59:07 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 259,200 | 291,056 | 1.1229 |
19 Mar 2014 07:36:27 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 233,280 | 262,398 | 1.1248 |
15 Mar 2014 20:09:51 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 207,360 | 233,625 | 1.1267 |
15 Mar 2014 05:00:10 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 181,440 | 204,698 | 1.1282 |
14 Mar 2014 20:28:07 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 155,520 | 175,973 | 1.1315 |
14 Mar 2014 05:28:23 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 129,600 | 147,203 | 1.1358 |
13 Mar 2014 20:16:24 | 1172912 | 16365872 | hadcm3n_o2pz_2020_40_008409962_4 | 103,680 | 118,119 | 1.1393 |
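
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against two rows of the table; the function name `seconds_per_timestep` is hypothetical, and the formula is inferred from the figures rather than taken from the project's documentation.

```python
# Assumption: Average (sec/TS) = CPU Time (sec) / Timestep, rounded to 4 places.
# The function name is illustrative only.

def seconds_per_timestep(cpu_seconds: str, timestep: str) -> float:
    """Compute seconds per timestep from the comma-formatted table values."""
    cpu = int(cpu_seconds.replace(",", ""))
    steps = int(timestep.replace(",", ""))
    return round(cpu / steps, 4)


# Latest trickle (28 Mar 2014) and earliest listed trickle (13 Mar 2014).
latest = seconds_per_timestep("665,922", "596,160")
earliest = seconds_per_timestep("118,119", "103,680")
print(latest, earliest)
```

Under this assumption both computed values agree with the table (1.1170 and 1.1393).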
©2024 climateprediction.net