Name | hadcm3n_p3kh_1940_40_007420425_2 |
Workunit | 7618060 |
Created | 24 Nov 2011, 6:11:09 UTC |
Sent | 24 Nov 2011, 6:11:17 UTC |
Report deadline | 23 Feb 2012, 13:38:28 UTC |
Received | 12 Jan 2012, 17:42:31 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1084131 |
Run time | 25 days 18 hours 53 min 53 sec |
CPU time | 15 days 2 hours 45 min 36 sec |
Validate state | Invalid |
Credit | 11,197.44 |
Device peak FLOPS | 2.86 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:28:45 (71840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:28:46 (71840): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 07:35:55 (100064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:35:58 (100064): No heartbeat from core client for 30 sec - exiting 07:35:59 (100064): No heartbeat from core client for 30 sec - exiting 07:36:00 (100064): No heartbeat from core client for 30 sec - exiting 07:36:01 (100064): No heartbeat from core client for 30 sec - exiting 07:36:02 (100064): No heartbeat from core client for 30 sec - exiting 07:36:03 (100064): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:57:53 (95380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 
08:57:58 (95380): No heartbeat from core client for 30 sec - exiting 08:57:59 (95380): No heartbeat from core client for 30 sec - exiting 08:58:00 (95380): No heartbeat from core client for 30 sec - exiting 08:58:01 (95380): No heartbeat from core client for 30 sec - exiting 08:58:02 (95380): No heartbeat from core client for 30 sec - exiting 08:58:03 (95380): No heartbeat from core client for 30 sec - exiting 08:58:04 (95380): No heartbeat from core client for 30 sec - exiting 08:58:05 (95380): No heartbeat from core client for 30 sec - exiting 08:58:06 (95380): No heartbeat from core client for 30 sec - exiting 08:58:07 (95380): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:22:04 (91428): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:22:04 (8036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:22:06 (8036): No heartbeat from core client for 30 sec - exiting 21:22:07 (8036): No heartbeat from core client for 30 sec - exiting 21:22:08 (8036): No heartbeat from core client for 30 sec - exiting 21:22:09 (8036): No heartbeat from core client for 30 sec - exiting 21:22:10 (8036): No heartbeat from core client for 30 sec - exiting 23:44:28 (104332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 00:59:40 (91820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:59:41 (91820): No heartbeat from core client for 30 sec - exiting 11:02:20 (106060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:02:21 (106060): No heartbeat from core client for 30 sec - exiting 11:02:22 (106060): No heartbeat from core client for 30 sec - exiting 11:02:23 (106060): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:07:45 (4836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:17:17 (8376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:36:32 (16732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:36:33 (16732): No heartbeat from core client for 30 sec - exiting 15:51:00 (7800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
15:51:01 (7800): No heartbeat from core client for 30 sec - exiting 15:51:02 (7800): No heartbeat from core client for 30 sec - exiting 15:51:03 (7800): No heartbeat from core client for 30 sec - exiting 15:51:04 (7800): No heartbeat from core client for 30 sec - exiting 15:51:05 (7800): No heartbeat from core client for 30 sec - exiting 15:51:06 (7800): No heartbeat from core client for 30 sec - exiting 15:51:07 (7800): No heartbeat from core client for 30 sec - exiting 15:51:08 (7800): No heartbeat from core client for 30 sec - exiting 15:51:09 (7800): No heartbeat from core client for 30 sec - exiting 15:51:10 (7800): No heartbeat from core client for 30 sec - exiting 03:14:02 (2256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:35:44 (8844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:49:25 (10940): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 12:49:41 (10940): No heartbeat from core client for 30 sec - exiting 12:49:42 (10940): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23120, iMonCtr=1 Model crash detected, will try to restart... </stderr_txt> ]]> |
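
The Exit status row gives the same code as a signed decimal and as a hexadecimal word, and the Run time and CPU time rows are given as day/hour/minute/second durations. The following is a minimal sketch for checking those figures. It assumes the hexadecimal form is the 32-bit two's-complement encoding of the signed value. The helper names (`to_unsigned32`, `to_signed32`, `duration_seconds`) are illustrative and are not part of BOINC or CPDN.

```python
# Assumption: the hex form on the result page is the 32-bit two's-complement
# view of the signed exit code. All helper names here are hypothetical.
MASK32 = 0xFFFFFFFF


def to_unsigned32(code: int) -> int:
    """Return the 32-bit unsigned word that encodes a signed exit code."""
    return code & MASK32


def to_signed32(word: int) -> int:
    """Return the signed value encoded by a 32-bit unsigned word."""
    return word - (1 << 32) if word & 0x80000000 else word


def duration_seconds(days: int, hours: int, minutes: int, seconds: int) -> int:
    """Convert a days/hours/minutes/seconds duration to whole seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# Values copied from the Exit status, Run time and CPU time rows.
exit_hex = f"0x{to_unsigned32(-226):08X}"
run_time = duration_seconds(25, 18, 53, 53)
cpu_time = duration_seconds(15, 2, 45, 36)
cpu_fraction = cpu_time / run_time

print(exit_hex)
print(to_signed32(0xFFFFFF1E))
print(f"CPU time / run time = {cpu_fraction:.1%}")
```

Under that assumption the conversion reproduces the `0xFFFFFF1E` shown in the Exit status row, and the CPU time comes out at a little under 60% of the run time, which may reflect the many suspend requests recorded in the stderr output.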
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Jan 2012 14:38:28 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 933,120 | 1,283,155 | 1.3751 |
03 Jan 2012 13:07:20 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 907,200 | 1,247,436 | 1.3750 |
31 Dec 2011 14:32:44 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 881,280 | 1,211,093 | 1.3742 |
30 Dec 2011 02:58:16 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 855,360 | 1,174,849 | 1.3735 |
28 Dec 2011 14:46:14 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 829,440 | 1,139,121 | 1.3734 |
27 Dec 2011 03:06:11 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 803,520 | 1,103,430 | 1.3732 |
26 Dec 2011 00:28:32 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 777,600 | 1,067,764 | 1.3732 |
24 Dec 2011 16:21:22 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 751,680 | 1,032,282 | 1.3733 |
22 Dec 2011 00:10:46 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 725,760 | 996,220 | 1.3727 |
21 Dec 2011 03:41:42 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 699,840 | 961,092 | 1.3733 |
20 Dec 2011 10:18:08 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 673,920 | 926,426 | 1.3747 |
19 Dec 2011 20:29:22 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 648,000 | 891,674 | 1.3760 |
19 Dec 2011 04:46:47 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 622,080 | 857,104 | 1.3778 |
18 Dec 2011 09:32:39 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 596,160 | 821,886 | 1.3786 |
17 Dec 2011 16:50:41 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 570,240 | 786,646 | 1.3795 |
16 Dec 2011 06:40:59 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 544,320 | 751,511 | 1.3806 |
15 Dec 2011 08:49:33 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 518,400 | 716,197 | 1.3816 |
14 Dec 2011 08:32:39 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 492,480 | 680,465 | 1.3817 |
12 Dec 2011 22:47:09 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 466,560 | 644,326 | 1.3810 |
12 Dec 2011 00:21:05 | 1084131 | 13658086 | hadcm3n_p3kh_1940_40_007420425_2 | 440,640 | 609,255 | 1.3827 |
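
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against three rows copied from the table. The division rule is an inference from the figures, not something stated on the page, and the function names are illustrative.

```python
def parse_int(field: str) -> int:
    """Parse a table field such as '1,283,155' into an integer."""
    return int(field.replace(",", "").strip())


def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Assumed rule: cumulative CPU seconds divided by cumulative timesteps."""
    return cpu_time_sec / timestep


# (Timestep, CPU Time (sec), reported Average (sec/TS)) copied from the table.
rows = [
    ("933,120", "1,283,155", 1.3751),
    ("907,200", "1,247,436", 1.3750),
    ("440,640", "609,255", 1.3827),
]

results = []
for timestep, cpu_time, reported in rows:
    avg = average_sec_per_timestep(parse_int(cpu_time), parse_int(timestep))
    results.append((round(avg, 4), reported))
    print(f"{timestep:>9} steps: computed {avg:.4f}, reported {reported:.4f}")
```

For these three rows the computed value matches the reported one to four decimal places, so the column can be treated as a running average rather than a per-interval rate.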