Name | hadam3p_eu_2isw_1985_1_008117181_0 |
Workunit | 8272295 |
Created | 8 Aug 2012, 11:36:16 UTC |
Sent | 8 Aug 2012, 11:36:31 UTC |
Report deadline | 21 Jul 2013, 16:56:31 UTC |
Received | 23 Aug 2012, 3:05:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1216229 |
Run time | 5 days 5 hours 1 min 11 sec |
CPU time | 4 days 20 hours 33 min 6 sec |
Validate state | Invalid |
Credit | 1,591.48 |
Device peak FLOPS | 2.27 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:26:37 (7504): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 17:26:38 (7504): No heartbeat from core client for 30 sec - exiting 17:26:39 (7504): No heartbeat from core client for 30 sec - exiting 17:26:40 (7504): No heartbeat from core client for 30 sec - exiting 17:26:41 (7504): No heartbeat from core client for 30 sec - exiting 02:39:33 (3088): No heartbeat from core client for 30 sec - exiting 02:39:34 (3088): No heartbeat from core client for 30 sec - exiting 02:39:35 (3088): No heartbeat from core client for 30 sec - exiting 02:39:36 (3088): No heartbeat from core client for 30 sec - exiting 02:39:37 (3088): No heartbeat from core client for 30 sec - exiting 02:39:38 (3088): No heartbeat from core client for 30 sec - exiting 02:39:39 (3088): No heartbeat from core client for 30 sec - exiting 02:39:40 (3088): No heartbeat from core client for 30 sec - exiting 02:39:41 (3088): No heartbeat from core client for 30 sec - exiting 02:39:42 (3088): No heartbeat from core client for 30 sec - exiting 02:39:43 (3088): No heartbeat from core client for 30 sec - exiting 02:39:44 (3088): No heartbeat from core client for 30 sec - exiting 02:39:45 (3088): No heartbeat from core client for 30 sec - exiting 02:39:46 (3088): No heartbeat from core client for 30 sec - exiting 02:39:47 (3088): No heartbeat from core client for 30 sec - exiting 02:39:48 (3088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:39:49 (3088): No heartbeat from core client for 30 sec - exiting 04:42:08 (7988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
04:42:14 (7988): No heartbeat from core client for 30 sec - exiting 04:42:15 (7988): No heartbeat from core client for 30 sec - exiting 04:42:16 (7988): No heartbeat from core client for 30 sec - exiting Global Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9968, iMonCtr=2 Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10032, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:23:11 (2360): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2360, selfPID=7560, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:27:35 (4752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:51:34 (6256): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... 11:51:36 (6256): No heartbeat from core client for 30 sec - exiting 11:51:37 (6256): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> ]]> |
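The exit status above is shown both as a signed value (-226) and as a 32-bit hexadecimal value (0xFFFFFF1E). The two forms are the same number in two's-complement notation. A minimal sketch of the conversion follows; the helper names `to_unsigned32` and `to_signed32` are illustrative and not part of BOINC.

```python
# Convert between a signed exit code and its 32-bit two's-complement form.
# Helper names are hypothetical, chosen only for this illustration.

def to_unsigned32(code: int) -> int:
    """Return the unsigned 32-bit representation of a signed integer."""
    return code & 0xFFFFFFFF


def to_signed32(value: int) -> int:
    """Interpret an unsigned 32-bit value as a signed integer."""
    value &= 0xFFFFFFFF
    # If the top bit is set, the value is negative in two's complement.
    return value - 0x100000000 if value & 0x80000000 else value


if __name__ == "__main__":
    print(f"0x{to_unsigned32(-226):08X}")  # hex form of -226
    print(to_signed32(0xFFFFFF1E))         # signed form of 0xFFFFFF1E
```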
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
21 Aug 2012 10:55:27 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 92,256 | 401,271 | 4.3495 |
19 Aug 2012 09:59:31 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 80,736 | 351,595 | 4.3549 |
17 Aug 2012 23:12:17 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 69,216 | 301,622 | 4.3577 |
16 Aug 2012 22:42:40 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 57,696 | 252,609 | 4.3783 |
14 Aug 2012 07:46:16 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 46,176 | 202,928 | 4.3947 |
12 Aug 2012 15:31:58 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 34,656 | 153,328 | 4.4243 |
11 Aug 2012 06:06:27 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 23,136 | 103,516 | 4.4742 |
10 Aug 2012 07:46:37 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 11,617 | 53,005 | 4.5627 |
10 Aug 2012 06:46:26 | 1216229 | 15073641 | hadam3p_eu_2isw_1985_1_008117181_0 | 11,616 | 52,396 | 4.5107 |
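The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an inference from the values in the table. The sketch below recomputes the column from three of the rows above. The function name `avg_sec_per_timestep` is illustrative only.

```python
# Recompute "Average (sec/TS)" as CPU time / timestep for selected trickles.
# Assumption: the column is a simple ratio; the page does not document this.

def avg_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return average CPU seconds per model timestep, rounded to 4 places."""
    return round(cpu_time_sec / timestep, 4)


# (time sent, timestep, CPU time in seconds, reported average) from the table
trickles = [
    ("21 Aug 2012 10:55:27", 92_256, 401_271, 4.3495),
    ("19 Aug 2012 09:59:31", 80_736, 351_595, 4.3549),
    ("10 Aug 2012 06:46:26", 11_616, 52_396, 4.5107),
]

if __name__ == "__main__":
    for sent, timestep, cpu, reported in trickles:
        computed = avg_sec_per_timestep(cpu, timestep)
        print(f"{sent}: computed {computed}, reported {reported}")
```

Under that assumption, the falling average from the oldest to the newest trickle means the per-timestep cost dropped slightly as the run progressed.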