Name | hadcm3n_n2b5_1920_40_008410433_0 |
Workunit | 8561289 |
Created | 22 Aug 2013, 5:28:20 UTC |
Sent | 22 Aug 2013, 5:53:17 UTC |
Report deadline | 21 Nov 2013, 13:20:28 UTC |
Received | 6 Sep 2013, 5:13:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1283909 |
Run time | 2 days 18 hours 11 min 52 sec |
CPU time | 2 days 16 hours 29 min 15 sec |
Validate state | Invalid |
Credit | 1,555.20 |
Device peak FLOPS | 2.47 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> too many exit(0)s </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6332, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:23:26 (6336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:17:36 (5884): No heartbeat from core client for 30 sec - exiting 11:17:37 (5884): No heartbeat from core client for 30 sec - exiting 11:17:38 (5884): No heartbeat from core client for 30 sec - exiting 11:17:39 (5884): No heartbeat from core client for 30 sec - exiting 11:17:40 (5884): No heartbeat from core client for 30 sec - exiting 11:17:41 (5884): No heartbeat from core client for 30 sec - exiting 11:17:42 (5884): No heartbeat from core client for 30 sec - exiting 11:17:43 (5884): No heartbeat from core client for 30 sec - exiting 11:17:44 (5884): No heartbeat from core client for 30 sec - exiting 11:17:45 (5884): No heartbeat from core client for 30 sec - exiting 11:17:46 (5884): No heartbeat from core client for 30 sec - exiting 11:17:47 (5884): No heartbeat from core client for 30 sec - exiting 11:17:48 (5884): No heartbeat from core client for 30 sec - exiting 11:17:49 (5884): No heartbeat from core client for 30 sec - exiting 11:17:50 (5884): No heartbeat from core client for 30 sec - exiting 11:17:51 (5884): No heartbeat from core client for 30 sec - exiting 11:17:52 (5884): No heartbeat from core client for 30 sec - exiting 11:17:53 (5884): No heartbeat from core client for 30 sec - exiting 11:17:54 (5884): No heartbeat from core client for 30 sec - exiting 11:17:55 (5884): No heartbeat from core client for 30 sec - exiting 11:17:56 (5884): No heartbeat from core client for 30 sec - exiting 11:17:57 (5884): No heartbeat from core client for 30 sec - exiting 11:17:58 (5884): No heartbeat from core client for 30 sec - exiting 11:17:59 (5884): No heartbeat from core client for 30 sec - exiting 11:18:00 (5884): No heartbeat from core client for 30 sec - exiting 11:18:01 (5884): No heartbeat from core client for 30 sec - exiting 11:18:02 (5884): No heartbeat from core client for 30 sec - exiting 11:18:03 (5884): No heartbeat from core client for 30 sec - exiting 11:18:04 (5884): No heartbeat from core client for 30 sec - exiting 11:18:05 (5884): No heartbeat from core client for 30 sec - exiting 11:18:06 (5884): No heartbeat from core client for 30 sec - exiting 11:18:07 (5884): No heartbeat from core client for 30 sec - exiting 11:18:08 (5884): No heartbeat from core client for 30 sec - exiting 11:18:09 (5884): No heartbeat from core client for 30 sec - exiting 11:18:10 (5884): No heartbeat from core client for 30 sec - exiting 11:18:11 (5884): No heartbeat from core client for 30 sec - exiting 11:18:12 (5884): No heartbeat from core client for 30 sec - exiting 11:18:13 (5884): No heartbeat from core client for 30 sec - exiting 11:18:14 (5884): No heartbeat from core client for 30 sec - exiting 11:18:15 (5884): No heartbeat from core client for 30 sec - exiting 11:18:16 (5884): No heartbeat from core client for 30 sec - exiting 11:18:17 (5884): No heartbeat from core client for 30 sec - exiting 11:18:18 (5884): No heartbeat from core client for 30 sec - exiting 11:18:19 (5884): No heartbeat from core client for 30 sec - exiting 11:18:20 (5884): No heartbeat from core client for 30 sec - exiting 11:18:21 (5884): No heartbeat from core client for 30 sec - exiting 11:18:22 (5884): No heartbeat from core client for 30 sec - exiting 11:18:23 (5884): No heartbeat from core client for 30 sec - exiting 11:18:24 (5884): No heartbeat from core client for 30 sec - exiting 11:18:25 (5884): No heartbeat from core client for 30 sec - exiting 11:18:26 (5884): No heartbeat from core client for 30 sec - exiting 11:18:27 (5884): No heartbeat from core client for 30 sec - exiting 11:32:26 (1768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:32:27 (1768): No heartbeat from core client for 30 sec - exiting 11:32:28 (1768): No heartbeat from core client for 30 sec - exiting 11:32:29 (1768): No heartbeat from core client for 30 sec - exiting 11:32:30 (1768): No heartbeat from core client for 30 sec - exiting 11:32:31 (1768): No heartbeat from core client for 30 sec - exiting 11:32:32 (1768): No heartbeat from core client for 30 sec - exiting 11:32:33 (1768): No heartbeat from core client for 30 sec - exiting 11:32:34 (1768): No heartbeat from core client for 30 sec - exiting 11:32:35 (1768): No heartbeat from core client for 30 sec - exiting 11:32:36 (1768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:53:42 (4748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 13:57:11 (7160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 15:11:57 (7752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CWorker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=876, selfPID=876, iMonCtr=1 CPDN Monitor - Quit request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7316, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 17:22:01 (6280): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CNo Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3640, selfPID=3640, iMonCtr=1 CPDN Monitor - Quit request from BOINC... 14:06:14 (6224): No heartbeat from core client for 30 sec - exiting 14:09:56 (6224): No heartbeat from core client for 30 sec - exiting 14:09:57 (6224): No heartbeat from core client for 30 sec - exiting 14:09:58 (6224): No heartbeat from core client for 30 sec - exiting 14:09:59 (6224): No heartbeat from core client for 30 sec - exiting 14:10:00 (6224): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
04 Sep 2013 18:14:37 | 1283909 | 15934852 | hadcm3n_n2b5_1920_40_008410433_0 | 129,600 | 226,990 | 1.7515 |
03 Sep 2013 14:07:43 | 1283909 | 15934852 | hadcm3n_n2b5_1920_40_008410433_0 | 103,680 | 189,567 | 1.8284 |
02 Sep 2013 10:04:38 | 1283909 | 15934852 | hadcm3n_n2b5_1920_40_008410433_0 | 77,760 | 152,671 | 1.9634 |
29 Aug 2013 12:26:33 | 1283909 | 15934852 | hadcm3n_n2b5_1920_40_008410433_0 | 51,840 | 106,849 | 2.0611 |
26 Aug 2013 10:49:51 | 1283909 | 15934852 | hadcm3n_n2b5_1920_40_008410433_0 | 25,920 | 52,835 | 2.0384 |
©2024 cpdn.org