Name | hadcm3n_yf83_1900_40_007352589_2 |
Workunit | 7550019 |
Created | 15 Jul 2011, 22:26:22 UTC |
Sent | 15 Jul 2011, 22:28:53 UTC |
Report deadline | 15 Oct 2011, 5:56:04 UTC |
Received | 19 Aug 2011, 15:53:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1158160 |
Run time | 6 days 12 hours 34 min 26 sec |
CPU time | 5 days 18 hours 49 min 36 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
- exit code 193 (0xc1)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:29:51 (1864): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:29:52 (1864): No heartbeat from core client for 30 sec - exiting
16:29:53 (1864): No heartbeat from core client for 30 sec - exiting
16:29:54 (1864): No heartbeat from core client for 30 sec - exiting
16:29:55 (1864): No heartbeat from core client for 30 sec - exiting
16:29:56 (1864): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1152, iMonCtr=1
Model crash detected, will try to restart...
09:11:11 (3488): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:11:12 (3488): No heartbeat from core client for 30 sec - exiting
09:11:13 (3488): No heartbeat from core client for 30 sec - exiting
09:11:14 (3488): No heartbeat from core client for 30 sec - exiting
09:11:15 (3488): No heartbeat from core client for 30 sec - exiting
09:13:25 (1636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:13:26 (1636): No heartbeat from core client for 30 sec - exiting
09:13:27 (1636): No heartbeat from core client for 30 sec - exiting
09:13:28 (1636): No heartbeat from core client for 30 sec - exiting
09:13:29 (1636): No heartbeat from core client for 30 sec - exiting
09:13:30 (1636): No heartbeat from core client for 30 sec - exiting
09:13:31 (1636): No heartbeat from core client for 30 sec - exiting
09:13:32 (1636): No heartbeat from core client for 30 sec - exiting
09:18:34 (3568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:18:35 (3568): No heartbeat from core client for 30 sec - exiting
09:18:36 (3568): No heartbeat from core client for 30 sec - exiting
09:18:37 (3568): No heartbeat from core client for 30 sec - exiting
09:25:04 (1460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:25:05 (1460): No heartbeat from core client for 30 sec - exiting
09:25:06 (1460): No heartbeat from core client for 30 sec - exiting
09:25:07 (1460): No heartbeat from core client for 30 sec - exiting
09:28:05 (3600): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:28:06 (3600): No heartbeat from core client for 30 sec - exiting
09:28:07 (3600): No heartbeat from core client for 30 sec - exiting
09:28:08 (3600): No heartbeat from core client for 30 sec - exiting
09:28:09 (3600): No heartbeat from core client for 30 sec - exiting
09:28:10 (3600): No heartbeat from core client for 30 sec - exiting
09:33:24 (3816): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:33:25 (3816): No heartbeat from core client for 30 sec - exiting
09:33:26 (3816): No heartbeat from core client for 30 sec - exiting
09:33:27 (3816): No heartbeat from core client for 30 sec - exiting
09:33:28 (3816): No heartbeat from core client for 30 sec - exiting
09:33:29 (3816): No heartbeat from core client for 30 sec - exiting
09:38:10 (3176): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:38:11 (3176): No heartbeat from core client for 30 sec - exiting
09:39:35 (2244): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:39:36 (2244): No heartbeat from core client for 30 sec - exiting
09:39:37 (2244): No heartbeat from core client for 30 sec - exiting
09:39:38 (2244): No heartbeat from core client for 30 sec - exiting
09:42:35 (2536): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:42:36 (2536): No heartbeat from core client for 30 sec - exiting
09:42:37 (2536): No heartbeat from core client for 30 sec - exiting
09:48:30 (2584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:48:31 (2584): No heartbeat from core client for 30 sec - exiting
09:48:32 (2584): No heartbeat from core client for 30 sec - exiting
09:48:33 (2584): No heartbeat from core client for 30 sec - exiting
09:48:34 (2584): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4080, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3632, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3820, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4212, iMonCtr=1
Model crash detected, will try to restart...
12:10:34 (3260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:10:35 (3260): No heartbeat from core client for 30 sec - exiting
12:10:36 (3260): No heartbeat from core client for 30 sec - exiting
12:10:37 (3260): No heartbeat from core client for 30 sec - exiting
12:12:23 (4888): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:12:24 (4888): No heartbeat from core client for 30 sec - exiting
12:12:25 (4888): No heartbeat from core client for 30 sec - exiting
12:20:17 (4256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:57:36 (4152): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:08:07 (5096): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:08:08 (5096): No heartbeat from core client for 30 sec - exiting
13:08:09 (5096): No heartbeat from core client for 30 sec - exiting
13:13:03 (5088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:13:04 (5088): No heartbeat from core client for 30 sec - exiting
13:13:05 (5088): No heartbeat from core client for 30 sec - exiting
13:38:15 (5060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:38:16 (5060): No heartbeat from core client for 30 sec - exiting
13:38:17 (5060): No heartbeat from core client for 30 sec - exiting
14:08:38 (1916): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:08:39 (1916): No heartbeat from core client for 30 sec - exiting
14:18:22 (5012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:38:43 (5112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:38:44 (5112): No heartbeat from core client for 30 sec - exiting
14:43:53 (4300): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:09:21 (4312): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:09:22 (4312): No heartbeat from core client for 30 sec - exiting
16:09:23 (4312): No heartbeat from core client for 30 sec - exiting
16:24:22 (4392): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:24:23 (4392): No heartbeat from core client for 30 sec - exiting
16:24:24 (4392): No heartbeat from core client for 30 sec - exiting
16:24:25 (4392): No heartbeat from core client for 30 sec - exiting
16:29:33 (3320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
16:29:34 (3320): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3760, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3760, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
</stderr_txt>
]]> |
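
The stderr output repeats a handful of message types (missed heartbeats, quit and suspend requests, restart attempts, and the final signal). A minimal sketch of tallying them is shown here, assuming the log has been split into one message per line. The pattern names, the `classify` helper, and the `sample` excerpt are illustrative choices, not part of BOINC or the CPDN application; the sample lines are copied from the end of the log.

```python
import re
from collections import Counter

# Hypothetical labels for the message kinds seen in the stderr text.
PATTERNS = {
    "no_heartbeat": re.compile(r"^\d{2}:\d{2}:\d{2} \(\d+\): No heartbeat from core client"),
    "quit_request": re.compile(r"^CPDN Monitor - Quit request from BOINC"),
    "suspend_request": re.compile(r"^Suspended CPDN Monitor - Suspend request from BOINC"),
    "model_restart": re.compile(r"^Model crash detected, will try to restart"),
    "signal": re.compile(r"^Signal \d+ received"),
}


def classify(lines):
    """Count how many lines match each known message pattern."""
    counts = Counter()
    for line in lines:
        text = line.strip()
        for kind, pattern in PATTERNS.items():
            if pattern.search(text):
                counts[kind] += 1
                break  # each line is counted at most once
    return counts


# Excerpt: the last eight messages of the log, one per line.
sample = [
    "16:29:33 (3320): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "16:29:34 (3320): No heartbeat from core client for 30 sec - exiting",
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3760, iMonCtr=1",
    "Model crash detected, will try to restart...",
    "Signal 11 received, exiting...",
    "Called boinc_finish",
]

counts = classify(sample)
for kind, n in sorted(counts.items()):
    print(f"{kind}: {n}")
```

Lines that match none of the patterns (for example the `Controller::` status line) are simply left uncounted in this sketch.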
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
19 Aug 2011 14:54:22 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 259,200 | 499,774 | 1.9281 |
17 Aug 2011 20:07:30 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 233,280 | 452,046 | 1.9378 |
15 Aug 2011 22:03:53 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 207,360 | 404,234 | 1.9494 |
13 Aug 2011 17:19:47 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 181,440 | 355,868 | 1.9614 |
10 Aug 2011 17:56:17 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 155,520 | 307,135 | 1.9749 |
06 Aug 2011 22:55:19 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 129,600 | 259,441 | 2.0019 |
02 Aug 2011 21:26:14 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 103,680 | 210,444 | 2.0297 |
28 Jul 2011 23:37:15 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 77,760 | 155,316 | 1.9974 |
25 Jul 2011 21:03:59 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 51,840 | 101,707 | 1.9619 |
25 Jul 2011 17:38:56 | 1158160 | 13140825 | hadcm3n_yf83_1900_40_007352589_2 | 25,920 | 52,020 | 2.0069 |
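
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. The short sketch below recomputes it from the table values as a check; the `trickles` list and the helper name are illustrative, and the tolerance only allows for the four-decimal rounding in the table.

```python
# (timestep, cpu_time_sec, reported_average) copied from the trickle table,
# newest first.
trickles = [
    (259_200, 499_774, 1.9281),
    (233_280, 452_046, 1.9378),
    (207_360, 404_234, 1.9494),
    (181_440, 355_868, 1.9614),
    (155_520, 307_135, 1.9749),
    (129_600, 259_441, 2.0019),
    (103_680, 210_444, 2.0297),
    (77_760, 155_316, 1.9974),
    (51_840, 101_707, 1.9619),
    (25_920, 52_020, 2.0069),
]


def average_sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    # Allow for rounding to four decimals in the reported column.
    matches = abs(computed - reported) < 1e-4
    print(f"{timestep:>7}  {computed:.4f}  reported {reported:.4f}  match={matches}")
```

Each trickle in the table is 25,920 timesteps after the previous one, so the ten rows cover 259,200 timesteps in total.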