Name | hadcm3n_u1ya_1980_40_007548434_2 |
Workunit | 7745666 |
Created | 1 Dec 2011, 7:30:09 UTC |
Sent | 1 Dec 2011, 7:34:43 UTC |
Report deadline | 1 Mar 2012, 15:01:54 UTC |
Received | 25 Jan 2012, 6:18:19 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 678086 |
Run time | 26 days 12 hours 5 min 26 sec |
CPU time | 23 days 11 hours 22 min 17 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.22 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> 10:33:51 (7364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=888, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 19:23:05 (6184): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2096, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not runController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=568, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5448, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4704, iMonCtr=1 Model crash detected, will try to restart... 10:57:55 (5548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4652, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2488, iMonCtr=1 Model crash detected, will try to restart... 10:30:40 (324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:33:25 (3376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:33:26 (3376): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 08:20:23 (1152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4852, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... 16:16:06 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6040, iMonCtr=1 Model crash detected, will try to restart... 08:27:04 (4044): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4772, iMonCtr=1 Model crash detected, will try to restart... 09:16:46 (5648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:09:18 (4404): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:56:44 (2632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 10:19:01 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:25:57 (5692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=1 Model crash detected, will try to restart... 08:47:55 (4648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77CF1F8F read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... 11:07:58 (4608): No heartbeat from core client for 30 sec - exiting 11:07:59 (4608): No heartbeat from core client for 30 sec - exiting Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76E689C1 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... 11:08:00 (4608): No heartbeat from core client for 30 sec - exiting 11:08:01 (4608): No heartbeat from core client for 30 sec - exiting 11:08:02 (4608): No heartbeat from core client for 30 sec - exiting 11:08:03 (4608): No heartbeat from core client for 30 sec - exiting 11:08:04 (4608): No heartbeat from core client for 30 sec - exiting Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77D589C1 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... 08:28:02 (564): No heartbeat from core client for 30 sec - exiting 08:28:04 (564): No heartbeat from core client for 30 sec - exiting 08:28:05 (564): No heartbeat from core client for 30 sec - exiting 08:28:06 (564): No heartbeat from core client for 30 sec - exiting 08:28:07 (564): No heartbeat from core client for 30 sec - exiting 08:28:08 (564): No heartbeat from core client for 30 sec - exiting 08:28:09 (564): No heartbeat from core client for 30 sec - exiting 08:28:10 (564): No heartbeat from core client for 30 sec - exiting 08:28:11 (564): No heartbeat from core client for 30 sec - exiting 08:28:12 (564): No heartbeat from core client for 30 sec - exiting 08:28:13 (564): No heartbeat from core client for 30 sec - exiting 08:28:15 (564): No heartbeat from core client for 30 sec - exiting 08:28:16 (564): No heartbeat from core client for 30 sec - exiting 08:28:17 (564): No heartbeat from core client for 30 sec - exiting 08:28:18 (564): No heartbeat from core client for 30 sec - exiting 08:28:19 (564): No heartbeat from core client for 30 sec - exiting 08:28:20 (564): No heartbeat from core client for 30 sec - exiting 08:28:21 (564): No heartbeat from core client for 30 sec - exiting 08:28:22 (564): No heartbeat from core client for 30 sec - exiting 08:28:23 (564): No heartbeat from core client for 30 sec - exiting 08:28:24 (564): No heartbeat from core client for 30 sec - exiting 08:28:25 (564): No heartbeat from core client for 30 sec - exiting 08:28:27 (564): No heartbeat from core client for 30 sec - exiting 08:28:28 (564): No heartbeat from core client for 30 sec - exiting 08:28:29 (564): No heartbeat from core client for 30 sec - exiting 08:28:30 (564): No heartbeat from core client for 30 sec - exiting 08:28:31 (564): No heartbeat from core client for 30 sec - exiting 08:28:32 (564): No heartbeat from core client for 30 sec - exiting 08:28:33 (564): No heartbeat from core client for 30 sec - exiting 08:28:34 (564): No heartbeat from core client for 30 sec - exiting 08:28:35 (564): No heartbeat from core client for 30 sec - exiting 08:28:36 (564): No heartbeat from core client for 30 sec - exiting 08:28:37 (564): No heartbeat from core client for 30 sec - exiting 08:28:39 (564): No heartbeat from core client for 30 sec - exiting 08:28:40 (564): No heartbeat from core client for 30 sec - exiting 08:28:41 (564): No heartbeat from core client for 30 sec - exiting 08:28:42 (564): No heartbeat from core client for 30 sec - exiting Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77BD89C1 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77C489C1 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... 07:15:56 (1756): No heartbeat from core client for 30 sec - exiting 07:15:57 (1756): No heartbeat from core client for 30 sec - exiting 07:15:58 (1756): No heartbeat from core client for 30 sec - exiting 07:15:59 (1756): No heartbeat from core client for 30 sec - exiting 07:16:00 (1756): No heartbeat from core client for 30 sec - exiting 07:16:01 (1756): No heartbeat from core client for 30 sec - exiting 07:16:02 (1756): No heartbeat from core client for 30 sec - exiting 07:16:03 (1756): No heartbeat from core client for 30 sec - exiting 07:16:05 (1756): No heartbeat from core client for 30 sec - exiting 07:16:06 (1756): No heartbeat from core client for 30 sec - exiting 07:16:07 (1756): No heartbeat from core client for 30 sec - exiting 07:16:08 (1756): No heartbeat from core client for 30 sec - exiting 07:16:09 (1756): No heartbeat from core client for 30 sec - exiting 07:16:10 (1756): No heartbeat from core client for 30 sec - exiting 07:16:11 (1756): No heartbeat from core client for 30 sec - exiting 07:16:12 (1756): No heartbeat from core client for 30 sec - exiting 07:16:13 (1756): No heartbeat from core client for 30 sec - exiting 07:16:14 (1756): No heartbeat from core client for 30 sec - exiting 07:16:15 (1756): No heartbeat from core client for 30 sec - exiting 07:16:16 (1756): No heartbeat from core client for 30 sec - exiting 07:16:18 (1756): No heartbeat from core client for 30 sec - exiting 07:16:19 (1756): No heartbeat from core client for 30 sec - exiting 07:16:20 (1756): No heartbeat from core client for 30 sec - exiting 07:16:21 (1756): No heartbeat from core client for 30 sec - exiting 07:16:22 (1756): No heartbeat from core client for 30 sec - exiting 07:16:23 (1756): No heartbeat from core client for 30 sec - exiting 07:16:24 (1756): No heartbeat from core client for 30 sec - exiting 07:16:25 (1756): No heartbeat from core client for 30 sec - exiting 07:16:26 (1756): No heartbeat from core client for 30 sec - exiting 07:16:27 (1756): No heartbeat from core client for 30 sec - exiting 07:16:28 (1756): No heartbeat from core client for 30 sec - exiting 07:16:29 (1756): No heartbeat from core client for 30 sec - exiting 07:16:30 (1756): No heartbeat from core client for 30 sec - exiting 07:16:31 (1756): No heartbeat from core client for 30 sec - exiting 07:16:32 (1756): No heartbeat from core client for 30 sec - exiting 07:16:33 (1756): No heartbeat from core client for 30 sec - exiting 07:16:34 (1756): No heartbeat from core client for 30 sec - exiting 07:16:35 (1756): No heartbeat from core client for 30 sec - exiting 07:16:36 (1756): No heartbeat from core client for 30 sec - exiting 07:16:37 (1756): No heartbeat from core client for 30 sec - exiting 07:16:39 (1756): No heartbeat from core client for 30 sec - exiting 07:16:40 (1756): No heartbeat from core client for 30 sec - exiting 07:16:41 (1756): No heartbeat from core client for 30 sec - exiting Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77DA6FCF read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Jan 2012 14:06:05 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 1,036,800 | 2,027,380 | 1.9554 |
19 Jan 2012 14:38:09 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 1,010,880 | 1,976,808 | 1.9555 |
18 Jan 2012 16:26:04 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 984,960 | 1,927,724 | 1.9572 |
17 Jan 2012 16:35:14 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 959,040 | 1,880,466 | 1.9608 |
16 Jan 2012 17:32:13 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 933,120 | 1,830,882 | 1.9621 |
15 Jan 2012 14:41:40 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 907,200 | 1,781,540 | 1.9638 |
14 Jan 2012 15:18:33 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 881,280 | 1,732,274 | 1.9656 |
13 Jan 2012 19:31:53 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 855,360 | 1,683,223 | 1.9679 |
12 Jan 2012 21:24:55 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 829,440 | 1,632,439 | 1.9681 |
11 Jan 2012 21:19:39 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 803,520 | 1,582,121 | 1.9690 |
10 Jan 2012 22:51:31 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 777,600 | 1,531,798 | 1.9699 |
10 Jan 2012 07:22:59 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 751,680 | 1,484,399 | 1.9748 |
09 Jan 2012 08:53:23 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 725,760 | 1,433,025 | 1.9745 |
08 Jan 2012 10:10:09 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 699,840 | 1,384,927 | 1.9789 |
07 Jan 2012 20:21:50 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 673,920 | 1,336,115 | 1.9826 |
06 Jan 2012 18:25:49 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 648,000 | 1,286,160 | 1.9848 |
06 Jan 2012 03:06:52 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 622,080 | 1,236,101 | 1.9870 |
05 Jan 2012 12:05:21 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 596,160 | 1,187,515 | 1.9919 |
04 Jan 2012 07:40:59 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 570,240 | 1,134,797 | 1.9900 |
03 Jan 2012 15:37:53 | 678086 | 13680399 | hadcm3n_u1ya_1980_40_007548434_2 | 544,320 | 1,080,774 | 1.9855 |
©2024 cpdn.org