Name | hadcm3n_p62h_1940_40_007421261_0 |
Workunit | 7618896 |
Created | 25 Aug 2011, 1:41:54 UTC |
Sent | 26 Aug 2011, 6:14:05 UTC |
Report deadline | 25 Nov 2011, 13:41:16 UTC |
Received | 7 Nov 2011, 6:54:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1115924 |
Run time | 13 days 21 hours 30 min 47 sec |
CPU time | 13 days 9 hours 50 min 58 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.71 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2416, iMonCtr=1 Model crash detected, will try to restart... 07:56:02 (5180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:56:03 (5180): No heartbeat from core client for 30 sec - exiting 07:56:04 (5180): No heartbeat from core client for 30 sec - exiting 07:56:05 (5180): No heartbeat from core client for 30 sec - exiting 07:56:06 (5180): No heartbeat from core client for 30 sec - exiting 07:56:07 (5180): No heartbeat from core client for 30 sec - exiting 07:56:08 (5180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 07:54:59 (6088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 07:53:19 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:24:28 (1188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:50 (2980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
14:06:51 (2980): No heartbeat from core client for 30 sec - exiting 14:06:52 (2980): No heartbeat from core client for 30 sec - exiting 14:06:53 (2980): No heartbeat from core client for 30 sec - exiting 14:06:54 (2980): No heartbeat from core client for 30 sec - exiting 14:06:55 (2980): No heartbeat from core client for 30 sec - exiting 07:59:59 (4128): No heartbeat from core client for 30 sec - exiting 08:00:00 (4128): No heartbeat from core client for 30 sec - exiting 08:00:01 (4128): No heartbeat from core client for 30 sec - exiting 08:00:02 (4128): No heartbeat from core client for 30 sec - exiting 08:00:03 (4128): No heartbeat from core client for 30 sec - exiting 08:00:04 (4128): No heartbeat from core client for 30 sec - exiting 08:00:05 (4128): No heartbeat from core client for 30 sec - exiting 08:00:06 (4128): No heartbeat from core client for 30 sec - exiting 08:00:07 (4128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:07:50 (4140): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:41:23 (4784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:25:44 (3020): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1848, iMonCtr=1 Model crash detected, will try to restart... 18:15:39 (4768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1 Model crash detected, will try to restart... 
09:10:43 (5808): No heartbeat from core client for 30 sec - exiting 09:10:44 (5808): No heartbeat from core client for 30 sec - exiting 09:10:45 (5808): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:52:11 (3424): No heartbeat from core client for 30 sec - exiting 07:52:13 (3424): No heartbeat from core client for 30 sec - exiting 07:52:14 (3424): No heartbeat from core client for 30 sec - exiting 07:52:15 (3424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CSuspended CPDN Monitor - Suspend request from BOINC... 07:53:35 (1508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:08:10 (3292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2964, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2712, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8180, iMonCtr=1 Model crash detected, will try to restart... Unhandled Exception Detected... 
- Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77C6E722 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77D35F1B read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
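
The Exit status row shows the same value twice: as a signed 32-bit integer (-1073741819) and as the unsigned hexadecimal code 0xC0000005. The following is a minimal Python sketch of that conversion. The function name `exit_status_to_hex` is hypothetical and is not part of BOINC or CPDN.

```python
def exit_status_to_hex(status: int) -> str:
    """Reinterpret a signed 32-bit exit status as unsigned and format it as hex."""
    # Masking with 0xFFFFFFFF maps a negative value onto its two's-complement
    # unsigned 32-bit equivalent (here: -1073741819 + 2**32 = 3221225477).
    return f"0x{status & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    print(exit_status_to_hex(-1073741819))  # 0xC0000005
```

Windows documents 0xC0000005 as STATUS_ACCESS_VIOLATION, which agrees with the "Access Violation (0xc0000005)" entries in the stderr output above.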
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
04 Nov 2011 12:09:20 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 777,600 | 1,145,866 | 1.4736 |
03 Nov 2011 08:39:00 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 751,680 | 1,106,396 | 1.4719 |
01 Nov 2011 14:36:13 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 725,760 | 1,066,758 | 1.4698 |
31 Oct 2011 17:39:51 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 699,840 | 1,028,336 | 1.4694 |
31 Oct 2011 17:09:06 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 673,920 | 990,008 | 1.4690 |
31 Oct 2011 15:53:49 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 648,000 | 951,701 | 1.4687 |
31 Oct 2011 12:49:59 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 622,080 | 913,483 | 1.4684 |
31 Oct 2011 12:49:59 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 596,160 | 878,251 | 1.4732 |
31 Oct 2011 12:49:58 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 570,240 | 842,019 | 1.4766 |
31 Oct 2011 12:49:58 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 544,320 | 803,521 | 1.4762 |
18 Oct 2011 09:02:04 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 518,400 | 765,572 | 1.4768 |
03 Oct 2011 17:51:09 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 492,480 | 727,165 | 1.4765 |
30 Sep 2011 06:29:01 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 466,560 | 688,548 | 1.4758 |
28 Sep 2011 13:49:20 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 440,640 | 650,376 | 1.4760 |
27 Sep 2011 11:55:11 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 414,720 | 612,104 | 1.4759 |
26 Sep 2011 09:45:23 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 388,800 | 574,650 | 1.4780 |
22 Sep 2011 14:13:16 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 362,880 | 536,548 | 1.4786 |
21 Sep 2011 12:24:02 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 336,960 | 498,765 | 1.4802 |
20 Sep 2011 09:28:55 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 311,040 | 460,631 | 1.4809 |
19 Sep 2011 07:26:58 | 1115924 | 13290776 | hadcm3n_p62h_1940_40_007421261_0 | 285,120 | 422,466 | 1.4817 |
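
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that assumption against the newest and oldest rows of the table. The function name `seconds_per_timestep` is hypothetical.

```python
def seconds_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Average CPU seconds per model timestep, rounded to four decimals."""
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, reported average) copied from the table above.
rows = [
    (777_600, 1_145_866, 1.4736),  # 04 Nov 2011 trickle
    (285_120, 422_466, 1.4817),    # 19 Sep 2011 trickle
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in rows:
        computed = seconds_per_timestep(cpu_time, timestep)
        print(timestep, computed, computed == reported)
```

Successive trickles in the table are 25,920 timesteps apart, so the same check can be applied to any other row.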