Name | hadcm3n_o5oi_1940_40_008380450_0 |
Workunit | 8531309 |
Created | 31 May 2013, 21:51:39 UTC |
Sent | 16 Jun 2013, 19:34:27 UTC |
Report deadline | 16 Sep 2013, 3:01:38 UTC |
Received | 26 Jul 2013, 7:32:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1189727 |
Run time | 23 days 14 hours 12 min 53 sec |
CPU time | 22 days 19 hours 51 min 5 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.40 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005) </message> <stderr_txt> 20:38:11 (2500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:39:55 (2500): No heartbeat from core client for 30 sec - exiting 20:57:19 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:57:40 (4008): No heartbeat from core client for 30 sec - exiting 20:57:41 (4008): No heartbeat from core client for 30 sec - exiting 20:57:42 (4008): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5892, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6000, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5892, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5776, iMonCtr=1 Model crash detected, will try to restart... 21:58:34 (5428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:59:40 (964): No heartbeat from core client for 30 sec - exiting 21:59:41 (964): No heartbeat from core client for 30 sec - exiting 21:59:42 (964): No heartbeat from core client for 30 sec - exiting 21:59:43 (964): No heartbeat from core client for 30 sec - exiting 21:59:44 (964): No heartbeat from core client for 30 sec - exiting 21:59:45 (964): No heartbeat from core client for 30 sec - exiting 21:59:46 (964): No heartbeat from core client for 30 sec - exiting 21:59:47 (964): No heartbeat from core client for 30 sec - exiting 21:59:49 (964): No heartbeat from core client for 30 sec - exiting 21:59:50 (964): No heartbeat from core client for 30 sec - exiting 21:59:51 (964): No heartbeat from core client for 30 sec - exiting 21:59:52 (964): No heartbeat from core client for 30 sec - exiting 21:59:53 (964): No heartbeat from core client for 30 sec - exiting 21:59:54 (964): No heartbeat from core client for 30 sec - exiting 21:59:55 (964): No heartbeat from core client for 30 sec - exiting 21:59:56 (964): No heartbeat from core client for 30 sec - exiting 21:59:57 (964): No heartbeat from core client for 30 sec - exiting 21:59:58 (964): No heartbeat from core client for 30 sec - exiting 21:59:59 (964): No heartbeat from core client for 30 sec - exiting 22:00:01 (964): No heartbeat from core client for 30 sec - exiting 22:00:02 (964): No heartbeat from core client for 30 sec - exiting 22:00:03 (964): No heartbeat from core client for 30 sec - exiting 22:00:04 (964): No heartbeat from core client for 30 sec - exiting 22:00:05 (964): No heartbeat from core client for 30 sec - exiting 22:00:06 (964): No heartbeat from core client for 30 sec - exiting 22:00:07 (964): No heartbeat from core client for 30 sec - exiting 22:00:08 (964): No heartbeat from core client for 30 sec - exiting 22:00:09 (964): No heartbeat from core client for 30 sec - exiting 22:00:10 (964): No heartbeat from core client for 30 sec - exiting 22:00:11 (964): No heartbeat from core client for 30 sec - exiting 22:00:13 (964): No heartbeat from core client for 30 sec - exiting 22:00:14 (964): No heartbeat from core client for 30 sec - exiting 22:00:15 (964): No heartbeat from core client for 30 sec - exiting 22:00:16 (964): No heartbeat from core client for 30 sec - exiting 22:00:17 (964): No heartbeat from core client for 30 sec - exiting 22:00:18 (964): No heartbeat from core client for 30 sec - exiting 22:00:19 (964): No heartbeat from core client for 30 sec - exiting 22:00:20 (964): No heartbeat from core client for 30 sec - exiting 22:00:21 (964): No heartbeat from core client for 30 sec - exiting 22:00:22 (964): No heartbeat from core client for 30 sec - exiting 22:00:24 (964): No heartbeat from core client for 30 sec - exiting 22:00:25 (964): No heartbeat from core client for 30 sec - exiting 22:00:26 (964): No heartbeat from core client for 30 sec - exiting 22:00:27 (964): No heartbeat from core client for 30 sec - exiting 22:00:28 (964): No heartbeat from core client for 30 sec - exiting 22:00:29 (964): No heartbeat from core client for 30 sec - exiting 22:00:30 (964): No heartbeat from core client for 30 sec - exiting 22:00:31 (964): No heartbeat from core client for 30 sec - exiting 22:00:32 (964): No heartbeat from core client for 30 sec - exiting 22:00:33 (964): No heartbeat from core client for 30 sec - exiting 22:00:34 (964): No heartbeat from core client for 30 sec - exiting 22:00:36 (964): No heartbeat from core client for 30 sec - exiting 22:00:37 (964): No heartbeat from core client for 30 sec - exiting 22:00:38 (964): No heartbeat from core client for 30 sec - exiting 22:00:39 (964): No heartbeat from core client for 30 sec - exiting 22:00:40 (964): No heartbeat from core client for 30 sec - exiting 22:00:41 (964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5068, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4956, iMonCtr=1 Model crash detected, will try to restart... 20:31:13 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:31:15 (5960): No heartbeat from core client for 30 sec - exiting 20:31:16 (5960): No heartbeat from core client for 30 sec - exiting 20:31:17 (5960): No heartbeat from core client for 30 sec - exiting 20:31:18 (5960): No heartbeat from core client for 30 sec - exiting 20:31:19 (5960): No heartbeat from core client for 30 sec - exiting 20:31:20 (5960): No heartbeat from core client for 30 sec - exiting 20:31:21 (5960): No heartbeat from core client for 30 sec - exiting 20:31:22 (5960): No heartbeat from core client for 30 sec - exiting 20:31:23 (5960): No heartbeat from core client for 30 sec - exiting 20:31:24 (5960): No heartbeat from core client for 30 sec - exiting 20:31:25 (5960): No heartbeat from core client for 30 sec - exiting 20:31:27 (5960): No heartbeat from core client for 30 sec - exiting 20:31:28 (5960): No heartbeat from core client for 30 sec - exiting 20:31:29 (5960): No heartbeat from core client for 30 sec - exiting 20:31:30 (5960): No heartbeat from core client for 30 sec - exiting 20:31:31 (5960): No heartbeat from core client for 30 sec - exiting 20:31:32 (5960): No heartbeat from core client for 30 sec - exiting 20:31:33 (5960): No heartbeat from core client for 30 sec - exiting 20:31:34 (5960): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6840, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3584, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5164, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5344, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5344, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5288, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5212, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7028, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5048, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77E471F3 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4380, selfPID=4380, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Jul 2013 06:49:13 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 777,600 | 1,969,506 | 2.5328 |
23 Jul 2013 23:34:51 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 751,680 | 1,899,856 | 2.5275 |
23 Jul 2013 21:34:18 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 725,760 | 1,830,506 | 2.5222 |
23 Jul 2013 20:43:46 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 699,840 | 1,761,938 | 2.5176 |
23 Jul 2013 19:10:56 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 673,920 | 1,686,788 | 2.5029 |
23 Jul 2013 18:53:20 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 648,000 | 1,614,959 | 2.4922 |
23 Jul 2013 18:53:19 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 622,080 | 1,543,841 | 2.4817 |
23 Jul 2013 18:53:18 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 596,160 | 1,466,900 | 2.4606 |
23 Jul 2013 18:53:18 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 570,240 | 1,389,440 | 2.4366 |
12 Jul 2013 02:37:22 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 544,320 | 1,313,569 | 2.4132 |
10 Jul 2013 07:00:31 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 518,400 | 1,245,292 | 2.4022 |
08 Jul 2013 22:06:11 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 492,480 | 1,174,673 | 2.3852 |
07 Jul 2013 14:37:13 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 466,560 | 1,107,563 | 2.3739 |
06 Jul 2013 16:59:54 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 440,640 | 1,038,504 | 2.3568 |
06 Jul 2013 05:29:03 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 414,720 | 968,436 | 2.3352 |
04 Jul 2013 14:20:15 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 388,800 | 900,902 | 2.3171 |
03 Jul 2013 08:15:12 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 362,880 | 834,547 | 2.2998 |
02 Jul 2013 12:00:38 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 336,960 | 772,574 | 2.2928 |
02 Jul 2013 10:41:40 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 311,040 | 713,550 | 2.2941 |
02 Jul 2013 10:16:39 | 1189727 | 15811545 | hadcm3n_o5oi_1940_40_008380450_0 | 285,120 | 654,117 | 2.2942 |
©2024 cpdn.org