Name | hadcm3n_o53l_1940_40_007267315_2 |
Workunit | 7465555 |
Created | 2 Sep 2011, 10:05:06 UTC |
Sent | 2 Sep 2011, 13:18:10 UTC |
Report deadline | 2 Dec 2011, 20:45:21 UTC |
Received | 13 Dec 2011, 14:54:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
Computer ID | 1088606 |
Run time | 25 days 15 hours 44 min 23 sec |
CPU time | 24 days 10 hours 46 min 45 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.41 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5416, iMonCtr=1 Model crash detected, will try to restart... forrtl: Access is denied. CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4660, selfPID=4660, iMonCtr=1 CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:48:46 (4524): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 14:23:03 (6588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
14:54:46 (6640): No heartbeat from core client for 30 sec - exiting 14:54:47 (6640): No heartbeat from core client for 30 sec - exiting 14:54:48 (6640): No heartbeat from core client for 30 sec - exiting 14:54:49 (6640): No heartbeat from core client for 30 sec - exiting 14:54:50 (6640): No heartbeat from core client for 30 sec - exiting 14:54:51 (6640): No heartbeat from core client for 30 sec - exiting 14:54:52 (6640): No heartbeat from core client for 30 sec - exiting 14:54:53 (6640): No heartbeat from core client for 30 sec - exiting 14:54:54 (6640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:54:55 (6640): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:51:24 (5168): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
10:51:25 (5168): No heartbeat from core client for 30 sec - exiting 10:51:26 (5168): No heartbeat from core client for 30 sec - exiting 10:51:27 (5168): No heartbeat from core client for 30 sec - exiting 10:51:28 (5168): No heartbeat from core client for 30 sec - exiting 10:51:29 (5168): No heartbeat from core client for 30 sec - exiting 10:51:30 (5168): No heartbeat from core client for 30 sec - exiting 10:51:31 (5168): No heartbeat from core client for 30 sec - exiting 10:51:32 (5168): No heartbeat from core client for 30 sec - exiting 10:51:33 (5168): No heartbeat from core client for 30 sec - exiting 10:51:34 (5168): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4680, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:00:50 (3048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4492, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4568, iMonCtr=1 Model crash detected, will try to restart... 08:17:31 (5056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:23:54 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6088, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5516, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4528, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=812, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4728, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 19:17:41 (2936): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:43:00 (5860): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6552, iMonCtr=1 Model crash detected, will try to restart... 12:42:35 (4420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:56:28 (2052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4340, iMonCtr=1 Model crash detected, will try to restart... 10:56:56 (4596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:14:51 (4716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4592, iMonCtr=1 Model crash detected, will try to restart... 08:17:20 (4300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:18 (4560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:39 (4508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3752, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 08:16:14 (4744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2916, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 08:13:16 (4560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:31:34 (4124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 17:49:10 (3488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6204, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2656, iMonCtr=1 Model crash detected, will try to restart... 08:17:39 (4980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
11:33:58 (4668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 16:52:02 (6336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 08:17:36 (4948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x77C23A95 read attempt to address 0x02802804 Engaging BOINC Windows Runtime Debugger... </stderr_txt> ]]> |
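The exit status above appears both as a signed decimal (-1073741819) and as a hexadecimal code (0xC0000005). The two are the same 32-bit value read as signed and as unsigned. A minimal sketch of that conversion follows; the function name `to_unsigned_hex` is hypothetical and not part of BOINC or CPDN.

```python
def to_unsigned_hex(exit_status: int) -> str:
    """Reinterpret a signed 32-bit exit status as unsigned and format it as hex."""
    # Masking with 0xFFFFFFFF keeps the low 32 bits, which maps a negative
    # two's-complement value onto its unsigned equivalent.
    return f"0x{exit_status & 0xFFFFFFFF:08X}"


if __name__ == "__main__":
    # Exit status taken from the record above.
    print(to_unsigned_hex(-1073741819))
```

For the value in this record the result is 0xC0000005, matching the STATUS_ACCESS_VIOLATION code reported in the Exit status row and in the final stderr message.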
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
13 Dec 2011 00:58:02 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 1,036,800 | 2,111,706 | 2.0368 |
09 Dec 2011 17:50:18 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 1,010,880 | 2,058,760 | 2.0366 |
08 Dec 2011 11:03:48 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 984,960 | 2,005,470 | 2.0361 |
07 Dec 2011 19:32:58 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 959,040 | 1,952,550 | 2.0359 |
05 Dec 2011 21:30:47 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 933,120 | 1,898,918 | 2.0350 |
02 Dec 2011 15:18:42 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 907,200 | 1,845,910 | 2.0347 |
01 Dec 2011 08:55:52 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 881,280 | 1,792,662 | 2.0342 |
30 Nov 2011 17:17:52 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 855,360 | 1,738,671 | 2.0327 |
28 Nov 2011 19:35:53 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 829,440 | 1,686,274 | 2.0330 |
22 Nov 2011 21:24:24 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 803,520 | 1,633,469 | 2.0329 |
21 Nov 2011 15:20:13 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 777,600 | 1,580,734 | 2.0328 |
17 Nov 2011 16:55:29 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 751,680 | 1,527,886 | 2.0326 |
15 Nov 2011 18:26:17 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 725,760 | 1,475,024 | 2.0324 |
15 Nov 2011 17:39:06 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 699,840 | 1,422,238 | 2.0322 |
09 Nov 2011 21:38:25 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 673,920 | 1,369,084 | 2.0315 |
08 Nov 2011 15:20:29 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 648,000 | 1,316,641 | 2.0319 |
04 Nov 2011 15:27:25 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 622,080 | 1,263,860 | 2.0317 |
02 Nov 2011 18:01:37 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 596,160 | 1,210,918 | 2.0312 |
31 Oct 2011 18:12:27 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 570,240 | 1,158,096 | 2.0309 |
31 Oct 2011 16:43:46 | 1088606 | 13328705 | hadcm3n_o53l_1940_40_007267315_2 | 544,320 | 1,105,481 | 2.0309 |
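The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table; the helper names `parse_int` and `seconds_per_timestep` are hypothetical, and the formula is an inference from the figures rather than a documented CPDN definition.

```python
def parse_int(text: str) -> int:
    """Parse an integer written with thousands separators, e.g. '1,036,800'."""
    return int(text.replace(",", ""))


def seconds_per_timestep(cpu_time: str, timestep: str) -> float:
    """Cumulative CPU seconds divided by cumulative timesteps, to 4 decimals."""
    return round(parse_int(cpu_time) / parse_int(timestep), 4)


# (timestep, CPU time in seconds, reported average) copied from the table.
rows = [
    ("1,036,800", "2,111,706", 2.0368),
    ("1,010,880", "2,058,760", 2.0366),
    ("544,320", "1,105,481", 2.0309),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in rows:
        computed = seconds_per_timestep(cpu_time, timestep)
        print(timestep, computed, reported)
```

The same parsing also shows that consecutive trickles in the table are 25,920 timesteps apart, for example 1,036,800 minus 1,010,880.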