climateprediction.net home page
Task 19319586

Task 19319586

Name hadcm3n_larw_194012_480_353_010335561_1
Workunit 10335561
Created 27 Feb 2016, 13:52:16 UTC
Sent 27 Feb 2016, 13:59:17 UTC
Report deadline 8 Feb 2017, 19:19:17 UTC
Received 6 Jul 2016, 19:42:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID 991976
Run time 12 days 7 hours 45 min 40 sec
CPU time 7 days 3 hours 52 min 55 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 2.01 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1648, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3764, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
01:19:12 (588): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
01:19:13 (588): No heartbeat from core client for 30 sec - exiting
01:19:14 (588): No heartbeat from core client for 30 sec - exiting
01:19:15 (588): No heartbeat from core client for 30 sec - exiting
01:19:16 (588): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:27:17 (2232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3316, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2612, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3420, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3600, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4192, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1700, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3608, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/larwko.pje8c10
Error converting file to netcdf: dataout/larwko.pie8c10
Error converting file to netcdf: dataout/larwko.pfe8c10
Error converting file to netcdf: dataout/larwka.phe8c10
Error converting file to netcdf: dataout/larwka.pge8c10
Error converting file to netcdf: dataout/larwka.pee8c10
Error converting file to netcdf: dataout/larwka.pde8c10
23:28:37 (3444): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:28:39 (3444): No heartbeat from core client for 30 sec - exiting
23:28:40 (3444): No heartbeat from core client for 30 sec - exiting
23:28:41 (3444): No heartbeat from core client for 30 sec - exiting
23:28:42 (3444): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2764, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2608, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3048, iMonCtr=1
Model crash detected, will try to restart...
04:39:21 (3800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:39:22 (3800): No heartbeat from core client for 30 sec - exiting
04:39:23 (3800): No heartbeat from core client for 30 sec - exiting
04:39:24 (3800): No heartbeat from core client for 30 sec - exiting
04:39:25 (3800): No heartbeat from core client for 30 sec - exiting
04:39:27 (3800): No heartbeat from core client for 30 sec - exiting


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77337A45 read attempt to address 0xFFFFFFF8

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Jul 2016 14:44:59 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 259,200 618,768 2.3872
12 Jul 2016 14:43:24 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 233,280 557,027 2.3878
14 Jun 2016 21:14:15 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 207,360 492,833 2.3767
01 Jun 2016 03:04:17 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 181,440 431,684 2.3792
26 May 2016 13:08:23 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 155,520 372,613 2.3959
06 Apr 2016 19:43:41 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 129,600 311,326 2.4022
31 Mar 2016 08:22:09 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 103,680 249,313 2.4046
30 Mar 2016 12:10:49 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 77,760 187,963 2.4172
08 Mar 2016 18:44:18 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 51,840 125,986 2.4303
28 Feb 2016 10:35:34 991976 19319586 hadcm3n_larw_194012_480_353_010335561_1 25,920 61,959 2.3904


©2024 cpdn.org