climateprediction.net home page
Task 13544438

Task 13544438

Name hadcm3n_ycgj_1900_40_007519378_0
Workunit 7716853
Created 28 Oct 2011, 13:02:38 UTC
Sent 19 Nov 2011, 10:22:26 UTC
Report deadline 18 Feb 2012, 17:49:37 UTC
Received 6 Dec 2011, 5:59:26 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION
Computer ID 1054144
Run time 10 days 11 hours 14 min 11 sec
CPU time 9 days 19 hours 24 min 51 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.76 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
 - exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=680, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2588, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4908, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3792, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
10:54:57 (5088): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:11:56 (5020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3852, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:49:07 (2740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2144, iMonCtr=1
Model crash detected, will try to restart...
08:21:25 (5060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:03:34 (2828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:03:36 (2828): No heartbeat from core client for 30 sec - exiting
13:03:37 (2828): No heartbeat from core client for 30 sec - exiting
13:03:38 (2828): No heartbeat from core client for 30 sec - exiting
13:03:39 (2828): No heartbeat from core client for 30 sec - exiting
13:03:40 (2828): No heartbeat from core client for 30 sec - exiting
13:03:41 (2828): No heartbeat from core client for 30 sec - exiting
13:03:42 (2828): No heartbeat from core client for 30 sec - exiting
13:03:43 (2828): No heartbeat from core client for 30 sec - exiting
13:03:44 (2828): No heartbeat from core client for 30 sec - exiting
13:03:45 (2828): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4536, iMonCtr=1
Model crash detected, will try to restart...
07:14:17 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
13:58:52 (1264): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
22:03:02 (4008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:57:08 (4280): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/ycgjko.pjb8c10
Error converting file to netcdf: dataout/ycgjko.pib8c10
Error converting file to netcdf: dataout/ycgjko.pfb8c10
Error converting file to netcdf: dataout/ycgjka.phb8c10
Error converting file to netcdf: dataout/ycgjka.pgb8c10
Error converting file to netcdf: dataout/ycgjka.peb8c10
Error converting file to netcdf: dataout/ycgjka.pdb8c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Ocean Restart file copy failed on ycgjko.dab9130
Ocean Restart file copy failed on ycgjko.dab9190
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77593A93 read attempt to address 0x40B267AC

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
06 Dec 2011 06:04:00 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 518,400 847,474 1.6348
04 Dec 2011 21:48:06 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 492,480 804,964 1.6345
03 Dec 2011 21:01:41 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 466,560 762,368 1.6340
03 Dec 2011 07:20:56 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 440,640 720,488 1.6351
02 Dec 2011 07:26:30 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 414,720 678,033 1.6349
01 Dec 2011 10:51:56 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 388,800 635,740 1.6351
30 Nov 2011 11:57:31 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 362,880 593,825 1.6364
29 Nov 2011 14:22:14 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 336,960 552,217 1.6388
28 Nov 2011 13:16:50 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 311,040 509,673 1.6386
27 Nov 2011 17:38:11 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 285,120 467,557 1.6399
26 Nov 2011 18:27:16 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 259,200 425,212 1.6405
25 Nov 2011 19:35:26 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 233,280 383,025 1.6419
25 Nov 2011 06:20:18 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 207,360 340,985 1.6444
24 Nov 2011 11:17:18 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 181,440 298,851 1.6471
23 Nov 2011 14:38:41 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 155,520 256,372 1.6485
23 Nov 2011 06:12:05 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 129,600 214,401 1.6543
22 Nov 2011 13:30:09 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 103,680 171,952 1.6585
21 Nov 2011 18:01:01 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 77,760 128,989 1.6588
20 Nov 2011 21:20:26 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 51,840 85,674 1.6527
20 Nov 2011 08:22:43 1054144 13544438 hadcm3n_ycgj_1900_40_007519378_0 25,920 42,890 1.6547


©2024 climateprediction.net