Task 12802433

Name hadcm3n_o2w3_1900_40_007199078_2
Workunit 7397358
Created 20 Apr 2011, 19:19:22 UTC
Sent 20 Apr 2011, 19:19:33 UTC
Report deadline 21 Jul 2011, 2:46:44 UTC
Received 26 Jun 2011, 23:10:36 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1290254
Run time 17 days 3 hours 22 min 34 sec
CPU time 17 days 4 hours 14 min 23 sec
Validate state Invalid
Credit 9,642.24
Device peak FLOPS 1.94 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.60</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o2w3ko.pja7c10
Error converting file to netcdf: dataout/o2w3ko.pia7c10
Error converting file to netcdf: dataout/o2w3ko.pfa7c10
Error converting file to netcdf: dataout/o2w3ka.pha7c10
Error converting file to netcdf: dataout/o2w3ka.pga7c10
Error converting file to netcdf: dataout/o2w3ka.pea7c10
Error converting file to netcdf: dataout/o2w3ka.pda7c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
06:40:15 (1408): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=120, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:14:49 (3880): No heartbeat from core client for 30 sec - exiting
21:14:50 (3880): No heartbeat from core client for 30 sec - exiting
21:14:51 (3880): No heartbeat from core client for 30 sec - exiting
21:14:52 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:14:53 (3880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6808, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:06:33 (6208): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:05:37 (6756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6956, selfPID=6956, iMonCtr=1
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o2w3ko.pjc5c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
01:47:02 (5904): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:47:56 (1320): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:52:01 (6784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:09:24 (6812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:53:15 (6232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:18:05 (4708): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:07:36 (6056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3124, iMonCtr=1
Model crash detected, will try to restart...
18:53:44 (3616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:55:43 (1400): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:43:02 (348): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6976, iMonCtr=1
Model crash detected, will try to restart...
10:17:50 (5308): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:20:08 (384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:51:32 (5528): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:22:17 (4048): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
02:24:52 (1828): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
16:40:52 (4252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
16:58:07 (1908): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:40:33 (5252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:29:13 (3452): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:40:58 (696): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
22 Jun 2011 23:18:36 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 803,520 | 1,443,709 | 1.7967
19 Jun 2011 22:02:27 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 777,600 | 1,401,068 | 1.8018
19 Jun 2011 22:02:27 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 751,680 | 1,354,672 | 1.8022
17 Jun 2011 02:56:15 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 725,760 | 1,306,392 | 1.8000
16 Jun 2011 02:43:53 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 699,840 | 1,256,759 | 1.7958
06 Jun 2011 14:39:42 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 673,920 | 1,205,859 | 1.7893
04 Jun 2011 17:03:42 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 648,000 | 1,154,845 | 1.7822
02 Jun 2011 17:56:36 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 622,080 | 1,105,781 | 1.7776
01 Jun 2011 05:34:41 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 596,160 | 1,056,737 | 1.7726
30 May 2011 11:44:55 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 570,240 | 1,009,914 | 1.7710
23 May 2011 09:47:42 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 544,320 | 960,593 | 1.7648
22 May 2011 20:52:23 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 518,400 | 914,207 | 1.7635
21 May 2011 23:46:14 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 492,480 | 868,464 | 1.7635
21 May 2011 10:53:20 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 466,560 | 822,281 | 1.7624
20 May 2011 17:46:38 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 440,640 | 775,465 | 1.7599
20 May 2011 04:52:57 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 414,720 | 728,539 | 1.7567
19 May 2011 08:33:38 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 388,800 | 680,638 | 1.7506
18 May 2011 11:51:09 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 362,880 | 632,901 | 1.7441
17 May 2011 20:30:49 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 336,960 | 585,409 | 1.7373
07 May 2011 20:39:55 | 1144708 | 12802433 | hadcm3n_o2w3_1900_40_007199078_2 | 311,040 | 536,715 | 1.7255
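
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. The sketch below checks that reading against three rows copied from the table. The function name `average_sec_per_timestep` is hypothetical and is not part of BOINC or the CPDN software.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals.

    Assumption: this is how the 'Average (sec/TS)' column is derived;
    the page itself does not state the formula.
    """
    return round(cpu_time_sec / timestep, 4)


# (timestep, cpu_time_sec, reported_average) copied from the trickle table
rows = [
    (803_520, 1_443_709, 1.7967),
    (777_600, 1_401_068, 1.8018),
    (311_040, 536_715, 1.7255),
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"{timestep:>9,} timesteps: computed {computed:.4f}, reported {reported:.4f}")
```

For these three rows the computed value matches the reported one, which is consistent with the column being a running average over the whole task rather than a per-trickle rate.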