climateprediction.net home page
Task 13061914

Task 13061914

Name hadcm3n_o1lf_1900_40_007197398_2
Workunit 7395678
Created 4 Jul 2011, 11:07:51 UTC
Sent 4 Jul 2011, 11:09:29 UTC
Report deadline 3 Oct 2011, 18:36:40 UTC
Received 18 Aug 2011, 0:11:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1058020
Run time 24 days 3 hours 49 min 47 sec
CPU time 24 days 3 hours 49 min 47 sec
Validate state Invalid
Credit 6,531.84
Device peak FLOPS 2.23 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4648, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7564, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8640, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10272, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=616, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3276, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/o1lfko.pjc1c10
Error converting file to netcdf: dataout/o1lfko.pic1c10
Error converting file to netcdf: dataout/o1lfko.pfc1c10
Error converting file to netcdf: dataout/o1lfka.phc1c10
Error converting file to netcdf: dataout/o1lfka.pgc1c10
Error converting file to netcdf: dataout/o1lfka.pec1c10
Error converting file to netcdf: dataout/o1lfka.pdc1c10
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Aug 2011 02:37:37 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 544,320 2,071,672 3.8060
14 Aug 2011 03:18:47 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 518,400 1,968,366 3.7970
11 Aug 2011 19:52:04 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 492,480 1,870,567 3.7983
10 Aug 2011 09:42:35 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 466,560 1,783,099 3.8218
09 Aug 2011 00:09:06 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 440,640 1,692,488 3.8410
07 Aug 2011 12:48:54 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 414,720 1,597,384 3.8517
06 Aug 2011 00:22:05 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 388,800 1,502,055 3.8633
04 Aug 2011 10:38:50 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 362,880 1,401,298 3.8616
01 Aug 2011 19:48:40 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 336,960 1,297,665 3.8511
31 Jul 2011 03:22:51 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 311,040 1,192,554 3.8341
29 Jul 2011 14:42:48 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 285,120 1,093,195 3.8342
28 Jul 2011 07:31:17 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 259,200 1,005,287 3.8784
26 Jul 2011 16:17:22 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 233,280 899,997 3.8580
25 Jul 2011 22:13:55 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 207,360 796,278 3.8401
25 Jul 2011 20:26:53 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 181,440 690,389 3.8051
25 Jul 2011 19:01:12 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 155,520 580,870 3.7350
25 Jul 2011 18:16:43 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 129,600 476,806 3.6791
25 Jul 2011 17:17:52 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 103,680 373,736 3.6047
25 Jul 2011 15:47:07 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 77,760 272,403 3.5031
25 Jul 2011 13:32:32 1058020 13061914 hadcm3n_o1lf_1900_40_007197398_2 51,840 165,304 3.1887


©2024 cpdn.org