Name | hadcm3n_o1lf_1900_40_007197398_2 |
Workunit | 7395678 |
Created | 4 Jul 2011, 11:07:51 UTC |
Sent | 4 Jul 2011, 11:09:29 UTC |
Report deadline | 3 Oct 2011, 18:36:40 UTC |
Received | 18 Aug 2011, 0:11:48 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1058020 |
Run time | 24 days 3 hours 49 min 47 sec |
CPU time | 24 days 3 hours 49 min 47 sec |
Validate state | Invalid |
Credit | 6,531.84 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4648, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7564, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8640, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10272, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2932, iMonCtr=1 Model crash detected, will try to restart... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=616, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3276, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/o1lfko.pjc1c10 Error converting file to netcdf: dataout/o1lfko.pic1c10 Error converting file to netcdf: dataout/o1lfko.pfc1c10 Error converting file to netcdf: dataout/o1lfka.phc1c10 Error converting file to netcdf: dataout/o1lfka.pgc1c10 Error converting file to netcdf: dataout/o1lfka.pec1c10 Error converting file to netcdf: dataout/o1lfka.pdc1c10 BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - 
Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Called boinc_finish </stderr_txt> ]]> |

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
16 Aug 2011 02:37:37 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 544,320 | 2,071,672 | 3.8060 |
14 Aug 2011 03:18:47 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 518,400 | 1,968,366 | 3.7970 |
11 Aug 2011 19:52:04 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 492,480 | 1,870,567 | 3.7983 |
10 Aug 2011 09:42:35 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 466,560 | 1,783,099 | 3.8218 |
09 Aug 2011 00:09:06 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 440,640 | 1,692,488 | 3.8410 |
07 Aug 2011 12:48:54 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 414,720 | 1,597,384 | 3.8517 |
06 Aug 2011 00:22:05 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 388,800 | 1,502,055 | 3.8633 |
04 Aug 2011 10:38:50 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 362,880 | 1,401,298 | 3.8616 |
01 Aug 2011 19:48:40 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 336,960 | 1,297,665 | 3.8511 |
31 Jul 2011 03:22:51 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 311,040 | 1,192,554 | 3.8341 |
29 Jul 2011 14:42:48 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 285,120 | 1,093,195 | 3.8342 |
28 Jul 2011 07:31:17 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 259,200 | 1,005,287 | 3.8784 |
26 Jul 2011 16:17:22 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 233,280 | 899,997 | 3.8580 |
25 Jul 2011 22:13:55 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 207,360 | 796,278 | 3.8401 |
25 Jul 2011 20:26:53 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 181,440 | 690,389 | 3.8051 |
25 Jul 2011 19:01:12 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 155,520 | 580,870 | 3.7350 |
25 Jul 2011 18:16:43 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 129,600 | 476,806 | 3.6791 |
25 Jul 2011 17:17:52 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 103,680 | 373,736 | 3.6047 |
25 Jul 2011 15:47:07 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 77,760 | 272,403 | 3.5031 |
25 Jul 2011 13:32:32 | 1058020 | 13061914 | hadcm3n_o1lf_1900_40_007197398_2 | 51,840 | 165,304 | 3.1887 |
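
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is inferred from the figures in the table, not stated by the page. A minimal sketch that checks it against a few rows copied from the table; the function name `average_sec_per_timestep` is hypothetical:

```python
# Rows copied from the trickle table above:
# (timestep, cumulative CPU seconds, Average sec/TS as reported)
TRICKLES = [
    (544_320, 2_071_672, 3.8060),
    (518_400, 1_968_366, 3.7970),
    (77_760, 272_403, 3.5031),
    (51_840, 165_304, 3.1887),
]


def average_sec_per_timestep(timestep: int, cpu_seconds: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places.

    Assumption: this is how the page derives its Average (sec/TS) column.
    """
    return round(cpu_seconds / timestep, 4)


for timestep, cpu_seconds, reported in TRICKLES:
    computed = average_sec_per_timestep(timestep, cpu_seconds)
    print(f"{timestep:>9,} TS  {cpu_seconds:>11,} s  "
          f"computed {computed:.4f}  table {reported:.4f}")
```

Because both inputs are cumulative, the column is a running average since the start of the task rather than the cost of the most recent block of timesteps.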
©2024 cpdn.org