climateprediction.net home page
Task 15611365

Task 15611365

Name hadcm3n_3cy5_1940_40_008258126_3
Workunit 8413250
Created 17 Feb 2013, 12:22:07 UTC
Sent 17 Feb 2013, 12:22:22 UTC
Report deadline 19 May 2013, 19:49:33 UTC
Received 28 Mar 2013, 16:10:13 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1127622
Run time 15 days 16 hours 48 min 38 sec
CPU time 15 days 10 hours 40 min 34 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 2.82 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.31</core_client_version>
<![CDATA[
<message>
Das Laufwerk kann einen bestimmten Bereich oder eine bestimmte Spur nicht finden. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1
Model crash detected, Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5884, iMonCtr=1
Model crash detected, will try to restart...
22:17:20 (3056): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
01:37:40 (10296): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:47:45 (5692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5492, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:44:09 (4988): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:24:49 (6656): No heartbeat from core client for 30 sec - exiting
08:24:50 (6656): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:54:26 (2924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:42:01 (6512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5632, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/3cy5ko.pjg9c10
Error converting file to netcdf: dataout/3cy5ko.pig9c10
Error converting file to netcdf: dataout/3cy5ko.pfg9c10
Error converting file to netcdf: dataout/3cy5ka.phg9c10
Error converting file to netcdf: dataout/3cy5ka.pgg9c10
Error converting file to netcdf: dataout/3cy5ka.peg9c10
13:59:21 (6780): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
15:01:36 (9440): Can't acquire lockfile (32) - waiting 35s
15:01:59 (6036): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
08:47:19 (7660): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:38:19 (4384): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4156, iMonCtr=1
Model crash detected, will try to restart...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
28 Mar 2013 11:19:47 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 881,280 1,318,055 1.4956
27 Mar 2013 09:47:42 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 855,360 1,262,642 1.4762
25 Mar 2013 14:20:20 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 829,440 1,195,257 1.4410
22 Mar 2013 11:24:49 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 803,520 1,135,575 1.4133
21 Mar 2013 11:29:53 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 777,600 1,096,228 1.4098
20 Mar 2013 07:56:52 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 751,680 1,050,105 1.3970
19 Mar 2013 08:44:01 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 725,760 1,015,619 1.3994
17 Mar 2013 13:30:07 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 699,840 981,811 1.4029
14 Mar 2013 11:40:38 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 673,920 944,783 1.4019
13 Mar 2013 14:30:20 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 648,000 907,020 1.3997
12 Mar 2013 12:17:32 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 622,080 873,654 1.4044
11 Mar 2013 13:30:14 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 596,160 841,226 1.4111
10 Mar 2013 13:35:10 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 570,240 807,314 1.4157
09 Mar 2013 17:20:54 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 544,320 768,692 1.4122
08 Mar 2013 10:16:55 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 518,400 723,314 1.3953
07 Mar 2013 12:37:07 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 492,480 688,693 1.3984
05 Mar 2013 18:17:00 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 466,560 651,218 1.3958
05 Mar 2013 08:05:42 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 440,640 615,439 1.3967
04 Mar 2013 12:09:25 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 414,720 579,916 1.3983
01 Mar 2013 21:27:46 1127622 15611365 hadcm3n_3cy5_1940_40_008258126_3 388,800 544,447 1.4003


©2024 cpdn.org