climateprediction.net home page
Task 14224189

Task 14224189

Name hadcm3n_u0t0_1980_40_007753850_3
Workunit 7908959
Created 6 Mar 2012, 3:06:34 UTC
Sent 6 Mar 2012, 3:06:49 UTC
Report deadline 5 Jun 2012, 10:34:00 UTC
Received 25 May 2012, 2:47:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1166432
Run time 18 days 6 hours 7 min 47 sec
CPU time 17 days 20 hours 32 min 11 sec
Validate state Invalid
Credit 11,819.52
Device peak FLOPS 2.94 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3064, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3064, iMonCtr=1
Model crash detected, will try to restart...
16:32:19 (4780): Can't acquire lockfile (32) - waiting 35s
16:32:22 (3212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3404, iMonCtr=1
Model crash detected, will try to restart...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/u0t0ko.pjl1c10
Error converting file to netcdf: dataout/u0t0ko.pil1c10
Error converting file to netcdf: dataout/u0t0ko.pfl1c10
Error converting file to netcdf: dataout/u0t0ka.phl1c10
Error converting file to netcdf: dataout/u0t0ka.pgl1c10
Error converting file to netcdf: dataout/u0t0ka.pel1c10
Error converting file to netcdf: dataout/u0t0ka.pdl1c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CNo Process Handle
Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4348, selfPID=4348, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/u0t0ko.pjl7c10
Error converting file to netcdf: dataout/u0t0ko.pil7c10
Error converting file to netcdf: dataout/u0t0ko.pfl7c10
Error converting file to netcdf: dataout/u0t0ka.phl7c10
Error converting file to netcdf: dataout/u0t0ka.pgl7c10
Error converting file to netcdf: dataout/u0t0ka.pel7c10
Error converting file to netcdf: dataout/u0t0ka.pdl7c10
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3352, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2664, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
24 May 2012 04:13:52 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 984,960 1,523,250 1.5465
22 May 2012 04:48:15 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 959,040 1,484,482 1.5479
19 May 2012 06:47:19 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 933,120 1,444,316 1.5478
18 May 2012 16:01:44 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 907,200 1,403,239 1.5468
16 May 2012 04:57:54 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 881,280 1,362,028 1.5455
12 May 2012 01:17:04 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 855,360 1,322,928 1.5466
11 May 2012 07:30:24 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 829,440 1,283,630 1.5476
05 May 2012 07:16:33 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 803,520 1,243,349 1.5474
02 May 2012 05:16:23 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 777,600 1,202,309 1.5462
30 Apr 2012 05:29:50 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 751,680 1,161,234 1.5449
28 Apr 2012 23:19:52 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 725,760 1,120,263 1.5436
17 Apr 2012 18:48:17 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 699,840 1,080,814 1.5444
17 Apr 2012 03:48:41 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 673,920 1,040,407 1.5438
15 Apr 2012 15:11:02 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 648,000 998,960 1.5416
13 Apr 2012 07:10:15 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 622,080 958,006 1.5400
12 Apr 2012 03:53:56 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 596,160 916,528 1.5374
10 Apr 2012 22:40:18 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 570,240 876,282 1.5367
10 Apr 2012 03:25:25 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 544,320 835,954 1.5358
07 Apr 2012 07:30:37 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 518,400 793,805 1.5313
03 Apr 2012 06:26:48 1166432 14224189 hadcm3n_u0t0_1980_40_007753850_3 492,480 754,901 1.5329


©2024 cpdn.org