climateprediction.net home page
Task 15494417

Task 15494417

Name hadcm3n_z9j4_1880_40_008245314_3
Workunit 8400438
Created 21 Dec 2012, 12:01:08 UTC
Sent 21 Dec 2012, 12:04:11 UTC
Report deadline 22 Mar 2013, 19:31:22 UTC
Received 3 Jan 2013, 16:17:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1295273
Run time 12 days 2 hours 50 min 14 sec
CPU time 10 days 20 hours 59 min 51 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 3.63 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2988, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/z9j4ko.pja6c10
Error converting file to netcdf: dataout/z9j4ko.pia6c10
Error converting file to netcdf: dataout/z9j4ko.pfa6c10
Error converting file to netcdf: dataout/z9j4ka.pha6c10
Error converting file to netcdf: dataout/z9j4ka.pga6c10
Error converting file to netcdf: dataout/z9j4ka.pea6c10
Error converting file to netcdf: dataout/z9j4ka.pda6c10
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/z9j4ko.pja6c10
Error converting file to netcdf: dataout/z9j4ko.pia6c10
Error converting file to netcdf: dataout/z9j4ko.pfa6c10
Error converting file to netcdf: dataout/z9j4ka.pha6c10
Error converting file to netcdf: dataout/z9j4ka.pga6c10
Error converting file to netcdf: dataout/z9j4ka.pea6c10
Error converting file to netcdf: dataout/z9j4ka.pda6c10
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3928, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
03 Jan 2013 00:19:04 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 881,280 939,601 1.0662
02 Jan 2013 14:31:20 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 855,360 911,865 1.0661
02 Jan 2013 06:49:40 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 829,440 884,446 1.0663
01 Jan 2013 21:57:25 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 803,520 856,700 1.0662
01 Jan 2013 14:05:39 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 777,600 828,769 1.0658
01 Jan 2013 06:20:54 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 751,680 801,271 1.0660
31 Dec 2012 22:34:02 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 725,760 773,676 1.0660
31 Dec 2012 14:45:45 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 699,840 745,876 1.0658
31 Dec 2012 06:07:56 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 673,920 718,178 1.0657
30 Dec 2012 20:49:49 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 648,000 691,186 1.0666
30 Dec 2012 12:55:22 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 622,080 664,021 1.0674
30 Dec 2012 05:23:36 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 596,160 636,805 1.0682
29 Dec 2012 21:46:12 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 570,240 609,715 1.0692
29 Dec 2012 17:22:52 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 544,320 584,105 1.0731
29 Dec 2012 17:22:52 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 518,400 559,537 1.0794
28 Dec 2012 21:16:57 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 492,480 530,604 1.0774
28 Dec 2012 11:20:03 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 466,560 501,417 1.0747
28 Dec 2012 01:22:53 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 440,640 472,304 1.0719
27 Dec 2012 17:52:37 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 414,720 442,797 1.0677
27 Dec 2012 09:09:28 1186756 15494417 hadcm3n_z9j4_1880_40_008245314_3 388,800 414,167 1.0652


©2024 cpdn.org