Task 13927403

Name hadcm3n_yg97_1940_40_007683288_2
Workunit 7838375
Created 16 Jan 2012, 6:47:38 UTC
Sent 16 Jan 2012, 15:12:52 UTC
Report deadline 16 Apr 2012, 22:40:03 UTC
Received 16 Feb 2012, 19:28:21 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 950229
Run time 21 days 19 hours 10 min 28 sec
CPU time 21 days 7 hours 20 min 20 sec
Validate state Invalid
Credit 8,087.04
Device peak FLOPS 1.97 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2012, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3460, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3888, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3740, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5184, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3080, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6084, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yg97_1940_40_007683288/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6584, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
16 Feb 2012 08:18:27  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  673,920   1,800,849       2.6722
15 Feb 2012 12:00:05  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  648,000   1,730,509       2.6705
14 Feb 2012 16:16:08  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  622,080   1,660,285       2.6689
13 Feb 2012 20:30:41  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  596,160   1,590,420       2.6678
13 Feb 2012 01:11:13  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  570,240   1,520,695       2.6668
12 Feb 2012 05:39:47  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  544,320   1,450,974       2.6657
11 Feb 2012 09:09:08  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  518,400   1,381,237       2.6644
10 Feb 2012 13:28:21  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  492,480   1,311,500       2.6631
09 Feb 2012 17:46:48  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  466,560   1,242,281       2.6626
08 Feb 2012 22:24:24  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  440,640   1,173,107       2.6623
08 Feb 2012 02:15:39  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  414,720   1,103,841       2.6617
07 Feb 2012 06:39:02  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  388,800   1,034,587       2.6610
06 Feb 2012 10:26:02  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  362,880   965,195         2.6598
05 Feb 2012 14:20:24  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  336,960   895,643         2.6580
04 Feb 2012 18:42:38  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  311,040   825,907         2.6553
03 Feb 2012 22:56:09  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  285,120   756,360         2.6528
03 Feb 2012 03:27:09  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  259,200   687,071         2.6507
02 Feb 2012 07:35:43  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  233,280   618,383         2.6508
01 Feb 2012 12:23:55  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  207,360   550,119         2.6530
31 Jan 2012 16:15:16  950229   13927403   hadcm3n_yg97_1940_40_007683288_2  181,440   481,233         2.6523
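
The Average (sec/TS) column in the trickle table above appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not state this, so treat it as an assumption. The sketch below checks that assumption against three rows copied from the table; the function name `seconds_per_timestep` and the `rows` list are illustrative and not part of any CPDN or BOINC tool.

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Average CPU seconds per model timestep (assumed definition of the column)."""
    return cpu_seconds / timesteps


# (timestep, CPU time in seconds, listed average) copied from the trickle table.
rows = [
    (673_920, 1_800_849, 2.6722),  # 16 Feb 2012 08:18:27
    (648_000, 1_730_509, 2.6705),  # 15 Feb 2012 12:00:05
    (181_440, 481_233, 2.6523),    # 31 Jan 2012 16:15:16
]

for timesteps, cpu_seconds, listed in rows:
    computed = seconds_per_timestep(cpu_seconds, timesteps)
    # The table shows four decimal places, so compare to within half a unit
    # of the last printed digit.
    agrees = abs(computed - listed) < 5e-5
    print(f"{timesteps:>7} steps: computed {computed:.4f}, listed {listed:.4f}, agrees={agrees}")
```

For these three rows the computed ratio matches the listed value to four decimal places, which is consistent with the assumed definition but does not confirm how the server actually derives the column.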