climateprediction.net home page
Task 14109764

Task 14109764

Name hadcm3n_ycm6_1900_40_007519255_3
Workunit 7716730
Created 18 Feb 2012, 21:19:20 UTC
Sent 18 Feb 2012, 21:20:14 UTC
Report deadline 20 May 2012, 4:47:25 UTC
Received 2 Sep 2012, 16:00:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1042736
Run time 11 days 20 hours 39 min
CPU time 8 days 21 hours 53 min 56 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 1.97 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3104, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2592, iMonCtr=1
Model crash detected, will try to restart...
10:43:52 (9436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:58:01 (9712): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:10:07 (9556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:44:39 (8880): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:17:30 (10496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:47:46 (9212): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:17:45 (9920): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:38:03 (7324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:38:04 (7324): No heartbeat from core client for 30 sec - exiting
13:38:05 (7324): No heartbeat from core client for 30 sec - exiting
CoSignal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5640, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_ycm6_1900_40_007519255/dataout/ocean_restart.day after 11 attempts
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5156, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
05 Jul 2012 22:02:09 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 388,800 760,905 1.9571
05 Jul 2012 05:32:56 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 362,880 710,049 1.9567
28 May 2012 13:50:55 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 336,960 658,531 1.9543
27 May 2012 20:48:08 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 311,040 608,769 1.9572
23 May 2012 22:46:29 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 285,120 559,626 1.9628
03 Apr 2012 08:49:16 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 259,200 511,030 1.9716
02 Apr 2012 15:35:40 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 233,280 461,097 1.9766
02 Apr 2012 00:25:08 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 207,360 412,297 1.9883
01 Apr 2012 07:48:36 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 181,440 363,149 2.0015
31 Mar 2012 15:01:20 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 155,520 314,014 2.0191
04 Mar 2012 07:06:45 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 129,600 262,106 2.0224
27 Feb 2012 19:37:43 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 103,680 208,922 2.0151
27 Feb 2012 04:47:20 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 77,760 158,220 2.0347
20 Feb 2012 11:18:02 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 51,840 103,082 1.9885
19 Feb 2012 16:34:10 1042736 14109764 hadcm3n_ycm6_1900_40_007519255_3 25,920 50,786 1.9593


©2024 climateprediction.net