Task 13675436

Name hadcm3n_yd9c_1940_40_007547298_2
Workunit 7744530
Created 30 Nov 2011, 3:44:10 UTC
Sent 30 Nov 2011, 3:45:27 UTC
Report deadline 29 Feb 2012, 11:12:38 UTC
Received 23 Dec 2011, 1:23:29 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1003311
Run time 19 days 3 hours 4 min 38 sec
CPU time 16 days 9 hours 7 min 56 sec
Validate state Invalid
Credit 9,020.16
Device peak FLOPS 2.45 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
03:28:27 (4980): No heartbeat from core client for 30 sec - exiting
03:28:28 (4980): No heartbeat from core client for 30 sec - exiting
03:28:29 (4980): No heartbeat from core client for 30 sec - exiting
03:28:30 (4980): No heartbeat from core client for 30 sec - exiting
03:28:31 (4980): No heartbeat from core client for 30 sec - exiting
03:28:32 (4980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2876, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller::Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/atmos_restart.day after 11 attempts
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_yd9c_1940_40_007547298/dataout/ocean_restart.day after 11 attempts
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=988, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
22 Dec 2011 12:14:28 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 751,680 1,395,701 1.8568
21 Dec 2011 19:58:49 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 725,760 1,345,174 1.8535
21 Dec 2011 03:06:34 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 699,840 1,294,893 1.8503
20 Dec 2011 10:34:45 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 673,920 1,244,644 1.8469
19 Dec 2011 18:49:04 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 648,000 1,193,913 1.8425
19 Dec 2011 00:50:52 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 622,080 1,143,129 1.8376
18 Dec 2011 09:07:34 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 596,160 1,095,475 1.8376
17 Dec 2011 16:35:37 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 570,240 1,044,036 1.8309
16 Dec 2011 23:56:33 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 544,320 992,777 1.8239
16 Dec 2011 07:11:09 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 518,400 941,898 1.8169
15 Dec 2011 14:27:04 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 492,480 890,658 1.8085
14 Dec 2011 23:21:39 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 466,560 840,387 1.8012
14 Dec 2011 23:21:39 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 440,640 791,840 1.7970
13 Dec 2011 14:17:57 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 414,720 745,352 1.7972
12 Dec 2011 23:22:27 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 388,800 698,686 1.7970
12 Dec 2011 08:23:16 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 362,880 652,078 1.7970
11 Dec 2011 17:19:20 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 336,960 605,658 1.7974
11 Dec 2011 02:25:17 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 311,040 559,165 1.7977
10 Dec 2011 11:16:36 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 285,120 512,655 1.7980
09 Dec 2011 20:52:55 1003311 13675436 hadcm3n_yd9c_1940_40_007547298_2 259,200 466,078 1.7981
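
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The page does not state this; it is an assumption inferred from the figures. The sketch below checks it against three rows copied from the table. The function name `average_sec_per_timestep` is made up for illustration.

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
ROWS = [
    (751_680, 1_395_701, 1.8568),
    (725_760, 1_345_174, 1.8535),
    (259_200, 466_078, 1.7981),
]


def average_sec_per_timestep(timestep: int, cpu_time_sec: int) -> float:
    """Cumulative CPU seconds per timestep, rounded to 4 decimal places.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_time_sec / timestep, 4)


def matches_reported(timestep: int, cpu_time_sec: int, reported: float) -> bool:
    """True if the recomputed average agrees with the value shown in the table."""
    return abs(average_sec_per_timestep(timestep, cpu_time_sec) - reported) < 1e-9


if __name__ == "__main__":
    for timestep, cpu_time_sec, reported in ROWS:
        computed = average_sec_per_timestep(timestep, cpu_time_sec)
        print(timestep, cpu_time_sec, computed, reported)
```

Successive rows are 25,920 timesteps apart, so each trickle covers the same amount of model progress. The slow rise in the average, from about 1.80 to about 1.86 sec/TS, therefore means the later timesteps took more CPU time each.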

