Task 13108741

Name hadcm3n_yf3u_1900_40_007352436_0
Workunit 7549866
Created 6 Jul 2011, 14:20:32 UTC
Sent 15 Jul 2011, 22:54:44 UTC
Report deadline 15 Oct 2011, 6:21:55 UTC
Received 30 Jul 2011, 10:25:40 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1158166
Run time 8 days 23 hours 30 min 22 sec
CPU time 8 days 12 hours 41 min 32 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.90 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:18:59 (6436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:20:32 (6924): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Error converting file to netcdf: dataout/yf3uko.pjb5c10
Error converting file to netcdf: dataout/yf3uko.pib5c10
Error converting file to netcdf: dataout/yf3uko.pfb5c10
Error converting file to netcdf: dataout/yf3uka.phb5c10
Error converting file to netcdf: dataout/yf3uka.pgb5c10
Error converting file to netcdf: dataout/yf3uka.peb5c10
Error converting file to netcdf: dataout/yf3uka.pdb5c10
CPDN Monitor - Quit request from BOINC...
21:09:21 (16284): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15284, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1044, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=880, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
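The stderr text above is long and repetitive, so a tally of each event type can make it easier to read. The following is a minimal sketch, not part of BOINC or the CPDN client: the function name tally_events, the MARKERS table, and the short sample excerpt are hypothetical, and the marker substrings are simply copied from lines that appear in the log.

```python
from collections import Counter

# Hypothetical marker table for this sketch. Each substring is copied from
# a line in the stderr text above; the keys are made-up labels.
MARKERS = {
    "quit_request": "Quit request from BOINC",
    "suspend_request": "Suspend request from BOINC",
    "no_heartbeat": "No heartbeat from core client",
    "model_crash": "Model crash detected",
    "gave_up": "too many model crashes",
}


def tally_events(stderr_text):
    """Count how many lines contain each marker substring."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for name, marker in MARKERS.items():
            if marker in line:
                counts[name] += 1
    return counts


# A short excerpt of the log, used only to exercise the function.
sample = """\
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
23:18:59 (6436): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

counts = tally_events(sample)
for name in MARKERS:
    print(name, counts[name])
```

Passing the full contents of the stderr_txt element instead of the excerpt would give the totals for this task.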
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
28 Jul 2011 17:27:20  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  388,800   652,896         1.6793
27 Jul 2011 23:12:11  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  362,880   610,017         1.6810
27 Jul 2011 08:05:51  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  336,960   567,567         1.6844
25 Jul 2011 19:39:30  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  311,040   525,850         1.6906
25 Jul 2011 19:08:29  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  285,120   482,426         1.6920
25 Jul 2011 19:08:29  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  259,200   437,333         1.6872
25 Jul 2011 19:08:29  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  233,280   392,397         1.6821
25 Jul 2011 18:55:27  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  207,360   348,621         1.6812
25 Jul 2011 18:14:27  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  181,440   304,334         1.6773
25 Jul 2011 17:53:20  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  155,520   260,489         1.6750
25 Jul 2011 17:33:51  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  129,600   216,757         1.6725
25 Jul 2011 17:21:24  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  103,680   173,357         1.6720
25 Jul 2011 16:18:10  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  77,760    129,284         1.6626
25 Jul 2011 15:53:59  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  51,840    85,426          1.6479
25 Jul 2011 15:37:03  1158166  13108741   hadcm3n_yf3u_1900_40_007352436_0  25,920    42,201          1.6281
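
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. That is an assumption rather than something the page states, but the listed values are consistent with it. A small sketch that recomputes the figure for the first and last trickles (the function name average_sec_per_timestep is hypothetical):

```python
def average_sec_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds divided by timesteps, rounded to 4 places.

    Assumption: this is how the Average (sec/TS) column is derived.
    """
    return round(cpu_seconds / timestep, 4)


# (timestep, CPU time in seconds) copied from the last and first trickles.
trickles = [
    (388_800, 652_896),
    (25_920, 42_201),
]

for timestep, cpu_seconds in trickles:
    avg = average_sec_per_timestep(cpu_seconds, timestep)
    print(f"{timestep:>7,} timesteps: {avg:.4f} sec/TS")
# Gives 1.6793 and 1.6281, matching the table.
```

The timestep count also rises by 25,920 between consecutive trickles, so the 15 rows cover 15 x 25,920 = 388,800 timesteps.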


©2024 climateprediction.net