climateprediction.net home page
Task 13102486

Task 13102486

Name hadcm3n_ycoz_1900_40_007349309_1
Workunit 7546739
Created 6 Jul 2011, 13:57:13 UTC
Sent 17 Jul 2011, 19:33:03 UTC
Report deadline 17 Oct 2011, 3:00:14 UTC
Received 8 Aug 2011, 2:08:35 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1083997
Run time 7 days 2 hours 33 min 42 sec
CPU time 6 days 21 hours 37 min 33 sec
Validate state Invalid
Credit 3,732.48
Device peak FLOPS 2.05 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.12.33</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2888, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
20:17:22 (5608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5404, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
07 Aug 2011 13:09:11 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 311,040 574,227 1.8462
06 Aug 2011 18:31:40 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 285,120 527,710 1.8508
05 Aug 2011 21:53:49 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 259,200 479,931 1.8516
05 Aug 2011 00:56:05 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 233,280 432,329 1.8533
04 Aug 2011 07:53:36 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 207,360 385,096 1.8571
03 Aug 2011 18:35:04 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 181,440 338,268 1.8644
03 Aug 2011 05:07:30 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 155,520 291,051 1.8715
02 Aug 2011 08:47:01 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 129,600 242,596 1.8719
31 Jul 2011 04:55:49 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 103,680 194,728 1.8782
27 Jul 2011 14:02:18 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 77,760 147,475 1.8965
25 Jul 2011 18:54:42 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 51,840 98,553 1.9011
25 Jul 2011 18:14:15 1083997 13102486 hadcm3n_ycoz_1900_40_007349309_1 25,920 49,447 1.9077


©2024 climateprediction.net