climateprediction.net home page
Task 15501156

Task 15501156

Name hadcm3n_3esj_1940_40_008268790_0
Workunit 8423914
Created 23 Dec 2012, 14:01:12 UTC
Sent 23 Dec 2012, 14:03:57 UTC
Report deadline 24 Mar 2013, 21:31:08 UTC
Received 24 Mar 2013, 9:37:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 941579
Run time 15 days 10 hours 14 min 44 sec
CPU time 15 days 10 hours 14 min 44 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.05 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.2.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
22:12:25 (892): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:11:13 (3608): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:10:02 (1836): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:08:52 (460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:06:31 (5964): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
04:05:23 (5040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
09:04:11 (3692): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:02:59 (4168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:01:47 (3416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:00:37 (6052): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:59:26 (5556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:58:13 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:19:41 (4020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
17:19:42 (4020): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=328, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:39:29 (2616): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1772, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Mar 2013 18:37:34 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 648,000 1,296,069 2.0001
18 Mar 2013 14:10:15 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 622,080 1,242,346 1.9971
11 Mar 2013 19:08:43 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 596,160 1,188,345 1.9933
16 Jan 2013 10:38:12 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 570,240 1,138,490 1.9965
15 Jan 2013 18:55:39 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 544,320 1,086,634 1.9963
15 Jan 2013 02:27:07 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 518,400 1,032,822 1.9923
14 Jan 2013 10:39:05 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 492,480 980,980 1.9919
13 Jan 2013 18:51:18 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 466,560 929,173 1.9915
13 Jan 2013 00:23:05 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 440,640 873,956 1.9834
12 Jan 2013 08:24:11 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 414,720 822,137 1.9824
11 Jan 2013 17:06:01 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 388,800 771,812 1.9851
11 Jan 2013 01:38:05 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 362,880 720,351 1.9851
10 Jan 2013 10:13:53 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 336,960 669,616 1.9872
09 Jan 2013 19:01:34 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 311,040 619,496 1.9917
09 Jan 2013 03:39:08 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 285,120 569,110 1.9960
08 Jan 2013 11:57:39 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 259,200 518,615 2.0008
07 Jan 2013 20:07:27 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 233,280 466,770 2.0009
07 Jan 2013 03:59:59 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 207,360 414,439 1.9986
06 Jan 2013 11:41:45 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 181,440 361,267 1.9911
31 Dec 2012 14:55:47 941579 15501156 hadcm3n_3esj_1940_40_008268790_0 155,520 305,196 1.9624


©2024 cpdn.org