Task 15684579

Name hadcm3n_u3wt_2020_40_008336365_0
Workunit 8487226
Created 26 Mar 2013, 20:47:25 UTC
Sent 26 Mar 2013, 20:48:03 UTC
Report deadline 26 Jun 2013, 4:15:14 UTC
Received 23 Apr 2013, 14:14:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1260575
Run time 13 days 21 hours 15 min 20 sec
CPU time 11 days 21 hours 58 min 39 sec
Validate state Invalid
Credit 6,220.80
Device peak FLOPS 2.03 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
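The Run time and CPU time fields above are given as day/hour/minute/second strings, while the trickle table reports CPU time in plain seconds. A minimal sketch of converting between the two, assuming the field always follows the "N days N hours N min N sec" layout shown here (the helper name `parse_duration` is hypothetical, not part of BOINC or CPDN):

```python
import re
from datetime import timedelta

# Assumed layout: optional days, hours and minutes, then mandatory seconds.
_DURATION = re.compile(
    r"(?:(\d+) days? )?(?:(\d+) hours? )?(?:(\d+) min )?(\d+) sec"
)

def parse_duration(text: str) -> timedelta:
    """Parse a string such as '11 days 21 hours 58 min 39 sec'."""
    match = _DURATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    days, hours, minutes, seconds = (int(g or 0) for g in match.groups())
    return timedelta(days=days, hours=hours, minutes=minutes, seconds=seconds)

run_time = parse_duration("13 days 21 hours 15 min 20 sec")
cpu_time = parse_duration("11 days 21 hours 58 min 39 sec")

print(int(run_time.total_seconds()))   # wall-clock seconds
print(int(cpu_time.total_seconds()))   # CPU seconds
print(round(cpu_time / run_time, 3))   # fraction of wall time spent computing
```

The CPU total in seconds can then be compared directly with the CPU Time (sec) column of the latest trickle.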
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:44:35 (10296): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
22:44:36 (10296): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:48:11 (3332): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:15:40 (3472): No heartbeat from core client for 30 sec - exiting
15:15:41 (3472): No heartbeat from core client for 30 sec - exiting
15:15:42 (3472): No heartbeat from core client for 30 sec - exiting
15:15:43 (3472): No heartbeat from core client for 30 sec - exiting
15:15:44 (3472): No heartbeat from core client for 30 sec - exiting
15:15:45 (3472): No heartbeat from core client for 30 sec - exiting
15:15:46 (3472): No heartbeat from core client for 30 sec - exiting
15:15:47 (3472): No heartbeat from core client for 30 sec - exiting
15:15:48 (3472): No heartbeat from core client for 30 sec - exiting
15:15:50 (3472): No heartbeat from core client for 30 sec - exiting
15:15:51 (3472): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3660, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
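Because the stderr log above repeats a handful of messages many times, a tally per message type is easier to read than the raw text. The following is a minimal sketch, assuming the only variable prefix is the "HH:MM:SS (pid): " stamp seen on the heartbeat lines; the function name `summarize` and the short sample are illustrative only.

```python
import re
from collections import Counter

# Assumed prefix format, e.g. "22:44:35 (10296): "
_STAMP = re.compile(r"^\d{2}:\d{2}:\d{2} \(\d+\): ")

def summarize(stderr_text: str) -> Counter:
    """Count log messages, ignoring the timestamp/PID prefix."""
    counts = Counter()
    for line in stderr_text.splitlines():
        line = line.strip()
        if line:
            counts[_STAMP.sub("", line)] += 1
    return counts

# A few lines copied from the log; paste the full stderr text in practice.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:44:35 (10296): No heartbeat from core client for 30 sec - exiting
22:44:36 (10296): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
"""

summary = summarize(sample)
for message, count in summary.most_common():
    print(count, message)
```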
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
22 Apr 2013 00:15:54  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  518,400   1,021,129       1.9698
20 Apr 2013 18:23:03  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  492,480   967,097         1.9637
19 Apr 2013 14:45:34  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  466,560   916,267         1.9639
17 Apr 2013 09:27:00  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  440,640   865,518         1.9642
16 Apr 2013 09:19:51  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  414,720   815,749         1.9670
15 Apr 2013 13:23:35  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  388,800   764,079         1.9652
14 Apr 2013 15:50:44  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  362,880   713,561         1.9664
13 Apr 2013 18:17:02  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  336,960   662,075         1.9648
12 Apr 2013 19:21:47  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  311,040   611,702         1.9666
11 Apr 2013 16:00:40  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  285,120   561,278         1.9686
06 Apr 2013 23:24:42  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  259,200   509,386         1.9652
05 Apr 2013 22:07:45  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  233,280   459,451         1.9695
04 Apr 2013 21:40:12  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  207,360   409,109         1.9729
03 Apr 2013 19:43:31  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  181,440   358,732         1.9771
01 Apr 2013 20:43:21  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  155,520   308,105         1.9811
31 Mar 2013 21:55:22  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  129,600   252,684         1.9497
30 Mar 2013 13:25:56  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  103,680   200,706         1.9358
29 Mar 2013 15:25:38  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  77,760    150,662         1.9375
28 Mar 2013 16:41:34  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  51,840    101,781         1.9634
27 Mar 2013 19:44:08  1260575  15684579   hadcm3n_u3wt_2020_40_008336365_0  25,920    52,473          2.0244
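
The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. A minimal sketch that checks this reading against three rows copied from the trickle table (the function name is hypothetical, and the formula is inferred from the figures rather than confirmed by the page):

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return cumulative CPU seconds per model timestep, to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, cpu_time_sec, reported_average) copied from the table
rows = [
    (518_400, 1_021_129, 1.9698),
    (51_840, 101_781, 1.9634),
    (25_920, 52_473, 2.0244),
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(timestep, computed, reported)
```

For these three rows the computed and reported values agree to four decimals.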

