climateprediction.net home page
Task 15749634

Task 15749634

Name hadcm3n_3mwt_1980_40_008334545_2
Workunit 8485406
Created 24 Apr 2013, 21:30:08 UTC
Sent 24 Apr 2013, 21:31:19 UTC
Report deadline 25 Jul 2013, 4:58:30 UTC
Received 30 May 2013, 23:34:49 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1253770
Run time 20 days 22 hours 50 min 33 sec
CPU time 20 days 13 hours 11 min 1 sec
Validate state Invalid
Credit 12,130.56
Device peak FLOPS 1.59 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
10:10:04 (10148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:50:09 (16140): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:17:24 (10192): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
23:22:24 (16496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Atmos Hold Restart file rename failed on atmos_restart.hold
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
13:58:07 (11500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
19:14:55 (5852): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:14:56 (5852): No heartbeat from core client for 30 sec - exiting
19:53:44 (21200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:53:45 (21200): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
22:00:47 (15844): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
00:51:59 (14756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:37:13 (7040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:37:14 (7040): No heartbeat from core client for 30 sec - exiting
09:37:16 (7040): No heartbeat from core client for 30 sec - exiting
09:37:17 (7040): No heartbeat from core client for 30 sec - exiting
09:37:18 (7040): No heartbeat from core client for 30 sec - exiting
09:37:19 (7040): No heartbeat from core client for 30 sec - exiting
09:37:20 (7040): No heartbeat from core client for 30 sec - exiting
09:37:21 (7040): No heartbeat from core client for 30 sec - exiting
09:37:22 (7040): No heartbeat from core client for 30 sec - exiting
09:37:23 (7040): No heartbeat from core client for 30 sec - exiting
09:37:24 (7040): No heartbeat from core client for 30 sec - exiting
Atmos Hold Restart file rename failed on atmos_restart.hold
17:22:33 (12808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:24:19 (67504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:56:07 (75812): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
12:33:22 (153356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:36:36 (173768): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:36:37 (173768): No heartbeat from core client for 30 sec - exiting
12:36:38 (173768): No heartbeat from core client for 30 sec - exiting
21:08:33 (174016): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
23:35:13 (116636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
18:28:47 (28732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
08:31:15 (34180): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16
BUFFIN: C I/O Error feof - Unit 64 - Return code = 16
BUFFIN: C I/O Error feof - Unit 65 - Return code = 16
BUFFIN: C I/O Error feof - Unit 66 - Return code = 16
BUFFIN: C I/O Error feof - Unit 67 - Return code = 16
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1116, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
25 May 2013 23:52:21 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 1,010,880 1,753,587 1.7347
24 May 2013 13:19:21 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 984,960 1,706,142 1.7322
23 May 2013 09:53:28 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 959,040 1,660,384 1.7313
22 May 2013 08:28:15 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 933,120 1,614,777 1.7305
21 May 2013 19:20:56 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 907,200 1,567,528 1.7279
21 May 2013 06:07:03 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 881,280 1,520,148 1.7249
20 May 2013 06:25:06 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 855,360 1,471,100 1.7199
19 May 2013 13:03:37 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 829,440 1,421,289 1.7136
18 May 2013 09:23:50 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 803,520 1,373,996 1.7100
17 May 2013 09:45:49 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 777,600 1,328,449 1.7084
16 May 2013 20:23:43 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 751,680 1,280,374 1.7033
16 May 2013 04:30:41 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 725,760 1,233,968 1.7002
15 May 2013 06:29:45 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 699,840 1,190,982 1.7018
14 May 2013 08:08:28 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 673,920 1,141,528 1.6939
13 May 2013 03:25:31 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 648,000 1,097,753 1.6941
12 May 2013 14:20:14 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 622,080 1,054,745 1.6955
11 May 2013 16:16:18 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 596,160 1,010,358 1.6948
10 May 2013 23:12:49 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 570,240 964,446 1.6913
09 May 2013 18:12:44 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 544,320 917,918 1.6864
08 May 2013 20:51:56 1253770 15749634 hadcm3n_3mwt_1980_40_008334545_2 518,400 872,968 1.6840


©2024 cpdn.org