climateprediction.net home page
Task 16075324

Task 16075324

Name hadcm3n_o8wi_1900_40_008466661_1
Workunit 8617500
Created 29 Oct 2013, 0:39:58 UTC
Sent 29 Oct 2013, 0:40:21 UTC
Report deadline 28 Jan 2014, 8:07:32 UTC
Received 22 Nov 2013, 1:16:15 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1290294
Run time 23 days 4 hours 15 min 8 sec
CPU time 19 days 11 hours 59 min
Validate state Invalid
Credit 11,508.48
Device peak FLOPS 2.17 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.64</core_client_version>
<![CDATA[
<message>
The device does not recognize the command.
 (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
20:36:43 (6260): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:10:49 (6684): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:38:07 (5928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:39:39 (7012): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
05:41:43 (980): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:51:49 (4932): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:32:17 (2476): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:37:46 (2080): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:24:05 (3740): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:50:29 (5456): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
19:49:30 (468): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:44:16 (1956): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
22:44:42 (10912): Can't acquire lockfile (32) - waiting 35s
Suspended CPDN Monitor - Suspend request from BOINC...
04:32:21 (10912): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:35:58 (5324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
16:27:16 (5388): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2372, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Nov 2013 18:30:22 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 959,040 1,668,598 1.7399
21 Nov 2013 03:06:56 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 933,120 1,623,331 1.7397
20 Nov 2013 08:11:34 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 907,200 1,579,126 1.7407
19 Nov 2013 15:57:35 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 881,280 1,535,129 1.7419
18 Nov 2013 22:21:48 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 855,360 1,490,598 1.7427
18 Nov 2013 05:08:52 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 829,440 1,445,362 1.7426
17 Nov 2013 14:59:31 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 803,520 1,400,509 1.7430
17 Nov 2013 00:17:02 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 777,600 1,355,722 1.7435
16 Nov 2013 09:35:34 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 751,680 1,310,522 1.7435
15 Nov 2013 19:16:51 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 725,760 1,265,484 1.7437
15 Nov 2013 05:25:15 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 699,840 1,219,847 1.7430
14 Nov 2013 14:09:54 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 673,920 1,174,665 1.7430
13 Nov 2013 19:44:49 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 648,000 1,129,195 1.7426
13 Nov 2013 06:05:43 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 622,080 1,082,896 1.7408
12 Nov 2013 15:25:10 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 596,160 1,036,841 1.7392
12 Nov 2013 01:45:14 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 570,240 990,950 1.7378
11 Nov 2013 12:52:40 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 544,320 947,700 1.7411
11 Nov 2013 00:04:33 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 518,400 904,053 1.7439
10 Nov 2013 08:52:56 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 492,480 858,497 1.7432
09 Nov 2013 19:09:44 1290294 16075324 hadcm3n_o8wi_1900_40_008466661_1 466,560 812,854 1.7422


©2024 climateprediction.net