Task 15437071

Name hadcm3n_zevj_1880_40_008239756_4
Workunit 8394880
Created 16 Nov 2012, 7:18:21 UTC
Sent 16 Nov 2012, 7:18:26 UTC
Report deadline 15 Feb 2013, 14:45:37 UTC
Received 19 Dec 2012, 17:04:52 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1120309
Run time 31 days 22 hours 1 min 33 sec
CPU time 18 days 15 hours 53 min 3 sec
Validate state Invalid
Credit 3,110.40
Device peak FLOPS 0.92 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
20:37:26 (168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:37:28 (168): No heartbeat from core client for 30 sec - exiting
20:37:29 (168): No heartbeat from core client for 30 sec - exiting
20:37:30 (168): No heartbeat from core client for 30 sec - exiting
20:37:31 (168): No heartbeat from core client for 30 sec - exiting
20:37:32 (168): No heartbeat from core client for 30 sec - exiting
20:37:34 (168): No heartbeat from core client for 30 sec - exiting
20:37:35 (168): No heartbeat from core client for 30 sec - exiting
21:46:27 (4496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:46:30 (4496): No heartbeat from core client for 30 sec - exiting
21:46:31 (4496): No heartbeat from core client for 30 sec - exiting
21:46:32 (4496): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
07:34:45 (6336): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:34:48 (6336): No heartbeat from core client for 30 sec - exiting
07:34:49 (6336): No heartbeat from core client for 30 sec - exiting
07:34:50 (6336): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:22:12 (2700): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:22:14 (2700): No heartbeat from core client for 30 sec - exiting
12:22:15 (2700): No heartbeat from core client for 30 sec - exiting
12:22:16 (2700): No heartbeat from core client for 30 sec - exiting
12:22:17 (2700): No heartbeat from core client for 30 sec - exiting
12:22:18 (2700): No heartbeat from core client for 30 sec - exiting
12:22:19 (2700): No heartbeat from core client for 30 sec - exiting
12:22:20 (2700): No heartbeat from core client for 30 sec - exiting
12:22:21 (2700): No heartbeat from core client for 30 sec - exiting
12:22:22 (2700): No heartbeat from core client for 30 sec - exiting
12:22:23 (2700): No heartbeat from core client for 30 sec - exiting
12:22:25 (2700): No heartbeat from core client for 30 sec - exiting
09:38:16 (1268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:56:42 (1356): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
19:56:45 (1356): No heartbeat from core client for 30 sec - exiting
19:56:46 (1356): No heartbeat from core client for 30 sec - exiting
19:56:47 (1356): No heartbeat from core client for 30 sec - exiting
19:56:48 (1356): No heartbeat from core client for 30 sec - exiting
19:56:49 (1356): No heartbeat from core client for 30 sec - exiting
20:29:32 (2416): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:29:34 (2416): No heartbeat from core client for 30 sec - exiting
20:29:36 (2416): No heartbeat from core client for 30 sec - exiting
04:14:26 (3596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:14:28 (3596): No heartbeat from core client for 30 sec - exiting
04:14:29 (3596): No heartbeat from core client for 30 sec - exiting
04:14:30 (3596): No heartbeat from core client for 30 sec - exiting
04:14:32 (3596): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
20:54:50 (2996): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
20:54:52 (2996): No heartbeat from core client for 30 sec - exiting
21:07:02 (4440): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:07:04 (4440): No heartbeat from core client for 30 sec - exiting
21:07:06 (4440): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
12:37:32 (5772): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:37:33 (5772): No heartbeat from core client for 30 sec - exiting
12:37:34 (5772): No heartbeat from core client for 30 sec - exiting
12:37:35 (5772): No heartbeat from core client for 30 sec - exiting
12:37:37 (5772): No heartbeat from core client for 30 sec - exiting
12:37:38 (5772): No heartbeat from core client for 30 sec - exiting
12:37:39 (5772): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
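The stderr output above repeats the same "No heartbeat" line many times. As a reading aid, the following minimal sketch tallies those lines by the number in parentheses, which is assumed here to be a process ID. The function name `count_heartbeat_timeouts` and the regular expression are illustrative and not part of BOINC or CPDN; the sample lines are copied from the log above.

```python
import re
from collections import Counter

# Matches lines such as:
#   20:37:26 (168): No heartbeat from core client for 30 sec - exiting
# Assumption: the parenthesised number is the ID of the reporting process.
HEARTBEAT_RE = re.compile(
    r"^(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)

def count_heartbeat_timeouts(lines):
    """Return a Counter mapping the parenthesised ID to its number of timeout lines."""
    counts = Counter()
    for line in lines:
        match = HEARTBEAT_RE.match(line.strip())
        if match:
            counts[int(match.group(2))] += 1
    return counts

# A few lines copied from the stderr output above.
sample = [
    "Suspended CPDN Monitor - Suspend request from BOINC...",
    "20:37:26 (168): No heartbeat from core client for 30 sec - exiting",
    "CPDN Monitor - No 'heartbeat' from BOINC...",
    "20:37:28 (168): No heartbeat from core client for 30 sec - exiting",
    "21:46:27 (4496): No heartbeat from core client for 30 sec - exiting",
]

counts = count_heartbeat_timeouts(sample)
for ident, n in sorted(counts.items()):
    print(f"{ident}: {n} timeout line(s)")
```

Run over the full stderr text, this kind of tally shows how many separate processes reported a lost heartbeat before the final "too many model crashes" exit.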
Latest Trickles Received
| Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
|---|---|---|---|---|---|---|
| 16 Dec 2012 08:19:30 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 259,200 | 1,540,205 | 5.9421 |
| 14 Dec 2012 16:42:04 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 233,280 | 1,392,510 | 5.9693 |
| 14 Dec 2012 16:42:04 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 207,360 | 1,237,765 | 5.9692 |
| 07 Dec 2012 17:23:38 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 181,440 | 1,077,139 | 5.9366 |
| 05 Dec 2012 07:24:10 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 155,520 | 927,849 | 5.9661 |
| 01 Dec 2012 18:18:29 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 129,600 | 768,507 | 5.9298 |
| 28 Nov 2012 17:13:55 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 103,680 | 615,042 | 5.9321 |
| 26 Nov 2012 18:10:01 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 77,760 | 468,207 | 6.0212 |
| 23 Nov 2012 10:00:48 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 51,840 | 312,696 | 6.0319 |
| 20 Nov 2012 05:55:58 | 1120309 | 15437071 | hadcm3n_zevj_1880_40_008239756_4 | 25,920 | 160,262 | 6.1829 |
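The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against four rows copied from the table; the helper name `seconds_per_timestep` is hypothetical, and rounding to four decimals is an assumption made to match the displayed precision.

```python
def seconds_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by timesteps completed, rounded to 4 decimals."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, CPU time in seconds, average shown in the table), copied from the rows above.
rows = [
    (259_200, 1_540_205, 5.9421),
    (233_280, 1_392_510, 5.9693),
    (207_360, 1_237_765, 5.9692),
    (25_920, 160_262, 6.1829),
]

for timestep, cpu_time, shown in rows:
    computed = seconds_per_timestep(cpu_time, timestep)
    status = "matches" if abs(computed - shown) < 1e-9 else "differs from"
    print(f"timestep {timestep}: computed {computed:.4f} {status} table value {shown:.4f}")
```

Each trickle advances the timestep count by 25,920, so the same division can be applied to any row of the table.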

