climateprediction.net home page
Task 15678828

Task 15678828

Name hadcm3n_zi87_1960_40_008335440_0
Workunit 8486301
Created 23 Mar 2013, 1:55:10 UTC
Sent 23 Mar 2013, 2:04:32 UTC
Report deadline 22 Jun 2013, 9:31:43 UTC
Received 12 Apr 2013, 1:02:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1027182
Run time 14 days 20 hours 44 min 39 sec
CPU time 12 days 23 hours 23 min 15 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.54 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:15:04 (12156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:15:05 (12156): No heartbeat from core client for 30 sec - exiting
04:15:07 (12156): No heartbeat from core client for 30 sec - exiting
04:15:08 (12156): No heartbeat from core client for 30 sec - exiting
04:15:09 (12156): No heartbeat from core client for 30 sec - exiting
11:32:45 (6220): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:43:39 (3508): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:31:59 (6568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
07:16:51 (7500): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:49:47 (9324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:56:52 (9808): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:51:59 (8960): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:28:20 (11196): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:20:16 (11000): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:00:33 (4776): No heartbeat from core client for 30 sec - exiting
19:00:34 (4776): No heartbeat from core client for 30 sec - exiting
19:00:35 (4776): No heartbeat from core client for 30 sec - exiting
19:00:36 (4776): No heartbeat from core client for 30 sec - exiting
19:00:37 (4776): No heartbeat from core client for 30 sec - exiting
19:00:38 (4776): No heartbeat from core client for 30 sec - exiting
19:00:39 (4776): No heartbeat from core client for 30 sec - exiting
19:00:40 (4776): No heartbeat from core client for 30 sec - exiting
19:00:42 (4776): No heartbeat from core client for 30 sec - exiting
19:00:43 (4776): No heartbeat from core client for 30 sec - exiting
19:00:44 (4776): No heartbeat from core client for 30 sec - exiting
19:00:45 (4776): No heartbeat from core client for 30 sec - exiting
19:00:46 (4776): No heartbeat from core client for 30 sec - exiting
19:00:47 (4776): No heartbeat from core client for 30 sec - exiting
19:00:48 (4776): No heartbeat from core client for 30 sec - exiting
19:00:49 (4776): No heartbeat from core client for 30 sec - exiting
19:00:50 (4776): No heartbeat from core client for 30 sec - exiting
19:00:51 (4776): No heartbeat from core client for 30 sec - exiting
19:00:52 (4776): No heartbeat from core client for 30 sec - exiting
19:00:54 (4776): No heartbeat from core client for 30 sec - exiting
19:00:55 (4776): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Apr 2013 00:16:23 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 648,000 1,119,158 1.7271
11 Apr 2013 10:39:36 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 622,080 1,078,803 1.7342
10 Apr 2013 21:14:51 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 596,160 1,034,641 1.7355
10 Apr 2013 08:07:59 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 570,240 990,579 1.7371
09 Apr 2013 04:28:29 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 544,320 946,140 1.7382
08 Apr 2013 14:05:13 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 518,400 899,892 1.7359
07 Apr 2013 22:23:10 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 492,480 853,193 1.7324
07 Apr 2013 08:22:46 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 466,560 808,017 1.7319
06 Apr 2013 14:30:28 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 440,640 764,694 1.7354
06 Apr 2013 02:21:03 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 414,720 722,556 1.7423
05 Apr 2013 11:20:50 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 388,800 679,516 1.7477
04 Apr 2013 05:19:18 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 362,880 636,910 1.7552
03 Apr 2013 04:48:57 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 336,960 592,645 1.7588
01 Apr 2013 17:11:36 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 311,040 547,812 1.7612
01 Apr 2013 02:02:05 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 285,120 502,071 1.7609
31 Mar 2013 11:40:02 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 259,200 455,676 1.7580
30 Mar 2013 21:20:23 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 233,280 410,122 1.7581
30 Mar 2013 06:13:36 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 207,360 363,313 1.7521
29 Mar 2013 14:50:31 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 181,440 318,234 1.7539
28 Mar 2013 23:56:12 1027182 15678828 hadcm3n_zi87_1960_40_008335440_0 155,520 273,484 1.7585


©2024 cpdn.org