climateprediction.net home page
Task 15282083

Task 15282083

Name hadcm3n_zj4y_1880_40_008201411_0
Workunit 8356535
Created 13 Sep 2012, 13:03:14 UTC
Sent 13 Sep 2012, 13:09:38 UTC
Report deadline 13 Dec 2012, 20:36:49 UTC
Received 28 Nov 2012, 0:15:00 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1040491
Run time 18 days 8 hours 0 min 22 sec
CPU time 16 days 6 hours 29 min 47 sec
Validate state Invalid
Credit 7,776.00
Device peak FLOPS 2.21 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:16:31 (948): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:57:42 (4040): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4316, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3452, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
29 Oct 2012 14:07:09 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 648,000 1,394,356 2.1518
28 Oct 2012 22:15:27 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 622,080 1,339,837 2.1538
28 Oct 2012 05:18:34 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 596,160 1,281,943 2.1503
27 Oct 2012 11:49:44 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 570,240 1,225,845 2.1497
26 Oct 2012 16:59:14 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 544,320 1,170,706 2.1508
26 Oct 2012 00:52:17 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 518,400 1,116,316 2.1534
25 Oct 2012 05:59:32 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 492,480 1,059,633 2.1516
23 Oct 2012 17:29:52 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 466,560 1,005,714 2.1556
22 Oct 2012 21:30:20 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 440,640 952,671 2.1620
22 Oct 2012 04:57:13 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 414,720 898,884 2.1674
21 Oct 2012 12:10:20 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 388,800 844,308 2.1716
20 Oct 2012 15:27:26 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 362,880 789,505 2.1757
19 Oct 2012 22:29:52 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 336,960 731,855 2.1719
19 Oct 2012 05:00:54 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 311,040 675,463 2.1716
18 Oct 2012 06:11:37 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 285,120 618,834 2.1704
23 Sep 2012 00:24:07 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 259,200 563,366 2.1735
22 Sep 2012 06:37:31 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 233,280 507,314 2.1747
21 Sep 2012 11:43:29 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 207,360 449,869 2.1695
20 Sep 2012 18:38:55 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 181,440 393,285 2.1676
20 Sep 2012 00:09:21 1040491 15282083 hadcm3n_zj4y_1880_40_008201411_0 155,520 337,122 2.1677


©2024 cpdn.org