climateprediction.net home page
Task 15477772

Task 15477772

Name hadcm3n_zkj3_1880_40_008245837_3
Workunit 8400961
Created 14 Dec 2012, 9:58:17 UTC
Sent 14 Dec 2012, 9:58:23 UTC
Report deadline 15 Mar 2013, 17:25:34 UTC
Received 7 Jan 2013, 22:45:10 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1257736
Run time 9 days 2 hours 39 min 31 sec
CPU time 8 days 2 hours 30 min 22 sec
Validate state Invalid
Credit 7,153.92
Device peak FLOPS 4.31 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3248, iMonCtr=1
Model crash detected, will try to restart...
18:05:18 (3460): No heartbeat from core client for 30 sec - exiting
18:05:19 (3460): No heartbeat from core client for 30 sec - exiting
18:05:20 (3460): No heartbeat from core client for 30 sec - exiting
18:05:21 (3460): No heartbeat from core client for 30 sec - exiting
18:05:22 (3460): No heartbeat from core client for 30 sec - exiting
18:05:23 (3460): No heartbeat from core client for 30 sec - exiting
18:05:24 (3460): No heartbeat from core client for 30 sec - exiting
18:05:25 (3460): No heartbeat from core client for 30 sec - exiting
18:05:26 (3460): No heartbeat from core client for 30 sec - exiting
18:05:27 (3460): No heartbeat from core client for 30 sec - exiting
18:05:28 (3460): No heartbeat from core client for 30 sec - exiting
18:05:29 (3460): No heartbeat from core client for 30 sec - exiting
18:05:30 (3460): No heartbeat from core client for 30 sec - exiting
18:05:31 (3460): No heartbeat from core client for 30 sec - exiting
18:05:32 (3460): No heartbeat from core client for 30 sec - exiting
18:05:33 (3460): No heartbeat from core client for 30 sec - exiting
18:05:34 (3460): No heartbeat from core client for 30 sec - exiting
18:05:35 (3460): No heartbeat from core client for 30 sec - exiting
18:05:36 (3460): No heartbeat from core client for 30 sec - exiting
18:05:37 (3460): No heartbeat from core client for 30 sec - exiting
18:05:38 (3460): No heartbeat from core client for 30 sec - exiting
18:05:39 (3460): No heartbeat from core client for 30 sec - exiting
18:05:40 (3460): No heartbeat from core client for 30 sec - exiting
18:05:41 (3460): No heartbeat from core client for 30 sec - exiting
18:05:42 (3460): No heartbeat from core client for 30 sec - exiting
18:05:43 (3460): No heartbeat from core client for 30 sec - exiting
18:05:44 (3460): No heartbeat from core client for 30 sec - exiting
18:05:45 (3460): No heartbeat from core client for 30 sec - exiting
18:05:46 (3460): No heartbeat from core client for 30 sec - exiting
18:05:47 (3460): No heartbeat from core client for 30 sec - exiting
18:05:48 (3460): No heartbeat from core client for 30 sec - exiting
18:05:49 (3460): No heartbeat from core client for 30 sec - exiting
18:05:50 (3460): No heartbeat from core client for 30 sec - exiting
18:05:51 (3460): No heartbeat from core client for 30 sec - exiting
18:05:52 (3460): No heartbeat from core client for 30 sec - exiting
18:05:53 (3460): No heartbeat from core client for 30 sec - exiting
18:05:54 (3460): No heartbeat from core client for 30 sec - exiting
18:05:55 (3460): No heartbeat from core client for 30 sec - exiting
18:05:56 (3460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3500, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
02 Jan 2013 18:32:49 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 596,160 689,210 1.1561
02 Jan 2013 12:45:35 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 570,240 665,078 1.1663
24 Dec 2012 17:26:35 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 544,320 635,678 1.1678
24 Dec 2012 08:37:49 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 518,400 604,701 1.1665
23 Dec 2012 23:49:00 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 492,480 573,683 1.1649
23 Dec 2012 14:59:30 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 466,560 542,815 1.1634
23 Dec 2012 06:12:31 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 440,640 511,864 1.1616
22 Dec 2012 14:20:27 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 414,720 480,915 1.1596
22 Dec 2012 02:03:01 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 388,800 449,828 1.1570
21 Dec 2012 00:53:20 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 362,880 418,372 1.1529
20 Dec 2012 15:57:34 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 336,960 386,673 1.1475
20 Dec 2012 07:05:50 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 311,040 355,017 1.1414
19 Dec 2012 22:13:59 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 285,120 323,684 1.1353
19 Dec 2012 13:32:49 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 259,200 292,813 1.1297
19 Dec 2012 04:51:16 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 233,280 261,895 1.1227
18 Dec 2012 21:14:03 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 207,360 231,174 1.1148
18 Dec 2012 12:30:11 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 181,440 200,469 1.1049
18 Dec 2012 03:45:48 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 155,520 169,819 1.0919
16 Dec 2012 17:26:49 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 129,600 140,167 1.0815
16 Dec 2012 09:24:42 1257736 15477772 hadcm3n_zkj3_1880_40_008245837_3 103,680 112,008 1.0803


©2024 climateprediction.net