climateprediction.net home page
Task 15446013

Task 15446013

Name hadcm3n_zf94_1880_40_008248032_1
Workunit 8403156
Created 21 Nov 2012, 11:23:20 UTC
Sent 21 Nov 2012, 11:23:42 UTC
Report deadline 20 Feb 2013, 18:50:53 UTC
Received 17 Dec 2012, 22:45:09 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1224742
Run time 19 days 14 hours 36 min 5 sec
CPU time 10 days 4 hours 19 min 47 sec
Validate state Invalid
Credit 9,020.16
Device peak FLOPS 3.30 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
00:18:12 (2084): No heartbeat from core client for 30 sec - exiting
00:18:13 (2084): No heartbeat from core client for 30 sec - exiting
00:18:14 (2084): No heartbeat from core client for 30 sec - exiting
00:18:15 (2084): No heartbeat from core client for 30 sec - exiting
00:18:16 (2084): No heartbeat from core client for 30 sec - exiting
00:18:17 (2084): No heartbeat from core client for 30 sec - exiting
00:18:18 (2084): No heartbeat from core client for 30 sec - exiting
00:18:19 (2084): No heartbeat from core client for 30 sec - exiting
00:18:20 (2084): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
17:38:34 (4564): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3808, iMonCtr=1
Model crash detected, will try to restart...
23:46:12 (6112): No heartbeat from core client for 30 sec - exiting
23:46:13 (6112): No heartbeat from core client for 30 sec - exiting
23:46:14 (6112): No heartbeat from core client for 30 sec - exiting
23:46:15 (6112): No heartbeat from core client for 30 sec - exiting
23:46:16 (6112): No heartbeat from core client for 30 sec - exiting
23:46:17 (6112): No heartbeat from core client for 30 sec - exiting
23:46:18 (6112): No heartbeat from core client for 30 sec - exiting
23:46:19 (6112): No heartbeat from core client for 30 sec - exiting
23:46:20 (6112): No heartbeat from core client for 30 sec - exiting
23:46:21 (6112): No heartbeat from core client for 30 sec - exiting
23:46:22 (6112): No heartbeat from core client for 30 sec - exiting
23:46:23 (6112): No heartbeat from core client for 30 sec - exiting
23:46:24 (6112): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
09:59:40 (3756): No heartbeat from core client for 30 sec - exiting
09:59:41 (3756): No heartbeat from core client for 30 sec - exiting
09:59:42 (3756): No heartbeat from core client for 30 sec - exiting
09:59:43 (3756): No heartbeat from core client for 30 sec - exiting
09:59:44 (3756): No heartbeat from core client for 30 sec - exiting
09:59:45 (3756): No heartbeat from core client for 30 sec - exiting
09:59:46 (3756): No heartbeat from core client for 30 sec - exiting
09:59:47 (3756): No heartbeat from core client for 30 sec - exiting
09:59:48 (3756): No heartbeat from core client for 30 sec - exiting
09:59:49 (3756): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
04:15:24 (4548): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
10:11:42 (6952): No heartbeat from core client for 30 sec - exiting
10:11:43 (6952): No heartbeat from core client for 30 sec - exiting
10:11:44 (6952): No heartbeat from core client for 30 sec - exiting
10:11:45 (6952): No heartbeat from core client for 30 sec - exiting
10:11:46 (6952): No heartbeat from core client for 30 sec - exiting
10:11:47 (6952): No heartbeat from core client for 30 sec - exiting
10:11:48 (6952): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
22:39:15 (1328): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
17 Dec 2012 08:49:41 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 751,680 856,760 1.1398
16 Dec 2012 18:09:06 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 725,760 828,933 1.1422
16 Dec 2012 03:48:38 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 699,840 799,943 1.1430
15 Dec 2012 06:23:06 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 673,920 770,554 1.1434
14 Dec 2012 16:05:21 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 648,000 741,084 1.1436
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 622,080 711,871 1.1443
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 596,160 682,590 1.1450
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 570,240 653,177 1.1454
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 544,320 624,095 1.1466
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 518,400 594,934 1.1476
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 492,480 566,749 1.1508
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 466,560 538,903 1.1551
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 440,640 510,740 1.1591
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 414,720 481,156 1.1602
14 Dec 2012 09:30:46 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 388,800 451,949 1.1624
07 Dec 2012 22:34:12 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 362,880 422,948 1.1655
07 Dec 2012 06:11:32 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 336,960 393,841 1.1688
06 Dec 2012 15:30:21 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 311,040 364,492 1.1718
05 Dec 2012 23:41:34 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 285,120 335,478 1.1766
29 Nov 2012 18:17:34 1224742 15446013 hadcm3n_zf94_1880_40_008248032_1 259,200 306,247 1.1815


©2024 cpdn.org