Task 13761366

Name hadcm3n_yckg_1940_40_007537272_4
Workunit 7734504
Created 10 Dec 2011, 21:27:42 UTC
Sent 10 Dec 2011, 21:28:33 UTC
Report deadline 11 Mar 2012, 4:55:44 UTC
Received 26 Jan 2012, 19:38:11 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1105487
Run time 20 days 18 hours 40 min 49 sec
CPU time 19 days 13 hours 13 min 5 sec
Validate state Invalid
Credit 11,508.48
Device peak FLOPS 3.05 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.12.34</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:57:43 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
14:57:44 (4324): No heartbeat from core client for 30 sec - exiting
14:57:45 (4324): No heartbeat from core client for 30 sec - exiting
14:57:46 (4324): No heartbeat from core client for 30 sec - exiting
14:57:47 (4324): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
10:13:36 (2556): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
21:25:08 (1568): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:25:09 (1568): No heartbeat from core client for 30 sec - exiting
21:25:10 (1568): No heartbeat from core client for 30 sec - exiting
21:25:11 (1568): No heartbeat from core client for 30 sec - exiting
21:25:12 (1568): No heartbeat from core client for 30 sec - exiting
21:25:14 (1568): No heartbeat from core client for 30 sec - exiting
21:25:15 (1568): No heartbeat from core client for 30 sec - exiting
21:25:16 (1568): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
15:44:51 (4428): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
10:27:53 (2976): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
20:24:37 (3800): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:22:18 (1120): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
18:50:27 (4172): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
19:41:57 (3860): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
08:02:17 (4232): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
08:02:19 (4232): No heartbeat from core client for 30 sec - exiting
08:02:20 (4232): No heartbeat from core client for 30 sec - exiting
08:02:21 (4232): No heartbeat from core client for 30 sec - exiting
08:02:22 (4232): No heartbeat from core client for 30 sec - exiting
08:02:23 (4232): No heartbeat from core client for 30 sec - exiting
19:31:43 (2624): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
18:34:10 (1764): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:34:11 (1764): No heartbeat from core client for 30 sec - exiting
18:34:12 (1764): No heartbeat from core client for 30 sec - exiting
18:34:14 (1764): No heartbeat from core client for 30 sec - exiting
18:34:15 (1764): No heartbeat from core client for 30 sec - exiting
18:34:16 (1764): No heartbeat from core client for 30 sec - exiting
18:34:17 (1764): No heartbeat from core client for 30 sec - exiting
18:34:18 (1764): No heartbeat from core client for 30 sec - exiting
18:34:19 (1764): No heartbeat from core client for 30 sec - exiting
18:34:20 (1764): No heartbeat from core client for 30 sec - exiting
18:34:21 (1764): No heartbeat from core client for 30 sec - exiting
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4788, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
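The stderr text above mixes a few recurring message types: suspend requests, quit requests, lost heartbeats, and the six "Model crash detected" lines that precede "Sorry, too many model crashes!". A minimal sketch of how such a log could be tallied follows. The `PATTERNS` table and the `tally` helper are hypothetical names chosen for illustration, the patterns are assumptions based only on the wording seen in this log, and the sample is a short excerpt rather than the full output.

```python
import re
from collections import Counter

# Assumed patterns, derived only from the message wording in this log.
PATTERNS = {
    "suspend": re.compile(r"Suspend request from BOINC"),
    "quit": re.compile(r"Quit request from BOINC"),
    "no_heartbeat": re.compile(r"No heartbeat from core client"),
    "model_crash": re.compile(r"Model crash detected"),
}

def tally(stderr_text: str) -> Counter:
    """Count how many lines match each known message type (hypothetical helper)."""
    counts = Counter()
    for line in stderr_text.splitlines():
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
                break  # a line is counted under at most one label
    return counts

# Short excerpt copied from the stderr text; not the full log.
sample = """\
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
14:57:43 (4324): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""

counts = tally(sample)
for label in PATTERNS:
    print(label, counts[label])
```

The monitor's own "No 'heartbeat' from BOINC" line is deliberately left uncounted here so that each lost heartbeat is not tallied twice; that choice is an assumption, not something the log itself dictates.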
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                       Timestep  CPU Time (sec)  Average (sec/TS)
26 Jan 2012 15:04:37  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  959,040   1,674,420       1.7459
25 Jan 2012 17:47:03  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  933,120   1,628,212       1.7449
24 Jan 2012 20:15:37  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  907,200   1,581,147       1.7429
24 Jan 2012 06:36:17  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  881,280   1,534,878       1.7416
23 Jan 2012 15:52:03  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  855,360   1,488,718       1.7405
22 Jan 2012 18:08:22  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  829,440   1,442,484       1.7391
21 Jan 2012 19:27:35  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  803,520   1,396,194       1.7376
20 Jan 2012 21:27:34  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  777,600   1,349,060       1.7349
19 Jan 2012 23:12:35  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  751,680   1,303,386       1.7340
19 Jan 2012 09:16:38  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  725,760   1,257,117       1.7321
18 Jan 2012 12:19:57  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  699,840   1,211,266       1.7308
17 Jan 2012 23:11:52  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  673,920   1,165,395       1.7293
17 Jan 2012 08:58:11  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  648,000   1,119,890       1.7282
16 Jan 2012 19:59:57  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  622,080   1,073,806       1.7262
16 Jan 2012 06:39:09  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  596,160   1,027,891       1.7242
15 Jan 2012 17:12:13  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  570,240   981,760         1.7217
15 Jan 2012 03:04:16  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  544,320   936,407         1.7203
14 Jan 2012 14:03:16  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  518,400   890,225         1.7173
13 Jan 2012 23:55:08  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  492,480   844,181         1.7141
13 Jan 2012 10:53:09  1105487  13761366   hadcm3n_yckg_1940_40_007537272_4  466,560   797,735         1.7098
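
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against the newest and oldest rows. The function name `average_sec_per_timestep` is hypothetical, and rounding to four decimal places is an assumption (the site may truncate instead, which gives the same values for these two rows).

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places (assumed)."""
    return round(cpu_time_sec / timestep, 4)

# (timestep, cpu_time_sec, reported average) copied from the table:
# the 26 Jan 2012 15:04:37 row and the 13 Jan 2012 10:53:09 row.
rows = [
    (959_040, 1_674_420, 1.7459),
    (466_560, 797_735, 1.7098),
]

for timestep, cpu_time_sec, reported in rows:
    computed = average_sec_per_timestep(cpu_time_sec, timestep)
    print(f"timestep={timestep} computed={computed:.4f} reported={reported:.4f}")
```

Both rows reproduce the reported figure under this assumption, which suggests the average is cumulative over the whole run rather than measured per trickle interval.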

