climateprediction.net home page
Task 15494935

Task 15494935

Name hadcm3n_3erj_1940_40_008265706_0
Workunit 8420830
Created 21 Dec 2012, 13:36:03 UTC
Sent 22 Dec 2012, 18:27:58 UTC
Report deadline 24 Mar 2013, 1:55:09 UTC
Received 13 Jan 2013, 3:01:51 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1003311
Run time 20 days 23 hours 10 min 56 sec
CPU time 16 days 15 hours 59 min 55 sec
Validate state Invalid
Credit 10,575.36
Device peak FLOPS 2.51 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
14:54:47 (4156): No heartbeat from core client for 30 sec - exiting
14:54:48 (4156): No heartbeat from core client for 30 sec - exiting
14:54:49 (4156): No heartbeat from core client for 30 sec - exiting
14:54:50 (4156): No heartbeat from core client for 30 sec - exiting
14:54:51 (4156): No heartbeat from core client for 30 sec - exiting
14:54:52 (4156): No heartbeat from core client for 30 sec - exiting
14:54:53 (4156): No heartbeat from core client for 30 sec - exiting
14:54:54 (4156): No heartbeat from core client for 30 sec - exiting
14:54:55 (4156): No heartbeat from core client for 30 sec - exiting
14:54:56 (4156): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:39:04 (1872): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2168, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Signal 11 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5508, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Jan 2013 20:41:08 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 881,280 1,620,338 1.8386
12 Jan 2013 06:28:54 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 855,360 1,575,562 1.8420
11 Jan 2013 16:40:53 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 829,440 1,530,926 1.8457
11 Jan 2013 02:38:16 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 803,520 1,486,184 1.8496
10 Jan 2013 13:25:20 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 777,600 1,441,494 1.8538
09 Jan 2013 22:06:36 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 751,680 1,393,581 1.8540
09 Jan 2013 07:24:58 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 725,760 1,345,979 1.8546
08 Jan 2013 17:01:10 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 699,840 1,298,312 1.8552
08 Jan 2013 02:50:45 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 673,920 1,250,832 1.8561
07 Jan 2013 12:30:14 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 648,000 1,203,216 1.8568
06 Jan 2013 22:08:23 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 622,080 1,155,364 1.8573
06 Jan 2013 05:45:32 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 596,160 1,106,840 1.8566
05 Jan 2013 14:05:35 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 570,240 1,056,828 1.8533
04 Jan 2013 23:22:13 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 544,320 1,006,515 1.8491
04 Jan 2013 07:30:00 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 518,400 956,371 1.8449
03 Jan 2013 16:11:02 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 492,480 906,106 1.8399
02 Jan 2013 22:08:28 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 466,560 857,091 1.8370
02 Jan 2013 07:59:55 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 440,640 809,049 1.8361
01 Jan 2013 17:37:34 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 414,720 760,971 1.8349
01 Jan 2013 02:50:05 1003311 15494935 hadcm3n_3erj_1940_40_008265706_0 388,800 712,912 1.8336


©2024 climateprediction.net