Task 13665461

Name hadcm3n_o7ex_1980_40_007540450_4
Workunit 7737682
Created 28 Nov 2011, 6:15:04 UTC
Sent 28 Nov 2011, 6:15:12 UTC
Report deadline 27 Feb 2012, 13:42:23 UTC
Received 16 Dec 2011, 2:18:18 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1033432
Run time 10 days 7 hours 42 min 49 sec
CPU time 8 days 4 hours 29 min 43 sec
Validate state Invalid
Credit 4,665.60
Device peak FLOPS 2.53 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07 (windows_intelx86)
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
20:56:54 (12252): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
07:55:55 (24168): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
14:47:35 (3060): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
23:46:34 (11596): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
18:12:00 (29732): No heartbeat from core client for 30 sec - exiting
18:12:01 (29732): No heartbeat from core client for 30 sec - exiting
18:12:02 (29732): No heartbeat from core client for 30 sec - exiting
18:12:03 (29732): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
06:12:29 (2512): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:11:10 (14104): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:10:10 (17268): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
10:09:06 (26188): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
00:07:54 (48008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
00:06:49 (67128): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:00:27 (42292): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
04:59:25 (35148): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
12:58:21 (46720): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
02:30:36 (4404): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18624, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18624, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8676, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8676, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3132, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Dec 2011 22:38:59 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 388,800 673,590 1.7325
10 Dec 2011 04:35:11 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 362,880 629,275 1.7341
08 Dec 2011 07:22:47 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 336,960 584,591 1.7349
07 Dec 2011 07:05:32 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 311,040 543,025 1.7458
06 Dec 2011 06:29:08 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 285,120 499,626 1.7523
05 Dec 2011 05:35:12 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 259,200 450,657 1.7386
04 Dec 2011 08:49:56 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 233,280 406,945 1.7444
03 Dec 2011 15:48:02 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 207,360 362,562 1.7485
03 Dec 2011 00:49:12 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 181,440 317,793 1.7515
02 Dec 2011 10:12:20 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 155,520 272,365 1.7513
01 Dec 2011 19:07:23 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 129,600 227,982 1.7591
01 Dec 2011 04:34:18 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 103,680 184,355 1.7781
30 Nov 2011 04:20:12 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 77,760 137,610 1.7697
29 Nov 2011 12:51:39 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 51,840 90,038 1.7368
28 Nov 2011 21:26:56 1033432 13665461 hadcm3n_o7ex_1980_40_007540450_4 25,920 44,244 1.7069
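
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against a few rows copied from the table above. The function name `average_sec_per_ts` and the rounding rule are assumptions made for illustration, not something the page states.

```python
def average_sec_per_ts(cpu_time_sec: str, timestep: str) -> float:
    """Return CPU seconds per timestep, rounded to 4 decimal places.

    Both arguments are taken as they appear in the table, with
    thousands separators (e.g. "673,590"), so the commas are stripped
    before converting to integers.
    """
    cpu = int(cpu_time_sec.replace(",", ""))
    steps = int(timestep.replace(",", ""))
    return round(cpu / steps, 4)


# (timestep, CPU time in seconds, average shown on the page),
# copied from three rows of the trickle table.
rows = [
    ("388,800", "673,590", 1.7325),
    ("362,880", "629,275", 1.7341),
    ("25,920", "44,244", 1.7069),
]

for timestep, cpu_time, shown in rows:
    computed = average_sec_per_ts(cpu_time, timestep)
    print(timestep, cpu_time, computed, "matches" if computed == shown else "differs")
```

If this reading is right, the column is a running average over the whole task rather than a per-trickle rate, which would explain why it changes so little from row to row.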