climateprediction.net home page
Task 15313001

Task 15313001

Name hadcm3n_z8hn_1880_40_008199038_3
Workunit 8354162
Created 27 Sep 2012, 2:45:06 UTC
Sent 27 Sep 2012, 2:49:38 UTC
Report deadline 27 Dec 2012, 10:16:49 UTC
Received 5 Oct 2012, 19:46:53 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1176093
Run time 6 days 18 hours 22 min 11 sec
CPU time 6 days 16 hours 6 min 58 sec
Validate state Invalid
Credit 8,398.08
Device peak FLOPS 3.36 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
11:12:25 (4936): No heartbeat from core client for 30 sec - exiting
11:12:26 (4936): No heartbeat from core client for 30 sec - exiting
11:12:27 (4936): No heartbeat from core client for 30 sec - exiting
11:12:28 (4936): No heartbeat from core client for 30 sec - exiting
11:12:29 (4936): No heartbeat from core client for 30 sec - exiting
11:12:30 (4936): No heartbeat from core client for 30 sec - exiting
11:12:31 (4936): No heartbeat from core client for 30 sec - exiting
11:12:32 (4936): No heartbeat from core client for 30 sec - exiting
11:12:33 (4936): No heartbeat from core client for 30 sec - exiting
11:12:34 (4936): No heartbeat from core client for 30 sec - exiting
11:12:35 (4936): No heartbeat from core client for 30 sec - exiting
11:12:36 (4936): No heartbeat from core client for 30 sec - exiting
11:12:37 (4936): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5780, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5780, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3988, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5524, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Oct 2012 18:34:42 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 699,840 573,139 0.8190
04 Oct 2012 09:38:16 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 673,920 552,388 0.8197
04 Oct 2012 02:46:48 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 648,000 531,180 0.8197
03 Oct 2012 20:50:09 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 622,080 510,411 0.8205
03 Oct 2012 11:11:57 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 596,160 489,276 0.8207
03 Oct 2012 00:34:50 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 570,240 467,404 0.8197
02 Oct 2012 17:43:34 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 544,320 446,968 0.8211
02 Oct 2012 05:56:24 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 518,400 425,212 0.8202
01 Oct 2012 23:34:36 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 492,480 404,569 0.8215
01 Oct 2012 16:17:06 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 466,560 383,523 0.8220
01 Oct 2012 09:00:26 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 440,640 362,773 0.8233
01 Oct 2012 02:54:13 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 414,720 341,097 0.8225
30 Sep 2012 20:52:35 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 388,800 319,292 0.8212
30 Sep 2012 15:07:44 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 362,880 298,931 0.8238
30 Sep 2012 09:21:40 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 336,960 278,054 0.8252
30 Sep 2012 03:20:34 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 311,040 256,842 0.8258
29 Sep 2012 21:38:57 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 285,120 236,326 0.8289
29 Sep 2012 15:52:20 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 259,200 215,554 0.8316
29 Sep 2012 09:51:04 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 233,280 194,114 0.8321
29 Sep 2012 03:59:50 1176093 15313001 hadcm3n_z8hn_1880_40_008199038_3 207,360 173,191 0.8352


©2024 cpdn.org