climateprediction.net home page
Task 11039256

Task 11039256

Name hadsm3dhet2_jqti_006597752_8
Workunit 6801125
Created 15 Mar 2010, 12:04:14 UTC
Sent 26 Sep 2010, 23:44:58 UTC
Report deadline 9 Sep 2011, 5:04:58 UTC
Received 16 Oct 2010, 8:46:48 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1103559
Run time 15 days 7 hours 16 min 46 sec
CPU time 11 days 2 hours 56 min 50 sec
Validate state Invalid
Credit 6,351.58
Device peak FLOPS 1.94 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5252, iMonCtr=1
Model crash detected, will try to restart...
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
MainError:	06:03:51 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4716, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6160, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3380, iMonCtr=1
Model crash detected, will try to restart...
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
MainError:	07:08:38 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4304, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4188, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: 7R

Model crashed: 7R

Model crashed: 7R

Model crashed: 7R

Model crashed: 7R

Model crashed: 7R
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
16 Oct 2010 07:47:19 1103559 11039256 hadsm3dhet2_jqti_006597752_8 172,832 958,575 1.3866
16 Oct 2010 03:22:38 1103559 11039256 hadsm3dhet2_jqti_006597752_8 162,030 943,441 1.3863
15 Oct 2010 18:04:26 1103559 11039256 hadsm3dhet2_jqti_006597752_8 151,228 928,595 1.3865
15 Oct 2010 09:35:58 1103559 11039256 hadsm3dhet2_jqti_006597752_8 140,426 913,919 1.3870
15 Oct 2010 03:23:30 1103559 11039256 hadsm3dhet2_jqti_006597752_8 129,624 899,093 1.3872
14 Oct 2010 19:55:51 1103559 11039256 hadsm3dhet2_jqti_006597752_8 118,822 884,268 1.3875
14 Oct 2010 13:19:46 1103559 11039256 hadsm3dhet2_jqti_006597752_8 108,020 869,437 1.3877
14 Oct 2010 07:33:33 1103559 11039256 hadsm3dhet2_jqti_006597752_8 97,218 854,731 1.3882
13 Oct 2010 20:36:43 1103559 11039256 hadsm3dhet2_jqti_006597752_8 86,416 839,944 1.3885
13 Oct 2010 15:12:54 1103559 11039256 hadsm3dhet2_jqti_006597752_8 75,614 824,574 1.3879
13 Oct 2010 04:46:52 1103559 11039256 hadsm3dhet2_jqti_006597752_8 64,812 809,357 1.3875
12 Oct 2010 20:12:30 1103559 11039256 hadsm3dhet2_jqti_006597752_8 54,010 794,583 1.3879
12 Oct 2010 12:28:06 1103559 11039256 hadsm3dhet2_jqti_006597752_8 43,208 779,669 1.3880
12 Oct 2010 03:07:10 1103559 11039256 hadsm3dhet2_jqti_006597752_8 32,406 765,058 1.3887
11 Oct 2010 20:08:18 1103559 11039256 hadsm3dhet2_jqti_006597752_8 21,604 749,782 1.3882
11 Oct 2010 13:22:43 1103559 11039256 hadsm3dhet2_jqti_006597752_8 10,802 734,945 1.3885
11 Oct 2010 07:13:44 1103559 11039256 hadsm3dhet2_jqti_006597752_8 259,248 719,935 1.3885
10 Oct 2010 23:37:48 1103559 11039256 hadsm3dhet2_jqti_006597752_8 248,446 704,563 1.3878
10 Oct 2010 12:11:31 1103559 11039256 hadsm3dhet2_jqti_006597752_8 237,644 689,524 1.3877
10 Oct 2010 03:46:37 1103559 11039256 hadsm3dhet2_jqti_006597752_8 226,842 674,767 1.3882


©2024 cpdn.org