Task 12502647

Name hadsm3dhet2_u1fe_006725821_12
Workunit 6929164
Created 18 Jan 2011, 7:00:07 UTC
Sent 18 Jan 2011, 7:00:37 UTC
Report deadline 31 Dec 2011, 12:20:37 UTC
Received 20 Mar 2011, 6:00:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1046220
Run time
CPU time 9 days 18 hours 52 min 39 sec
Validate state Invalid
Credit 2,481.08
Device peak FLOPS 0.81 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07 (windows_intelx86)
Stderr
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4328, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4824, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3152, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5200, iMonCtr=1
Model crash detected, will try to restart...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3956, selfPID=3956, iMonCtr=1
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5260, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4948, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4604, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4796, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3344, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3816, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
MainError:	07:00:38 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5300, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4988, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                    Timestep  CPU Time (sec)  Average (sec/TS)
20 Mar 2011 06:01:06  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  10,802    836,657         3.0982
16 Mar 2011 07:03:36  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  259,248   810,514         3.1264
15 Mar 2011 06:04:57  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  248,446   781,629         3.1461
13 Mar 2011 07:04:49  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  237,644   746,228         3.1401
12 Mar 2011 07:01:56  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  226,842   710,764         3.1333
12 Mar 2011 07:01:56  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  216,040   676,012         3.1291
12 Mar 2011 07:01:56  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  205,238   640,910         3.1228
12 Mar 2011 07:01:56  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  194,436   606,023         3.1168
26 Feb 2011 07:03:00  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  183,634   571,047         3.1097
24 Feb 2011 07:03:18  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  172,832   535,685         3.0995
22 Feb 2011 07:00:39  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  162,030   501,011         3.0921
20 Feb 2011 07:03:51  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  151,228   467,151         3.0891
18 Feb 2011 07:04:27  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  140,426   432,879         3.0826
16 Feb 2011 07:18:47  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  129,624   399,005         3.0782
16 Feb 2011 07:18:47  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  118,822   365,166         3.0732
12 Feb 2011 07:01:43  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  108,020   330,236         3.0572
12 Feb 2011 07:01:43  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  97,218    305,885         3.1464
10 Feb 2011 07:02:45  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  86,416    270,738         3.1330
08 Feb 2011 07:05:19  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  75,614    235,987         3.1209
06 Feb 2011 07:05:46  1046220  12502647   hadsm3dhet2_u1fe_006725821_12  64,812    202,231         3.1203