climateprediction.net home page
Task 11081787

Task 11081787

Name hadsm3dhet2_ju3n_006602005_8
Workunit 6805378
Created 15 Mar 2010, 12:09:52 UTC
Sent 11 Jun 2010, 1:08:24 UTC
Report deadline 24 May 2011, 6:28:24 UTC
Received 27 Jun 2010, 20:39:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 961915
Run time 8 days 11 hours 52 min 57 sec
CPU time 5 days 3 hours 18 min 19 sec
Validate state Invalid
Credit 2,679.57
Device peak FLOPS 2.99 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6492, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11180, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11180, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5336, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8700, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8700, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11496, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4684, iMonCtr=1
Model crash detected, will try to restart...
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
MainError:	09:59:16 AM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5044, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9848, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11688, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12604, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13892, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14820, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14820, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6028, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
27 Jun 2010 06:38:24 961915 11081787 hadsm3dhet2_ju3n_006602005_8 32,406 426,515 1.4624
26 Jun 2010 10:56:23 961915 11081787 hadsm3dhet2_ju3n_006602005_8 21,604 403,596 1.4370
25 Jun 2010 22:59:06 961915 11081787 hadsm3dhet2_ju3n_006602005_8 10,802 383,842 1.4214
25 Jun 2010 10:03:26 961915 11081787 hadsm3dhet2_ju3n_006602005_8 259,248 363,903 1.4037
24 Jun 2010 12:07:48 961915 11081787 hadsm3dhet2_ju3n_006602005_8 248,446 346,286 1.3938
24 Jun 2010 04:53:21 961915 11081787 hadsm3dhet2_ju3n_006602005_8 237,644 331,727 1.3959
23 Jun 2010 11:16:31 961915 11081787 hadsm3dhet2_ju3n_006602005_8 226,842 315,764 1.3920
23 Jun 2010 05:03:50 961915 11081787 hadsm3dhet2_ju3n_006602005_8 216,040 300,561 1.3912
22 Jun 2010 03:32:01 961915 11081787 hadsm3dhet2_ju3n_006602005_8 205,238 286,108 1.3940
21 Jun 2010 14:59:05 961915 11081787 hadsm3dhet2_ju3n_006602005_8 194,436 270,559 1.3915
19 Jun 2010 21:17:53 961915 11081787 hadsm3dhet2_ju3n_006602005_8 183,634 256,415 1.3963
19 Jun 2010 05:03:06 961915 11081787 hadsm3dhet2_ju3n_006602005_8 172,832 243,457 1.4086
18 Jun 2010 22:31:05 961915 11081787 hadsm3dhet2_ju3n_006602005_8 162,030 229,681 1.4175
18 Jun 2010 11:21:49 961915 11081787 hadsm3dhet2_ju3n_006602005_8 151,228 215,652 1.4260
18 Jun 2010 04:49:46 961915 11081787 hadsm3dhet2_ju3n_006602005_8 140,426 200,949 1.4310
17 Jun 2010 07:25:00 961915 11081787 hadsm3dhet2_ju3n_006602005_8 129,624 185,946 1.4345
16 Jun 2010 06:57:25 961915 11081787 hadsm3dhet2_ju3n_006602005_8 118,822 171,553 1.4438
15 Jun 2010 14:20:56 961915 11081787 hadsm3dhet2_ju3n_006602005_8 108,020 157,027 1.4537
15 Jun 2010 07:53:43 961915 11081787 hadsm3dhet2_ju3n_006602005_8 97,218 144,069 1.4819
14 Jun 2010 11:45:00 961915 11081787 hadsm3dhet2_ju3n_006602005_8 86,416 129,904 1.5032


©2024 cpdn.org