climateprediction.net home page
Task 10947145

Task 10947145

Name hadsm3dhet2_jjpo_006588542_0
Workunit 6791915
Created 15 Mar 2010, 11:50:27 UTC
Sent 26 Oct 2010, 14:37:49 UTC
Report deadline 8 Oct 2011, 19:57:49 UTC
Received 14 Dec 2010, 15:31:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1032008
Run time 23 days 3 hours 23 min 3 sec
CPU time 20 days 19 hours 44 min 47 sec
Validate state Invalid
Credit 4,862.93
Device peak FLOPS 0.96 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4620, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4524, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4616, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=276, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4208, iMonCtr=1
Model crash detected, will try to restart...
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
MainError:	07:02:13 PM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5816, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4236, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5368, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5964, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5168, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5236, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4428, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4892, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5064, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4816, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5572, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4324, iMonCtr=1
Model crash detected, will try to restart...
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
MainError:	04:31:54 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5248, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Dec 2010 15:02:58 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 10,802 1,765,628 3.3358
13 Dec 2010 09:31:09 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 259,248 1,730,117 3.3368
12 Dec 2010 17:42:50 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 248,446 1,694,313 3.3373
11 Dec 2010 17:40:05 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 237,644 1,657,911 3.3366
10 Dec 2010 21:14:48 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 226,842 1,621,798 3.3364
09 Dec 2010 22:01:42 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 216,040 1,584,330 3.3334
08 Dec 2010 22:07:17 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 205,238 1,544,716 3.3256
07 Dec 2010 21:33:31 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 194,436 1,506,542 3.3207
06 Dec 2010 19:56:04 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 183,634 1,470,767 3.3209
05 Dec 2010 20:25:19 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 172,832 1,435,233 3.3217
05 Dec 2010 09:50:07 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 162,030 1,399,679 3.3225
04 Dec 2010 23:11:42 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 151,228 1,364,326 3.3238
04 Dec 2010 11:34:34 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 140,426 1,328,171 3.3231
03 Dec 2010 13:33:39 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 129,624 1,292,300 3.3232
03 Dec 2010 09:24:41 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 118,822 1,256,825 3.3243
02 Dec 2010 14:41:20 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 108,020 1,220,706 3.3237
01 Dec 2010 07:40:52 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 97,218 1,184,810 3.3238
29 Nov 2010 21:30:20 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 86,416 1,149,243 3.3247
28 Nov 2010 15:23:06 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 75,614 1,113,276 3.3246
27 Nov 2010 19:56:53 1032008 10947145 hadsm3dhet2_jjpo_006588542_0 64,812 1,077,613 3.3254


©2024 climateprediction.net