climateprediction.net home page
Task 11890080

Task 11890080

Name hadsm3dhet2_u0bj_006724966_4
Workunit 6928309
Created 17 Sep 2010, 8:07:18 UTC
Sent 23 Sep 2010, 21:21:22 UTC
Report deadline 6 Sep 2011, 2:41:22 UTC
Received 4 Jun 2011, 22:53:50 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1040491
Run time 17 days 18 hours 42 min 50 sec
CPU time 13 days 10 hours 11 min 29 sec
Validate state Invalid
Credit 4,664.44
Device peak FLOPS 2.08 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4240, selfPID=4240, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=4612, selfPID=4612, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=2480, selfPID=2480, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=2692, selfPID=2692, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=1436, selfPID=1436, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=2620, selfPID=2620, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4996, selfPID=4996, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4824, selfPID=4824, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3620, selfPID=3620, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3408, selfPID=3408, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4232, selfPID=4232, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=4712, selfPID=4712, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.

CPDN Monitor - Quit request from BOINC...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5076, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3164, selfPID=3164, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
MainError:	06:32:13 PM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=4024, selfPID=4024, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=3960, selfPID=3960, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=2664, selfPID=2664, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3488, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.

CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2568, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
20 Apr 2011 18:57:25 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 248,446 819,653 1.6145
20 Apr 2011 18:57:25 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 237,644 803,107 1.6163
12 Apr 2011 23:29:44 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 226,842 786,517 1.6180
11 Apr 2011 17:11:42 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 216,040 770,629 1.6214
10 Apr 2011 18:33:31 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 205,238 755,367 1.6262
08 Apr 2011 16:01:05 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 194,436 737,833 1.6263
07 Apr 2011 17:00:34 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 183,634 720,889 1.6277
05 Apr 2011 20:43:27 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 172,832 704,949 1.6315
04 Apr 2011 02:26:47 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 162,030 689,227 1.6360
01 Apr 2011 23:45:12 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 151,228 673,383 1.6405
31 Mar 2011 20:51:01 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 140,426 656,708 1.6431
30 Mar 2011 14:15:25 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 129,624 638,010 1.6407
29 Mar 2011 00:49:33 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 118,822 620,522 1.6413
28 Mar 2011 00:26:06 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 108,020 603,125 1.6422
26 Mar 2011 15:33:00 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 97,218 585,836 1.6435
25 Mar 2011 07:08:40 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 86,416 568,145 1.6436
23 Mar 2011 13:39:45 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 75,614 549,687 1.6415
21 Mar 2011 22:41:27 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 64,812 533,341 1.6458
20 Mar 2011 12:34:58 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 54,010 517,059 1.6506
18 Mar 2011 15:14:58 1040491 11890080 hadsm3dhet2_u0bj_006724966_4 43,208 499,847 1.6526


©2024 climateprediction.net