climateprediction.net home page
Task 10974521

Task 10974521

Name hadsm3dhet2_jltp_006591279_4
Workunit 6794652
Created 15 Mar 2010, 11:55:56 UTC
Sent 18 Oct 2010, 19:07:32 UTC
Report deadline 1 Oct 2011, 0:27:32 UTC
Received 24 Oct 2010, 3:30:59 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 922180
Run time 4 days 9 hours 25 min 23 sec
CPU time 4 days 4 hours 41 min 39 sec
Validate state Invalid
Credit 2,381.84
Device peak FLOPS 2.25 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=5352, selfPID=5352, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=7060, selfPID=7060, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=8132, selfPID=8132, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=7260, selfPID=7260, iMonCtr=1
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
MainError:	04:34:42 AM	No files match the supplied pattern.
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8744, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
23 Oct 2010 04:38:59 922180 10974521 hadsm3dhet2_jltp_006591279_4 259,248 351,477 1.3558
23 Oct 2010 03:28:16 922180 10974521 hadsm3dhet2_jltp_006591279_4 248,446 336,566 1.3547
22 Oct 2010 20:22:16 922180 10974521 hadsm3dhet2_jltp_006591279_4 237,644 321,689 1.3537
22 Oct 2010 16:19:28 922180 10974521 hadsm3dhet2_jltp_006591279_4 226,842 306,939 1.3531
22 Oct 2010 12:19:19 922180 10974521 hadsm3dhet2_jltp_006591279_4 216,040 292,187 1.3525
22 Oct 2010 08:25:23 922180 10974521 hadsm3dhet2_jltp_006591279_4 205,238 277,443 1.3518
22 Oct 2010 04:33:26 922180 10974521 hadsm3dhet2_jltp_006591279_4 194,436 263,021 1.3527
22 Oct 2010 03:45:59 922180 10974521 hadsm3dhet2_jltp_006591279_4 183,634 248,645 1.3540
21 Oct 2010 20:48:30 922180 10974521 hadsm3dhet2_jltp_006591279_4 172,832 234,016 1.3540
21 Oct 2010 16:49:48 922180 10974521 hadsm3dhet2_jltp_006591279_4 162,030 219,223 1.3530
21 Oct 2010 12:50:15 922180 10974521 hadsm3dhet2_jltp_006591279_4 151,228 204,310 1.3510
21 Oct 2010 08:49:28 922180 10974521 hadsm3dhet2_jltp_006591279_4 140,426 189,864 1.3521
21 Oct 2010 04:49:38 922180 10974521 hadsm3dhet2_jltp_006591279_4 129,624 174,999 1.3501
21 Oct 2010 03:46:39 922180 10974521 hadsm3dhet2_jltp_006591279_4 118,822 160,120 1.3476
20 Oct 2010 18:53:31 922180 10974521 hadsm3dhet2_jltp_006591279_4 108,020 145,694 1.3488
20 Oct 2010 15:04:34 922180 10974521 hadsm3dhet2_jltp_006591279_4 97,218 131,224 1.3498
20 Oct 2010 11:18:27 922180 10974521 hadsm3dhet2_jltp_006591279_4 86,416 116,782 1.3514
20 Oct 2010 07:24:19 922180 10974521 hadsm3dhet2_jltp_006591279_4 75,614 102,396 1.3542
20 Oct 2010 03:36:34 922180 10974521 hadsm3dhet2_jltp_006591279_4 64,812 87,899 1.3562
19 Oct 2010 23:34:29 922180 10974521 hadsm3dhet2_jltp_006591279_4 54,010 72,819 1.3483


©2024 climateprediction.net