climateprediction.net home page
Task 10951311

Task 10951311

Name hadsm3dhet2_jk18_006588958_5
Workunit 6792331
Created 15 Mar 2010, 11:51:09 UTC
Sent 25 Oct 2010, 10:29:08 UTC
Report deadline 7 Oct 2011, 15:49:08 UTC
Received 13 Sep 2011, 1:32:46 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 988469
Run time 10 days 19 hours 58 min 26 sec
CPU time 9 days 5 hours 15 min 57 sec
Validate state Invalid
Credit 3,374.28
Device peak FLOPS 1.68 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.6.36</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3824, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6324, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5660, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6004, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
MainError:	09:08:41 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5788, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5444, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4584, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3192, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7516, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5924, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4648, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2404, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
forrtl: Access is denied.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5132, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/hadsm3dhet2_jk18_006588958/dataout/restart.day

Model crashed: (null)
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
11 Sep 2011 12:01:29 988469 10951311 hadsm3dhet2_jk18_006588958_5 108,020 795,142 2.1650
10 Aug 2011 05:23:20 988469 10951311 hadsm3dhet2_jk18_006588958_5 97,218 756,957 2.1235
29 Jul 2011 02:03:12 988469 10951311 hadsm3dhet2_jk18_006588958_5 86,416 718,938 2.0799
25 Jul 2011 18:55:30 988469 10951311 hadsm3dhet2_jk18_006588958_5 75,614 691,380 2.0647
25 Jul 2011 15:34:36 988469 10951311 hadsm3dhet2_jk18_006588958_5 64,812 667,601 2.0601
02 Jul 2011 03:42:45 988469 10951311 hadsm3dhet2_jk18_006588958_5 54,010 644,584 2.0577
19 Jun 2011 22:01:52 988469 10951311 hadsm3dhet2_jk18_006588958_5 43,208 621,992 2.0565
12 Jun 2011 10:54:01 988469 10951311 hadsm3dhet2_jk18_006588958_5 32,406 600,088 2.0575
07 Jun 2011 05:27:36 988469 10951311 hadsm3dhet2_jk18_006588958_5 21,604 578,005 2.0580
05 Jun 2011 04:51:17 988469 10951311 hadsm3dhet2_jk18_006588958_5 10,802 555,752 2.0580
03 Jun 2011 09:12:35 988469 10951311 hadsm3dhet2_jk18_006588958_5 259,248 533,684 2.0586
28 May 2011 04:28:21 988469 10951311 hadsm3dhet2_jk18_006588958_5 248,446 511,500 2.0588
08 May 2011 13:53:15 988469 10951311 hadsm3dhet2_jk18_006588958_5 237,644 489,366 2.0592
22 Apr 2011 03:00:10 988469 10951311 hadsm3dhet2_jk18_006588958_5 226,842 466,849 2.0580
13 Apr 2011 12:52:16 988469 10951311 hadsm3dhet2_jk18_006588958_5 216,040 444,857 2.0591
01 Apr 2011 12:07:00 988469 10951311 hadsm3dhet2_jk18_006588958_5 205,238 422,395 2.0581
24 Mar 2011 05:13:07 988469 10951311 hadsm3dhet2_jk18_006588958_5 194,436 399,960 2.0570
20 Mar 2011 04:51:03 988469 10951311 hadsm3dhet2_jk18_006588958_5 183,634 377,631 2.0564
19 Mar 2011 06:26:12 988469 10951311 hadsm3dhet2_jk18_006588958_5 172,832 356,878 2.0649
15 Mar 2011 13:16:49 988469 10951311 hadsm3dhet2_jk18_006588958_5 162,030 334,320 2.0633


©2024 climateprediction.net