climateprediction.net home page
Task 11907927

Task 11907927

Name hadsm3dhet2_u6jo_006726751_1
Workunit 6930094
Created 17 Sep 2010, 8:09:52 UTC
Sent 17 Sep 2010, 23:08:30 UTC
Report deadline 31 Aug 2011, 4:28:30 UTC
Received 21 Apr 2011, 21:44:38 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 255 (0x000000FF) Unknown error code
Computer ID 1292564
Run time 15 days 14 hours 1 min 11 sec
CPU time 13 days 22 hours 7 min 31 sec
Validate state Invalid
Credit 3,473.52
Device peak FLOPS 2.00 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
The extended attributes are inconsistent. (0xff) - exit code 255 (0xff)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4460, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CNo heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4644, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4544, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4452, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4320, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2324, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2324, iMonCtr=1
Model crash detected, will try to restart...
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
MainError:	07:33:49 AM	No files match the supplied pattern.
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3224, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3388, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3908, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3300, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3272, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3552, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3920, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3904, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3884, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x77CB3800 read attempt to address 0x7CF00000

Engaging BOINC Windows Runtime Debugger...


</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
21 Mar 2011 02:58:57 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 118,822 711,408 1.8817
20 Mar 2011 01:59:41 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 108,020 691,038 1.8816
08 Mar 2011 18:06:49 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 97,218 670,328 1.8805
08 Mar 2011 18:06:49 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 86,416 649,418 1.8788
08 Mar 2011 18:06:49 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 75,614 629,311 1.8793
08 Mar 2011 18:06:47 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 64,812 608,837 1.8788
22 Feb 2011 05:48:59 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 54,010 588,562 1.8788
20 Feb 2011 03:51:39 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 43,208 568,897 1.8809
19 Feb 2011 15:59:21 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 32,406 548,493 1.8806
19 Feb 2011 15:59:21 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 21,604 528,484 1.8817
15 Feb 2011 15:58:35 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 10,802 507,781 1.8803
12 Feb 2011 07:38:26 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 259,248 486,959 1.8784
11 Feb 2011 16:56:53 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 248,446 466,503 1.8777
11 Feb 2011 16:56:51 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 237,644 446,681 1.8796
04 Feb 2011 00:19:40 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 226,842 426,207 1.8789
31 Jan 2011 18:27:30 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 216,040 405,935 1.8790
28 Jan 2011 19:55:40 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 205,238 385,352 1.8776
27 Jan 2011 23:22:22 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 194,436 365,049 1.8775
23 Dec 2010 17:15:16 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 183,634 344,905 1.8782
22 Dec 2010 00:13:07 1090551 11907927 hadsm3dhet2_u6jo_006726751_1 172,832 324,642 1.8784


©2024 climateprediction.net