climateprediction.net home page
Task 11004920

Task 11004920

Name hadsm3dhet2_jo65_006594319_3
Workunit 6797692
Created 15 Mar 2010, 11:59:50 UTC
Sent 10 Oct 2010, 9:16:43 UTC
Report deadline 22 Sep 2011, 14:36:43 UTC
Received 14 Dec 2010, 16:41:45 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS
Computer ID 1106036
Run time 6 days 4 hours 59 min 12 sec
CPU time 5 days 4 hours 3 min 1 sec
Validate state Invalid
Credit 2,679.57
Device peak FLOPS 2.14 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.07
windows_intelx86
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
too many exit(0)s
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4888, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5832, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4660, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2108, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5360, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5208, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1516, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4536, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7152, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5408, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=544, selfPID=544, iMonCtr=1
CNo Process Handle
CPDN process is not running, exiting, bRetVal = 1, checkPID=7640, selfPID=7640, iMonCtr=1
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5552, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=4152, selfPID=4152, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4512, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4764, iMonCtr=1
Model crash detected, will try to restart...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
MainError:	03:51:56 PM	No files match the supplied pattern.
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Dec 2010 16:47:39 1106036 11004920 hadsm3dhet2_jo65_006594319_3 32,406 440,430 1.5101
05 Dec 2010 14:07:57 1106036 11004920 hadsm3dhet2_jo65_006594319_3 21,604 424,144 1.5102
03 Dec 2010 12:29:52 1106036 11004920 hadsm3dhet2_jo65_006594319_3 10,802 408,124 1.5113
30 Nov 2010 17:55:16 1106036 11004920 hadsm3dhet2_jo65_006594319_3 259,248 392,296 1.5132
27 Nov 2010 21:35:59 1106036 11004920 hadsm3dhet2_jo65_006594319_3 248,446 376,275 1.5145
26 Nov 2010 11:33:45 1106036 11004920 hadsm3dhet2_jo65_006594319_3 237,644 360,496 1.5170
24 Nov 2010 08:50:48 1106036 11004920 hadsm3dhet2_jo65_006594319_3 226,842 344,375 1.5181
20 Nov 2010 13:14:52 1106036 11004920 hadsm3dhet2_jo65_006594319_3 216,040 328,061 1.5185
20 Nov 2010 13:14:52 1106036 11004920 hadsm3dhet2_jo65_006594319_3 205,238 311,827 1.5193
15 Nov 2010 17:25:03 1106036 11004920 hadsm3dhet2_jo65_006594319_3 194,436 295,188 1.5182
14 Nov 2010 11:26:20 1106036 11004920 hadsm3dhet2_jo65_006594319_3 183,634 278,551 1.5169
14 Nov 2010 11:26:20 1106036 11004920 hadsm3dhet2_jo65_006594319_3 172,832 262,193 1.5170
07 Nov 2010 12:58:26 1106036 11004920 hadsm3dhet2_jo65_006594319_3 162,030 245,868 1.5174
05 Nov 2010 15:39:00 1106036 11004920 hadsm3dhet2_jo65_006594319_3 151,228 229,827 1.5197
03 Nov 2010 13:59:14 1106036 11004920 hadsm3dhet2_jo65_006594319_3 140,426 213,389 1.5196
31 Oct 2010 22:46:26 1106036 11004920 hadsm3dhet2_jo65_006594319_3 129,624 196,948 1.5194
29 Oct 2010 19:09:11 1106036 11004920 hadsm3dhet2_jo65_006594319_3 118,822 180,215 1.5167
28 Oct 2010 17:07:31 1106036 11004920 hadsm3dhet2_jo65_006594319_3 108,020 163,584 1.5144
26 Oct 2010 18:09:36 1106036 11004920 hadsm3dhet2_jo65_006594319_3 97,218 147,248 1.5146
25 Oct 2010 17:19:28 1106036 11004920 hadsm3dhet2_jo65_006594319_3 86,416 130,812 1.5137


©2024 climateprediction.net