climateprediction.net home page
Task 10986647

Task 10986647

Name hadsm3dhet2_jmre_006592492_0
Workunit 6795865
Created 15 Mar 2010, 11:57:29 UTC
Sent 15 Oct 2010, 5:29:24 UTC
Report deadline 27 Sep 2011, 10:49:24 UTC
Received 18 Oct 2010, 18:30:06 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1078114
Run time
CPU time 1 days 11 hours 1 min 46 sec
Validate state Invalid
Credit 1,389.41
Device peak FLOPS 2.98 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.08
i686-pc-linux-gnu
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=71537, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1548, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
18 Oct 2010 12:55:09 1078114 10986647 hadsm3dhet2_jmre_006592492_0 151,228 135,502 0.8960
18 Oct 2010 10:46:36 1078114 10986647 hadsm3dhet2_jmre_006592492_0 140,426 127,990 0.9114
18 Oct 2010 08:29:18 1078114 10986647 hadsm3dhet2_jmre_006592492_0 129,624 120,007 0.9258
18 Oct 2010 06:08:47 1078114 10986647 hadsm3dhet2_jmre_006592492_0 118,822 111,554 0.9388
18 Oct 2010 04:20:23 1078114 10986647 hadsm3dhet2_jmre_006592492_0 108,020 100,484 0.9302
18 Oct 2010 04:20:23 1078114 10986647 hadsm3dhet2_jmre_006592492_0 97,218 91,252 0.9386
17 Oct 2010 21:30:47 1078114 10986647 hadsm3dhet2_jmre_006592492_0 86,416 80,693 0.9338
17 Oct 2010 18:32:58 1078114 10986647 hadsm3dhet2_jmre_006592492_0 75,614 70,109 0.9272
17 Oct 2010 15:34:48 1078114 10986647 hadsm3dhet2_jmre_006592492_0 64,812 59,538 0.9186
17 Oct 2010 12:35:20 1078114 10986647 hadsm3dhet2_jmre_006592492_0 54,010 48,989 0.9070
17 Oct 2010 09:41:40 1078114 10986647 hadsm3dhet2_jmre_006592492_0 43,208 38,614 0.8937
17 Oct 2010 06:47:07 1078114 10986647 hadsm3dhet2_jmre_006592492_0 32,406 28,309 0.8736
17 Oct 2010 03:52:26 1078114 10986647 hadsm3dhet2_jmre_006592492_0 21,604 17,908 0.8289
17 Oct 2010 03:17:27 1078114 10986647 hadsm3dhet2_jmre_006592492_0 10,802 8,754 0.8104


©2024 cpdn.org