Task 11900145

Name: hadsm3dhet2_u3gp_006725972_9
Workunit: 6929315
Created: 17 Sep 2010, 8:08:46 UTC
Sent: 20 Sep 2010, 16:18:17 UTC
Report deadline: 2 Sep 2011, 21:38:17 UTC
Received: 13 Dec 2010, 0:05:36 UTC
Server state: Over
Outcome: Computation error
Client state: Compute error
Exit status: 22 (0x00000016) Unknown error code
Computer ID: 1102172
Run time: 11 days 15 hours 8 min 19 sec
CPU time: 11 days 11 hours 26 min 57 sec
Validate state: Invalid
Credit: 3,771.25
Device peak FLOPS: 1.44 GFLOPS
Application version: UK Met Office HadSM3 Slab Model v6.08 i686-pc-linux-gnu
Stderr
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN process is not running, exiting, bRetVal = 1, checkPID=5046, selfPID=5046, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=10228, iMonCtr=1
Model crash detected, will try to restart...
CPDN process is not running, exiting, bRetVal = 1, checkPID=10602, selfPID=10602, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=7077, selfPID=7077, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=9013, selfPID=9013, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=9798, selfPID=9798, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=10324, selfPID=10324, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=10324, selfPID=10324, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=13458, selfPID=13458, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=13892, selfPID=13892, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=14649, selfPID=14649, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=14649, selfPID=14649, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=15024, selfPID=15024, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=16106, selfPID=16106, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=18755, selfPID=18755, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=20252, selfPID=20252, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=21405, selfPID=21405, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=24585, selfPID=24585, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=24585, selfPID=24585, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=24585, selfPID=24585, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=24585, selfPID=24585, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=25718, selfPID=25718, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=25718, selfPID=25718, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=9319, selfPID=9319, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=9960, selfPID=9960, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=11142, selfPID=11142, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=11142, selfPID=11142, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=12355, selfPID=12355, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=12779, selfPID=12779, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=13543, selfPID=13543, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=13870, selfPID=13870, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=13870, selfPID=13870, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=14656, selfPID=14656, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=16104, selfPID=16104, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=16671, selfPID=16671, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=16854, selfPID=16854, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=17680, selfPID=17680, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=17944, selfPID=17944, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=17944, selfPID=17944, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=18296, selfPID=18296, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=18627, selfPID=18627, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=18940, selfPID=18940, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=18940, selfPID=18940, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 1, checkPID=19356, selfPID=19356, iMonCtr=1
CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0
CPDN process is not running, exiting, bRetVal = 1, checkPID=20234, selfPID=20234, iMonCtr=1
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
MainError:	12:47:18 AM	No files match the supplied pattern.
Suspended CPDN Monitor - Quit request from BOINC...
CPDN process is not running, exiting, bRetVal = 1, checkPID=30106, selfPID=30106, iMonCtr=1
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
/scratch/.boinc.beo-34/projects/climateprediction.net/hadsm3_um_6.08_i686-pc-linux-gnu: error while loading shared libraries: libm.so.6: wrong ELF class: ELFCLASS64
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=997, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
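
The loader message in the stderr output above (`wrong ELF class: ELFCLASS64`) is what the dynamic linker typically reports when a 32-bit executable, here the i686 model binary, is handed a 64-bit copy of a shared library such as libm.so.6. The following is a minimal sketch, not part of the original task page, of how the class of a binary or library could be checked by reading the fifth byte (EI_CLASS) of its ELF header. The function names `elf_class` and `elf_class_of_file` are hypothetical and chosen for illustration only.

```python
# Sketch: report whether an ELF file is 32-bit or 64-bit.
# Assumption: the helper names below are illustrative, not from any CPDN or BOINC tool.

ELF_MAGIC = b"\x7fELF"  # first four bytes of every ELF file
EI_CLASS_NAMES = {0: "ELFCLASSNONE", 1: "ELFCLASS32", 2: "ELFCLASS64"}


def elf_class(header: bytes) -> str:
    """Return the ELF class name encoded in the first five header bytes."""
    if len(header) < 5 or header[:4] != ELF_MAGIC:
        raise ValueError("not an ELF file")
    # Byte index 4 is EI_CLASS: 1 means 32-bit objects, 2 means 64-bit objects.
    return EI_CLASS_NAMES.get(header[4], "unknown")


def elf_class_of_file(path: str) -> str:
    """Read the start of the file at `path` and return its ELF class name."""
    with open(path, "rb") as f:
        return elf_class(f.read(5))
```

Calling `elf_class_of_file` on both the model executable and the libm.so.6 that the loader resolves would show whether the two classes differ, which is the condition the error message describes.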
Latest Trickles Received
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS)
28 Oct 2010 07:09:27 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 151,228 | 972,352 | 2.3688
28 Oct 2010 03:29:22 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 140,426 | 949,715 | 2.3762
16 Oct 2010 15:52:41 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 129,624 | 923,721 | 2.3754
14 Oct 2010 08:44:48 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 118,822 | 898,185 | 2.3757
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 108,020 | 871,262 | 2.3723
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 97,218 | 847,874 | 2.3786
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 86,416 | 824,579 | 2.3855
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 75,614 | 798,353 | 2.3841
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 64,812 | 770,938 | 2.3790
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 54,010 | 744,045 | 2.3752
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 43,208 | 715,780 | 2.3666
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 32,406 | 679,921 | 2.3313
09 Oct 2010 21:59:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 21,604 | 646,605 | 2.3023
06 Oct 2010 10:11:16 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 10,802 | 620,918 | 2.2993
06 Oct 2010 03:51:21 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 259,248 | 587,465 | 2.2660
03 Oct 2010 21:45:16 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 248,446 | 563,917 | 2.2698
01 Oct 2010 11:17:43 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 237,644 | 538,131 | 2.2644
01 Oct 2010 03:36:39 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 226,842 | 513,228 | 2.2625
30 Sep 2010 20:20:24 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 216,040 | 487,316 | 2.2557
30 Sep 2010 12:57:03 | 1102172 | 11900145 | hadsm3dhet2_u3gp_006725972_9 | 205,238 | 460,859 | 2.2455