Task 11900545

Name hadsm3dhet2_u4fp_006726012_9
Workunit 6929355
Created 17 Sep 2010, 8:08:53 UTC
Sent 20 Sep 2010, 13:09:54 UTC
Report deadline 2 Sep 2011, 18:29:54 UTC
Received 5 Oct 2010, 8:32:03 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1101570
Run time
CPU time 5 days 19 hours 29 min 46 sec
Validate state Invalid
Credit 3,275.03
Device peak FLOPS 1.00 GFLOPS
Application version UK Met Office HadSM3 Slab Model v6.08
i686-pc-linux-gnu
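
The exit status listed above and the `process exited with code 22 (0x16, -234)` message in the stderr output give the same value in three notations. As a minimal sketch, assuming the negative form is the low byte with all higher bits set (an inference from 22 - 256 = -234, not something this page states), the three forms can be reproduced as follows. The function name `exit_code_forms` is hypothetical.

```python
def exit_code_forms(code: int) -> tuple[int, str, int]:
    """Return (decimal, hex, negative) renderings of an 8-bit exit code."""
    low_byte = code & 0xFF        # process exit codes occupy one byte
    negative = low_byte - 0x100   # assumption: same byte with the higher bits set
    return low_byte, f"0x{low_byte:x}", negative


print(exit_code_forms(22))  # (22, '0x16', -234)
```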
Stderr
<core_client_version>6.2.14</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
SIGSEGV: segmentation violation
Stack trace (7 frames):
[0x80a1fb7]
[0x805d018]
[0x8108308]
[0x804bdea]
[0x804c012]
[0x81014b0]
[0x8048121]

Exiting...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
MainError:	12:26:17 AM	No files match the supplied pattern. (line repeated 56 times)
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: 

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: 
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /paja/projects/climateprediction.net/hadsm3dhet2_u4fp_006726012/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
hadsm3_um_6.08_i6  087C31CC  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087C0D75  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087A2FD3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0877238E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  08794B6E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087935C3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0869CF0E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  085EB9F7  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086C0712  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086FEB0F  Unknown               Unknown  Unknown
Unknown            40075455  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0804BB31  Unknown               Unknown  Unknown
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12864, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /paja/projects/climateprediction.net/hadsm3dhet2_u4fp_006726012/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
hadsm3_um_6.08_i6  087C31CC  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087C0D75  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087A2FD3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0877238E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  08794B6E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087935C3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0869CF0E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  085EB9F7  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086C0712  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086FEB0F  Unknown               Unknown  Unknown
Unknown            40075455  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0804BB31  Unknown               Unknown  Unknown
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12864, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /paja/projects/climateprediction.net/hadsm3dhet2_u4fp_006726012/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
hadsm3_um_6.08_i6  087C31CC  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087C0D75  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087A2FD3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0877238E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  08794310  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  081B18EF  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  082E7441  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086D3B2E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086EFF99  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086C0886  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086FEB0F  Unknown               Unknown  Unknown
Unknown            40075455  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0804BB31  Unknown               Unknown  Unknown
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12864, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /paja/projects/climateprediction.net/hadsm3dhet2_u4fp_006726012/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
hadsm3_um_6.08_i6  087C31CC  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087C0D75  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  087A2FD3  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0877238E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  08794310  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  081B18EF  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  082E7441  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086D3B2E  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086EFF99  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086C0886  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  086FEB0F  Unknown               Unknown  Unknown
Unknown            40075455  Unknown               Unknown  Unknown
hadsm3_um_6.08_i6  0804BB31  Unknown               Unknown  Unknown
CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12864, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
called boinc_finish

</stderr_txt>
]]>
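
The stderr output above mixes several distinct failure signals: a segmentation violation, file-pattern errors, repeated "No space left on device" write failures, restart attempts, and the final give-up message. A minimal sketch for tallying them from a saved copy of the log is shown here. The labels, the function name `summarize_stderr`, and the short sample text are illustrative assumptions; only the matched phrases come from the log itself.

```python
import re
from collections import Counter

# Phrases copied from the stderr output; the labels are hypothetical.
PATTERNS = {
    "segfault": r"SIGSEGV",
    "no_match": r"No files match the supplied pattern",
    "disk_full": r"No space left on device",
    "restart_attempt": r"Model crash detected, will try to restart",
    "gave_up": r"too many model crashes",
}


def summarize_stderr(text: str) -> Counter:
    """Count the lines of a stderr dump that match each known failure phrase."""
    counts = Counter()
    for line in text.splitlines():
        for label, pattern in PATTERNS.items():
            if re.search(pattern, line):
                counts[label] += 1
    return counts


# Short excerpt for illustration only, not the full log.
sample = """SIGSEGV: segmentation violation
BUFFOUT: Write Failed: No space left on device
forrtl: No space left on device
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
"""
print(summarize_stderr(sample))
```

Counting per line rather than per match keeps the tally aligned with how the log reads: each restart attempt and each failed write appears on its own line.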
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                   Timestep  CPU Time (sec)  Average (sec/TS)
05 Oct 2010 04:21:17  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  97,218    489,497         1.3732
05 Oct 2010 03:09:44  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  86,416    474,339         1.3723
04 Oct 2010 19:50:12  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  75,614    458,959         1.3706
04 Oct 2010 15:25:05  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  64,812    443,852         1.3697
04 Oct 2010 09:36:49  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  54,010    428,490         1.3679
04 Oct 2010 05:24:23  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  43,208    413,802         1.3681
04 Oct 2010 03:17:57  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  32,406    398,982         1.3680
03 Oct 2010 21:04:31  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  21,604    384,169         1.3679
03 Oct 2010 16:33:28  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  10,802    368,407         1.3642
03 Oct 2010 12:27:06  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  259,248   353,823         1.3648
03 Oct 2010 07:27:58  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  248,446   338,967         1.3643
03 Oct 2010 03:29:00  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  237,644   324,356         1.3649
02 Oct 2010 23:17:40  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  226,842   309,618         1.3649
02 Oct 2010 19:08:26  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  216,040   294,534         1.3633
02 Oct 2010 14:58:30  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  205,238   279,677         1.3627
02 Oct 2010 10:41:13  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  194,436   264,339         1.3595
02 Oct 2010 06:31:59  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  183,634   249,525         1.3588
02 Oct 2010 03:14:34  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  172,832   234,103         1.3545
01 Oct 2010 22:08:36  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  162,030   219,350         1.3538
01 Oct 2010 18:00:11  1101570  11900545   hadsm3dhet2_u4fp_006726012_9  151,228   204,536         1.3525
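
The Average (sec/TS) column appears to be cumulative CPU time divided by cumulative timesteps, with the Timestep counter wrapping after 259,248 (the largest value listed before it drops back to 10,802). Both points are inferences from the rows above, not documented behaviour. A minimal sketch that checks this reading against a few rows follows; `PHASE_TIMESTEPS` and `average_sec_per_ts` are hypothetical names.

```python
# Assumption: the Timestep counter resets after this many steps
# (the largest value in the table before it restarts at 10,802).
PHASE_TIMESTEPS = 259_248


def average_sec_per_ts(cpu_sec: int, timestep: int, completed_phases: int) -> float:
    """CPU seconds per timestep, counting timesteps from earlier phases too."""
    total_timesteps = completed_phases * PHASE_TIMESTEPS + timestep
    return cpu_sec / total_timesteps


# (cpu_sec, timestep, completed_phases, average listed in the table)
rows = [
    (489_497, 97_218, 1, 1.3732),   # 05 Oct 2010 04:21:17
    (368_407, 10_802, 1, 1.3642),   # 03 Oct 2010 16:33:28, first row after the reset
    (353_823, 259_248, 0, 1.3648),  # 03 Oct 2010 12:27:06, last row before the reset
    (204_536, 151_228, 0, 1.3525),  # 01 Oct 2010 18:00:11
]
for cpu_sec, timestep, phases, listed in rows:
    computed = round(average_sec_per_ts(cpu_sec, timestep, phases), 4)
    print(computed, listed, computed == listed)
```

Under the same assumption, the last trickle corresponds to 356,466 timesteps in total (259,248 + 97,218).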

