climateprediction.net home page
Task 11833437

Task 11833437

Name famous_vmbh_599_200_006714691_3
Workunit 6917944
Created 26 Aug 2010, 17:47:57 UTC
Sent 31 Oct 2010, 9:58:37 UTC
Report deadline 30 Jan 2011, 17:25:48 UTC
Received 12 Nov 2010, 14:43:23 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1070784
Run time 2 days 20 hours 39 min 45 sec
CPU time 1 days 18 hours 4 min 25 sec
Validate state Invalid
Credit 586.84
Device peak FLOPS 1.06 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
 (2102): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (2102): No heartbeat from core client for 30 sec - exiting
 (12144): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (12144): No heartbeat from core client for 30 sec - exiting
 (13059): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (13059): No heartbeat from core client for 30 sec - exiting
 (13059): No heartbeat from core client for 30 sec - exiting
 (13059): No heartbeat from core client for 30 sec - exiting
 (13059): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
Model crash detected, will try to restart...
 (4240): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4240, iMonCtr=1
 (4240): No heartbeat from core client for 30 sec - exiting
Model crash detected, will try to restart...
CPDN Monitor - No 'heartbeat' from BOINC...
 (4240): No heartbeat from core client for 30 sec - exiting
 (4240): No heartbeat from core client for 30 sec - exiting
 (4240): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4453, iMonCtr=1
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
 (1902): No heartbeat from core client for 30 sec - exiting
 (1902): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (1902): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
 (1784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (1784): No heartbeat from core client for 30 sec - exiting
 (1784): No heartbeat from core client for 30 sec - exiting
 (1784): No heartbeat from core client for 30 sec - exiting
 (1784): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
 (3185): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (3185): No heartbeat from core client for 30 sec - exiting
 (3185): No heartbeat from core client for 30 sec - exiting
 (3185): No heartbeat from core client for 30 sec - exiting
 (5711): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (5718): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (5727): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (5727): No heartbeat from core client for 30 sec - exiting
 (5727): No heartbeat from core client for 30 sec - exiting
 (5727): No heartbeat from core client for 30 sec - exiting
 (5727): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5736, iMonCtr=1
Model crash detected, will try to restart...
 (5736): No heartbeat from core client for 30 sec - exiting
 (5736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: STWORK  : Error in PP_FILE                                                                                                                                                                                                                                      tmp/pipe_dummy                                                                  
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  08267EB8  Unknown               Unknown  Unknown
famous_um_6.11_i6  082EFC2C  Unknown               Unknown  Unknown
famous_um_6.11_i6  08328714  Unknown               Unknown  Unknown
famous_um_6.11_i6  08330BDF  Unknown               Unknown  Unknown
famous_um_6.11_i6  083415D2  Unknown               Unknown  Unknown
libc.so.6          006C3BD6  Unknown               Unknown  Unknown
famous_um_6.11_i6  0804C041  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...

BUFFOUT: Write Failed: No space left on device
BUFFOUT: C I/O Error - Return code = 1

Model crashed: WRITHEAD: I/O error                                                                                                                                                                                                                                             tmp/pipe_dummy                                                                  
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  083DFAFC  Unknown               Unknown  Unknown
famous_um_6.11_i6  08133BD3  Unknown               Unknown  Unknown
famous_um_6.11_i6  082AEFA7  Unknown               Unknown  Unknown
famous_um_6.11_i6  0833098C  Unknown               Unknown  Unknown
famous_um_6.11_i6  083415D2  Unknown               Unknown  Unknown
libc.so.6          00909BD6  Unknown               Unknown  Unknown
famous_um_6.11_i6  0804C041  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  083DFAFC  Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  083DFAFC  Unknown               Unknown  Unknown
famous_um_6.11_i6  08133B55  Unknown               Unknown  Unknown
famous_um_6.11_i6  082AEFA7  Unknown               Unknown  Unknown
famous_um_6.11_i6  0833098C  Unknown               Unknown  Unknown
famous_um_6.11_i6  083415D2  Unknown               Unknown  Unknown
libc.so.6          00CB1BD6  Unknown               Unknown  Unknown
famous_um_6.11_i6  0804C041  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  083DFAFC  Unknown               Unknown  Unknown
famous_um_6.11_i6  08133B55  Unknown               Unknown  Unknown
famous_um_6.11_i6  082AEFA7  Unknown               Unknown  Unknown
famous_um_6.11_i6  0833098C  Unknown               Unknown  Unknown
famous_um_6.11_i6  083415D2  Unknown               Unknown  Unknown
libc.so.6          00126BD6  Unknown               Unknown  Unknown
famous_um_6.11_i6  0804C041  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...
forrtl: No space left on device
forrtl: severe (38): error during write, unit 6, file /var/lib/boinc-client/projects/climateprediction.net/famous_vmbh_599_200_006714691/dataout/stdout_um.txt
Image              PC        Routine            Line        Source             
famous_um_6.11_i6  0840BA88  Unknown               Unknown  Unknown
famous_um_6.11_i6  0840A221  Unknown               Unknown  Unknown
famous_um_6.11_i6  083ED787  Unknown               Unknown  Unknown
famous_um_6.11_i6  083BA000  Unknown               Unknown  Unknown
famous_um_6.11_i6  083E03A1  Unknown               Unknown  Unknown
famous_um_6.11_i6  083DFAFC  Unknown               Unknown  Unknown
famous_um_6.11_i6  08133B55  Unknown               Unknown  Unknown
famous_um_6.11_i6  082AEFA7  Unknown               Unknown  Unknown
famous_um_6.11_i6  0833098C  Unknown               Unknown  Unknown
famous_um_6.11_i6  083415D2  Unknown               Unknown  Unknown
libc.so.6          00329BD6  Unknown               Unknown  Unknown
famous_um_6.11_i6  0804C041  Unknown               Unknown  Unknown
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3109, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (3109): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
12 Nov 2010 13:22:18 1070784 11833437 famous_vmbh_599_200_006714691_3 177,866 148,745 0.8363
11 Nov 2010 21:44:25 1070784 11833437 famous_vmbh_599_200_006714691_3 168,506 140,842 0.8358
11 Nov 2010 18:06:36 1070784 11833437 famous_vmbh_599_200_006714691_3 159,146 132,993 0.8357
11 Nov 2010 14:29:25 1070784 11833437 famous_vmbh_599_200_006714691_3 149,786 125,129 0.8354
11 Nov 2010 10:10:17 1070784 11833437 famous_vmbh_599_200_006714691_3 140,426 117,266 0.8351
10 Nov 2010 18:15:10 1070784 11833437 famous_vmbh_599_200_006714691_3 131,066 109,416 0.8348
10 Nov 2010 15:00:50 1070784 11833437 famous_vmbh_599_200_006714691_3 121,706 101,625 0.8350
09 Nov 2010 21:48:55 1070784 11833437 famous_vmbh_599_200_006714691_3 112,346 93,767 0.8346
08 Nov 2010 22:48:43 1070784 11833437 famous_vmbh_599_200_006714691_3 102,986 85,894 0.8340
07 Nov 2010 23:17:56 1070784 11833437 famous_vmbh_599_200_006714691_3 93,626 77,926 0.8323
07 Nov 2010 20:28:47 1070784 11833437 famous_vmbh_599_200_006714691_3 84,266 70,083 0.8317
07 Nov 2010 13:42:41 1070784 11833437 famous_vmbh_599_200_006714691_3 74,906 62,296 0.8317
07 Nov 2010 10:58:18 1070784 11833437 famous_vmbh_599_200_006714691_3 65,546 54,555 0.8323
06 Nov 2010 12:27:20 1070784 11833437 famous_vmbh_599_200_006714691_3 56,186 46,835 0.8336
04 Nov 2010 21:36:23 1070784 11833437 famous_vmbh_599_200_006714691_3 46,826 39,117 0.8354
02 Nov 2010 23:11:53 1070784 11833437 famous_vmbh_599_200_006714691_3 37,466 31,261 0.8344
02 Nov 2010 16:09:56 1070784 11833437 famous_vmbh_599_200_006714691_3 28,106 23,449 0.8343
02 Nov 2010 13:23:00 1070784 11833437 famous_vmbh_599_200_006714691_3 18,746 15,717 0.8384
02 Nov 2010 10:33:47 1070784 11833437 famous_vmbh_599_200_006714691_3 9,386 7,994 0.8517


©2024 climateprediction.net