climateprediction.net home page
Task 11490442

Task 11490442

Name famous_ubmv_599_200_006647922_1
Workunit 6851294
Created 10 Jun 2010, 13:12:04 UTC
Sent 12 Aug 2010, 4:30:06 UTC
Report deadline 11 Nov 2010, 11:57:17 UTC
Received 5 Sep 2010, 1:59:56 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 132 (0x00000084) Unknown error code
Computer ID 1085551
Run time
CPU time 6 days 23 hours 54 min 1 sec
Validate state Invalid
Credit 3,335.30
Device peak FLOPS 1.85 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
process got signal 4
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2998, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
 (2998): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=75417, iMonCtr=1
Model crash detected, will try to restart...
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
Signal 15 received, exiting...
 (75419): called boinc_finish
Signal 15 received, exiting...
 (41496): called boinc_finish
 (41496): called boinc_finish
Signal 15 received, exiting...
 (48887): called boinc_finish
Signal 15 received, exiting...
 (82135): called boinc_finish
Signal 15 received, exiting...
Signal 15 received, exiting...
 (73568): called boinc_finish
Signal 15 received, exiting...
 (2226): called boinc_finish
SIGSEGV: segmentation violation
Signal 15 received, exiting...
Signal 15 received, exiting...
Signal 15 received, exiting...
 (2224): called boinc_finish
  (34206): called boinc_finish
(31491):SIGSEGV: segmentation violation
Signal 15 received, exiting...
 (34594): called boinc_finish
Signal 15 received, exiting...
Signal 15 received, exiting...
 (62384): called boinc_finish
 (44758): called boinc_finish
famous_um_6.11_i686-pc-linux-gnu: vfprintf.c:1611: _IO_vfprintf_internal: Assertion `(size_t) done <= (size_t) 2147483647' failed.
 (34276): called boinc_finish
SIGSEGV: segmentation violation
Signal 15 received, exiting...
Signal 15 received, exiting...
Signal 15 received, exiting...
 (12350): called boinc_finish
 (8012): called boinc_finish
 (8012): called boinc_finish

Signal 15 received, exiting...
 (47711): called boinc_finish
 (30830): called boinc_finish
Signal 15 received, exiting...
Signal 15 received, exiting...
 (37876): called boinc_finish
Signal 15 received, exiting...
Signal 15 received, exiting...
 (38102): called boinc_finish
SIGSEGV: segmentation violation
Stack trace (2 frames):
/var/db/boinc/projects/climateprediction.net/famous_um_6.11_i686-pc-linux-gnu(boinc_catch_signal+0x58)[0x83aa400]
[0xffffefb7]

Exiting...
 (25172): called boinc_finish
Signal 15 received, exiting...
Signal 15 received, exiting...
Signal 15 received, exiting...
Signal 15 received, exiting...
 (4773): called boinc_finish
 (3000): called boinc_finish
 (6049): called boinc_finish
 (4771): called boinc_finish
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
 (20785): called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1734701856, selfPID=21335, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
 (21335): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
04 Sep 2010 12:13:47 1085551 11490442 famous_ubmv_599_200_006647922_1 1,010,906 634,118 0.6273
04 Sep 2010 01:36:02 1085551 11490442 famous_ubmv_599_200_006647922_1 1,001,546 619,053 0.6181
04 Sep 2010 01:36:02 1085551 11490442 famous_ubmv_599_200_006647922_1 992,186 610,706 0.6155
04 Sep 2010 01:36:02 1085551 11490442 famous_ubmv_599_200_006647922_1 982,826 604,289 0.6148
04 Sep 2010 01:36:02 1085551 11490442 famous_ubmv_599_200_006647922_1 973,466 597,958 0.6143
04 Sep 2010 01:36:02 1085551 11490442 famous_ubmv_599_200_006647922_1 964,106 582,597 0.6043
04 Sep 2010 01:36:01 1085551 11490442 famous_ubmv_599_200_006647922_1 954,746 575,921 0.6032
04 Sep 2010 01:36:01 1085551 11490442 famous_ubmv_599_200_006647922_1 945,386 604,180 0.6391
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 936,026 597,676 0.6385
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 926,666 591,118 0.6379
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 917,306 584,450 0.6371
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 907,946 577,724 0.6363
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 898,586 571,081 0.6355
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 889,226 556,374 0.6257
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 879,866 549,833 0.6249
01 Sep 2010 11:54:20 1085551 11490442 famous_ubmv_599_200_006647922_1 870,506 543,385 0.6242
01 Sep 2010 10:23:00 1085551 11490442 famous_ubmv_599_200_006647922_1 861,146 536,239 0.6227
31 Aug 2010 05:58:44 1085551 11490442 famous_ubmv_599_200_006647922_1 851,786 514,420 0.6039
31 Aug 2010 04:07:54 1085551 11490442 famous_ubmv_599_200_006647922_1 842,426 507,653 0.6026
31 Aug 2010 02:11:25 1085551 11490442 famous_ubmv_599_200_006647922_1 833,066 500,908 0.6013


©2024 climateprediction.net