Task 12087618

Name famous_w65j_599_200_006754663_2
Workunit 6957979
Created 18 Nov 2010, 9:25:57 UTC
Sent 18 Nov 2010, 9:32:23 UTC
Report deadline 17 Feb 2011, 16:59:34 UTC
Received 3 Dec 2010, 9:32:04 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 1116162
Run time 7 days 11 hours 14 min 20 sec
CPU time 7 days 6 hours 19 min 38 sec
Validate state Invalid
Credit 3,119.13
Device peak FLOPS 2.22 GFLOPS
Application version UK Met Office FAMOUS v6.11 (i686-pc-linux-gnu)
Stderr
<core_client_version>6.10.17</core_client_version>
<![CDATA[
<message>
process exited with code 22 (0x16, -234)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
 (722): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (722): No heartbeat from core client for 30 sec - exiting
 (722): No heartbeat from core client for 30 sec - exiting
 (722): No heartbeat from core client for 30 sec - exiting
 (722): No heartbeat from core client for 30 sec - exiting
 (722): No heartbeat from core client for 30 sec - exiting
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15885, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
 (15885): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
 (15885): No heartbeat from core client for 30 sec - exiting
 (15885): No heartbeat from core client for 30 sec - exiting

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
MainError:	11:50:49 PM	No files match the supplied pattern.
MainError:	11:50:49 PM	No files match the supplied pattern.
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=24509, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
 (24509): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC)       Host ID  Result ID  Result Name                      Timestep  CPU Time (sec)  Average (sec/TS)
02 Dec 2010 23:30:06  1116162  12087618   famous_w65j_599_200_006754663_2  945,386   626,341         0.6625
02 Dec 2010 18:29:59  1116162  12087618   famous_w65j_599_200_006754663_2  936,026   619,953         0.6623
02 Dec 2010 12:56:58  1116162  12087618   famous_w65j_599_200_006754663_2  926,666   613,735         0.6623
02 Dec 2010 07:04:47  1116162  12087618   famous_w65j_599_200_006754663_2  917,306   607,446         0.6622
02 Dec 2010 00:18:13  1116162  12087618   famous_w65j_599_200_006754663_2  907,946   601,224         0.6622
01 Dec 2010 22:33:40  1116162  12087618   famous_w65j_599_200_006754663_2  898,586   595,076         0.6622
01 Dec 2010 16:18:30  1116162  12087618   famous_w65j_599_200_006754663_2  889,226   588,905         0.6623
01 Dec 2010 14:34:13  1116162  12087618   famous_w65j_599_200_006754663_2  879,866   582,765         0.6623
01 Dec 2010 08:58:32  1116162  12087618   famous_w65j_599_200_006754663_2  870,506   576,583         0.6624
01 Dec 2010 08:58:32  1116162  12087618   famous_w65j_599_200_006754663_2  861,146   570,434         0.6624
01 Dec 2010 08:58:32  1116162  12087618   famous_w65j_599_200_006754663_2  851,786   564,286         0.6625
01 Dec 2010 08:58:32  1116162  12087618   famous_w65j_599_200_006754663_2  842,426   558,128         0.6625
30 Nov 2010 22:54:02  1116162  12087618   famous_w65j_599_200_006754663_2  833,066   551,977         0.6626
30 Nov 2010 21:12:32  1116162  12087618   famous_w65j_599_200_006754663_2  823,706   545,841         0.6627
30 Nov 2010 11:51:27  1116162  12087618   famous_w65j_599_200_006754663_2  814,346   539,612         0.6626
30 Nov 2010 08:47:50  1116162  12087618   famous_w65j_599_200_006754663_2  804,986   533,298         0.6625
30 Nov 2010 07:08:36  1116162  12087618   famous_w65j_599_200_006754663_2  795,626   527,138         0.6625
30 Nov 2010 00:22:03  1116162  12087618   famous_w65j_599_200_006754663_2  786,266   520,989         0.6626
29 Nov 2010 17:41:30  1116162  12087618   famous_w65j_599_200_006754663_2  776,906   514,818         0.6627
29 Nov 2010 12:59:53  1116162  12087618   famous_w65j_599_200_006754663_2  767,546   508,506         0.6625
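
The Average (sec/TS) figures in the trickle table appear to be the cumulative CPU time divided by the timestep count. The page does not state this; it is an inference from the numbers. The sketch below checks that reading against the first three rows. The function name average_sec_per_timestep is hypothetical and chosen only for illustration.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_average).
trickles = [
    (945_386, 626_341, 0.6625),
    (936_026, 619_953, 0.6623),
    (926_666, 613_735, 0.6623),
]


def average_sec_per_timestep(timestep, cpu_time_sec):
    """Assumed formula: cumulative CPU seconds per model timestep, 4 decimals."""
    return round(cpu_time_sec / timestep, 4)


for timestep, cpu_time_sec, reported in trickles:
    computed = average_sec_per_timestep(timestep, cpu_time_sec)
    # Print the computed value next to the value shown on the page.
    print(timestep, computed, reported)
```

If the assumption holds, each computed value matches the reported column to four decimal places. Successive rows also differ by 9,360 timesteps, which suggests one trickle per fixed block of model steps.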


©2024 climateprediction.net