climateprediction.net home page
Task 11558676

Task 11558676

Name famous_um5t_1599_200_006661564_1
Workunit 6864936
Created 10 Jun 2010, 15:11:33 UTC
Sent 8 Jul 2010, 23:02:16 UTC
Report deadline 8 Oct 2010, 6:29:27 UTC
Received 14 Jul 2010, 10:06:47 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 132 (0x00000084) Unknown error code
Computer ID 1085551
Run time 1 days 9 hours 23 min 4 sec
CPU time 1 days 9 hours 23 min 4 sec
Validate state Invalid
Credit 772.13
Device peak FLOPS 2.47 GFLOPS
Application version UK Met Office FAMOUS v6.11
i686-pc-linux-gnu
Stderr
<core_client_version>6.4.5</core_client_version>
<![CDATA[
<message>
process got signal 4
</message>
<stderr_txt>
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=23429, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
 (23431): called boinc_finish
Signal 15 received, exiting...
 (32983): called boinc_finish
Suspended CPDN Monitor - Suspend request from BOINC...
 (42542): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
 (42544): called boinc_finish
Signal 15 received, exiting...
 (42613): called boinc_finish

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
CPDN Monitor - Quit request from BOINC...
Signal 15 received, exiting...
 (42853): called boinc_finish

BUFFIN: Read Failed: Inappropriate ioctl for device
BUFFIN: C I/O Error feof - Unit 60 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 61 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 68 - Return code = 1

BUFFIN: Read Failed: Invalid argument
BUFFIN: C I/O Error feof - Unit 69 - Return code = 1
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=43347, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
 (43347): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - No 'heartbeat' from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
13 Jul 2010 17:26:59 1085551 11558676 famous_um5t_1599_200_006661564_1 234,026 170,235 0.7274
13 Jul 2010 13:17:26 1085551 11558676 famous_um5t_1599_200_006661564_1 224,666 155,067 0.6902
13 Jul 2010 10:49:17 1085551 11558676 famous_um5t_1599_200_006661564_1 215,306 146,681 0.6813
13 Jul 2010 09:09:09 1085551 11558676 famous_um5t_1599_200_006661564_1 205,946 140,491 0.6822
13 Jul 2010 06:37:16 1085551 11558676 famous_um5t_1599_200_006661564_1 196,586 131,616 0.6695
13 Jul 2010 03:40:44 1085551 11558676 famous_um5t_1599_200_006661564_1 187,226 120,986 0.6462
12 Jul 2010 13:19:48 1085551 11558676 famous_um5t_1599_200_006661564_1 177,866 201,325 1.1319
12 Jul 2010 11:24:55 1085551 11558676 famous_um5t_1599_200_006661564_1 168,506 194,586 1.1548
12 Jul 2010 08:24:14 1085551 11558676 famous_um5t_1599_200_006661564_1 159,146 184,478 1.1592
12 Jul 2010 02:58:42 1085551 11558676 famous_um5t_1599_200_006661564_1 149,786 169,934 1.1345
11 Jul 2010 21:59:36 1085551 11558676 famous_um5t_1599_200_006661564_1 140,426 156,454 1.1141
11 Jul 2010 19:32:18 1085551 11558676 famous_um5t_1599_200_006661564_1 131,066 148,338 1.1318
11 Jul 2010 17:18:46 1085551 11558676 famous_um5t_1599_200_006661564_1 121,706 140,764 1.1566
11 Jul 2010 12:59:18 1085551 11558676 famous_um5t_1599_200_006661564_1 112,346 126,108 1.1225
11 Jul 2010 09:57:03 1085551 11558676 famous_um5t_1599_200_006661564_1 102,986 115,295 1.1195
11 Jul 2010 02:13:21 1085551 11558676 famous_um5t_1599_200_006661564_1 93,626 93,310 0.9966
10 Jul 2010 20:54:20 1085551 11558676 famous_um5t_1599_200_006661564_1 84,266 81,278 0.9645
10 Jul 2010 14:32:19 1085551 11558676 famous_um5t_1599_200_006661564_1 74,906 68,627 0.9162
09 Jul 2010 18:46:56 1085551 11558676 famous_um5t_1599_200_006661564_1 65,546 58,723 0.8959
09 Jul 2010 16:20:49 1085551 11558676 famous_um5t_1599_200_006661564_1 56,186 50,193 0.8933


©2024 cpdn.org