Name | famous_vj3n_1899_200_006710521_3 |
Workunit | 6913774 |
Created | 26 Aug 2010, 17:12:11 UTC |
Sent | 9 Nov 2010, 7:14:03 UTC |
Report deadline | 8 Feb 2011, 14:41:14 UTC |
Received | 23 Nov 2010, 10:29:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 11 (0x0000000B) Unknown error code |
Computer ID | 1107692 |
Run time | 3 days 6 hours 58 min 59 sec |
CPU time | 3 days 1 hours 40 min 58 sec |
Validate state | Invalid |
Credit | 1,266.23 |
Device peak FLOPS | 1.81 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 i686-pc-linux-gnu |
Stderr | <core_client_version>6.12.5</core_client_version> <![CDATA[ <message> process got signal 11 </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2485, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2485, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2485, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2485, iMonCtr=1 Model crash detected, will try to restart... (2485): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1858): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 1 received, exiting... (2172): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1925, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1925, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1925, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1925, iMonCtr=1 Model crash detected, will try to restart... (1925): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... (1925): No heartbeat from core client for 30 sec - exiting (13603): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13719, iMonCtr=1 Model crash detected, will try to restart... (13996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1844, iMonCtr=1 Model crash detected, will try to restart... (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting (1844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=15015, selfPID=15015, iMonCtr=1 </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 383,786 | 262,741 | 0.6846 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 374,426 | 256,335 | 0.6846 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 365,066 | 249,927 | 0.6846 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 355,706 | 243,529 | 0.6846 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 346,346 | 237,141 | 0.6847 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 336,986 | 230,743 | 0.6847 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 327,626 | 224,343 | 0.6848 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 318,266 | 217,952 | 0.6848 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 308,906 | 211,560 | 0.6849 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 299,546 | 205,160 | 0.6849 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 290,186 | 198,762 | 0.6849 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 280,826 | 192,374 | 0.6850 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 271,466 | 185,972 | 0.6851 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 262,106 | 179,448 | 0.6846 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 252,746 | 173,047 | 0.6847 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 243,386 | 166,642 | 0.6847 |
23 Nov 2010 10:30:54 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 234,026 | 160,236 | 0.6847 |
17 Nov 2010 22:54:31 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 224,666 | 153,836 | 0.6847 |
17 Nov 2010 21:05:25 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 215,306 | 147,438 | 0.6848 |
17 Nov 2010 19:19:17 | 1107692 | 11812513 | famous_vj3n_1899_200_006710521_3 | 205,946 | 141,029 | 0.6848 |
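
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count, rounded to four decimal places. This is an inference from the figures in the table, not something the page states. A minimal Python sketch of that arithmetic follows; the function names `parse_count` and `average_sec_per_timestep` are hypothetical and only serve the illustration.

```python
def parse_count(text: str) -> int:
    """Convert a thousands-separated figure such as '383,786' to an int."""
    return int(text.replace(",", ""))


def average_sec_per_timestep(cpu_time_sec: str, timestep: str) -> float:
    """Cumulative CPU seconds per model timestep, rounded to 4 places.

    Assumption: the page derives its Average column this way.
    """
    return round(parse_count(cpu_time_sec) / parse_count(timestep), 4)


# (Timestep, CPU Time) pairs copied from the first and last trickle rows.
rows = [("383,786", "262,741"), ("205,946", "141,029")]

for timestep, cpu_time in rows:
    print(timestep, cpu_time, average_sec_per_timestep(cpu_time, timestep))
```

Comparing the printed values with the Average column for the same rows is a quick check on whether the assumed formula holds.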