Name | famous_wrx5_1499_200_007122944_2 |
Workunit | 7321304 |
Created | 19 Jan 2011, 18:33:58 UTC |
Sent | 19 Jan 2011, 18:45:16 UTC |
Report deadline | 21 Apr 2011, 2:12:27 UTC |
Received | 9 Feb 2011, 12:38:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1120441 |
Run time | 9 days 2 hours 42 min 4 sec |
CPU time | 8 days 12 hours 52 min 49 sec |
Validate state | Invalid |
Credit | 3,489.71 |
Device peak FLOPS | 1.76 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3160, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3092, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3144, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3072, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3260, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2972, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3036, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3252, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3252, iMonCtr=1 Model crash detected, will try to restart... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3212, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3172, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3800, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:22:18 (5600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:24:31 (4668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... 12:08:11 (3556): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 12:08:18 (1116): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 12:08:23 (1024): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 12:08:28 (3076): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 12:08:33 (3656): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... 12:08:38 (1496): called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=600, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 12:08:42 (600): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
09 Feb 2011 12:44:13 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,057,706 | 735,125 | 0.6950 |
09 Feb 2011 12:44:12 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,048,346 | 729,043 | 0.6954 |
08 Feb 2011 19:04:35 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,038,986 | 723,130 | 0.6960 |
08 Feb 2011 16:38:43 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,029,626 | 717,143 | 0.6965 |
08 Feb 2011 13:22:54 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,020,266 | 711,224 | 0.6971 |
08 Feb 2011 11:45:28 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,010,906 | 705,319 | 0.6977 |
08 Feb 2011 10:54:07 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 1,001,546 | 699,275 | 0.6982 |
08 Feb 2011 10:54:07 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 992,186 | 693,233 | 0.6987 |
07 Feb 2011 16:40:13 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 982,826 | 687,357 | 0.6994 |
07 Feb 2011 16:40:13 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 973,466 | 681,316 | 0.6999 |
07 Feb 2011 16:40:13 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 964,106 | 675,318 | 0.7005 |
07 Feb 2011 16:40:13 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 954,746 | 669,377 | 0.7011 |
07 Feb 2011 12:35:51 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 945,386 | 661,383 | 0.6996 |
06 Feb 2011 23:45:04 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 936,026 | 648,816 | 0.6932 |
06 Feb 2011 20:38:34 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 926,666 | 638,419 | 0.6889 |
06 Feb 2011 18:36:51 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 917,306 | 629,856 | 0.6866 |
05 Feb 2011 13:08:32 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 907,946 | 623,845 | 0.6871 |
05 Feb 2011 13:08:32 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 898,586 | 617,995 | 0.6877 |
05 Feb 2011 13:08:32 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 889,226 | 612,149 | 0.6884 |
05 Feb 2011 13:08:32 | 1120441 | 12506250 | famous_wrx5_1499_200_007122944_2 | 879,866 | 606,287 | 0.6891 |
©2024 cpdn.org