Name | famous_wa3s_1399_200_007058327_2 |
Workunit | 7261643 |
Created | 31 Mar 2011, 22:18:42 UTC |
Sent | 31 Mar 2011, 22:18:45 UTC |
Report deadline | 1 Jul 2011, 5:45:56 UTC |
Received | 17 Aug 2011, 15:36:58 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1005404 |
Run time | 13 days 16 hours 40 min 20 sec |
CPU time | 11 days 13 hours 26 min 52 sec |
Validate state | Invalid |
Credit | 4,261.75 |
Device peak FLOPS | 1.53 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.6.36</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 23:16:15 (6268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:16:16 (6268): No heartbeat from core client for 30 sec - exiting 23:16:20 (6268): No heartbeat from core client for 30 sec - exiting 23:16:21 (6268): No heartbeat from core client for 30 sec - exiting 23:16:22 (6268): No heartbeat from core client for 30 sec - exiting 23:16:23 (6268): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2876, selfPID=2876, iMonCtr=1 01:05:52 (4892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:05:53 (4892): No heartbeat from core client for 30 sec - exiting 01:05:54 (4892): No heartbeat from core client for 30 sec - exiting 01:05:55 (4892): No heartbeat from core client for 30 sec - exiting 01:05:56 (4892): No heartbeat from core client for 30 sec - exiting 01:05:57 (4892): No heartbeat from core client for 30 sec - exiting 01:05:58 (4892): No heartbeat from core client for 30 sec - exiting 01:05:59 (4892): No heartbeat from core client for 30 sec - exiting 01:06:00 (4892): No heartbeat from core client for 30 sec - exiting 01:06:01 (4892): No heartbeat from core client for 30 sec - exiting 01:06:02 (4892): No heartbeat from core client for 30 sec - exiting 01:06:03 (4892): No heartbeat from core client for 30 sec - exiting 01:06:04 (4892): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6572, selfPID=6572, iMonCtr=1 01:48:20 (6000): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:48:21 (6000): No heartbeat from core client for 30 sec - exiting 01:48:22 (6000): No heartbeat from core client for 30 sec - exiting 01:48:23 (6000): No heartbeat from core client for 30 sec - exiting 01:48:25 (6000): No heartbeat from core client for 30 sec - exiting 01:48:26 (6000): No heartbeat from core client for 30 sec - exiting 01:48:27 (6000): No heartbeat from core client for 30 sec - exiting 01:48:28 (6000): No heartbeat from core client for 30 sec - exiting 01:48:29 (6000): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6448, selfPID=6448, iMonCtr=1 02:28:36 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:28:37 (5928): No heartbeat from core client for 30 sec - exiting 02:28:53 (7072): Can't acquire lockfile (32) - waiting 35s Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6424, selfPID=6424, iMonCtr=1 05:12:01 (7072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:12:02 (7072): No heartbeat from core client for 30 sec - exiting 05:12:03 (7072): No heartbeat from core client for 30 sec - exiting 05:12:04 (7072): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5424, selfPID=5424, iMonCtr=1 05:12:30 (7736): Can't acquire lockfile (32) - waiting 35s 07:41:43 (7736): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:41:44 (7736): No heartbeat from core client for 30 sec - exiting 07:41:45 (7736): No heartbeat from core client for 30 sec - exiting 07:41:46 (7736): No heartbeat from core client for 30 sec - exiting 07:41:47 (7736): No heartbeat from core client for 30 sec - exiting 07:41:48 (7736): No heartbeat from core client for 30 sec - exiting 07:41:49 (7736): No heartbeat from core client for 30 sec - exiting 07:41:50 (7736): No heartbeat from core client for 30 sec - exiting 07:41:51 (7736): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7896, selfPID=7896, iMonCtr=1 09:11:47 (7388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:11:48 (7388): No heartbeat from core client for 30 sec - exiting 09:11:57 (7388): No heartbeat from core client for 30 sec - exiting 09:11:58 (7388): No heartbeat from core client for 30 sec - exiting 09:11:59 (7388): No heartbeat from core client for 30 sec - exiting 09:12:00 (7388): No heartbeat from core client for 30 sec - exiting 09:12:01 (7388): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7956, selfPID=7956, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4592, selfPID=4592, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5520, selfPID=5520, iMonCtr=1 19:52:29 (4620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:52:30 (4620): No heartbeat from core client for 30 sec - exiting 19:52:31 (4620): No heartbeat from core client for 30 sec - exiting 19:52:32 (4620): No heartbeat from core client for 30 sec - exiting Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6084, selfPID=6084, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=2740, selfPID=2740, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=1864, selfPID=1864, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=7080, selfPID=7080, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=3992, selfPID=3992, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6588, selfPID=6588, iMonCtr=1 CPDN Monitor - Quit request from BOINC... No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6944, selfPID=6944, iMonCtr=1 23:21:34 (7004): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:21:35 (7004): No heartbeat from core client for 30 sec - exiting 23:21:36 (7004): No heartbeat from core client for 30 sec - exiting 23:21:37 (7004): No heartbeat from core client for 30 sec - exiting 23:21:38 (7004): No heartbeat from core client for 30 sec - exiting forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... forrtl: severe (47): write to READONLY file, unit 6, file C:\ProgramData\BOINC\projects\climateprediction.net\famous_wa3s_1399_200_007058327\dataout\stdout_um.txt Image PC Routine Line Source famous_um_6.11_wi 008846D2 Unknown Unknown Unknown Stack trace terminated abnormally. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4224, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( 23:23:38 (4224): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Aug 2011 15:36:11 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,291,706 | 996,598 | 0.7715 |
17 Aug 2011 15:36:11 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,282,346 | 989,373 | 0.7715 |
17 Aug 2011 15:36:11 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,272,986 | 982,260 | 0.7716 |
17 Aug 2011 15:36:11 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,263,626 | 975,160 | 0.7717 |
17 Aug 2011 15:36:11 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,254,266 | 968,290 | 0.7720 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,244,906 | 961,197 | 0.7721 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,235,546 | 953,726 | 0.7719 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,226,186 | 946,073 | 0.7716 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,216,826 | 938,493 | 0.7713 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,207,466 | 930,895 | 0.7709 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,198,106 | 923,313 | 0.7706 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,188,746 | 915,699 | 0.7703 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,179,386 | 908,010 | 0.7699 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,170,026 | 900,378 | 0.7695 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,160,666 | 892,877 | 0.7693 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,151,306 | 885,325 | 0.7690 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,141,946 | 877,717 | 0.7686 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,132,586 | 870,086 | 0.7682 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,123,226 | 862,326 | 0.7677 |
17 Aug 2011 15:36:10 | 1005404 | 12759655 | famous_wa3s_1399_200_007058327_2 | 1,113,866 | 854,896 | 0.7675 |
©2024 cpdn.org