Name | famous_u27e_1799_200_006635701_2 |
Workunit | 6839073 |
Created | 10 Jun 2010, 11:25:27 UTC |
Sent | 18 Jul 2010, 1:24:03 UTC |
Report deadline | 17 Oct 2010, 8:51:14 UTC |
Received | 26 Jul 2010, 19:58:24 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1074321 |
Run time | 7 days 3 hours 5 min 52 sec |
CPU time | 4 days 19 hours 20 min 26 sec |
Validate state | Invalid |
Credit | 2,347.09 |
Device peak FLOPS | 2.09 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 13:10:15 (3772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:10:16 (3772): No heartbeat from core client for 30 sec - exiting Could not launch model process. Last Error=1450 13:10:56 (62576): called boinc_finish 13:50:24 (62948): Can't set up shared mem: -1. Will run in standalone mode. 13:50:27 (61460): Can't set up shared mem: -1. Will run in standalone mode. BUFFOUT: Write Failed: Invalid argument BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 16:23:59 (62144): Can't set up shared mem: -1. Will run in standalone mode. 05:44:56 (5240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit reWorker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5536, selfPID=5536, iMonCtr=1 07:55:40 (4492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:55:41 (4492): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:13:57 (5712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:13:58 (5712): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=5400, selfPID=5400, iMonCtr=1 Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 0, checkPID=0, selfPID=0, iMonCtr=0 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC...CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 CPDN Monitor - Quit request from BOINC...CPDN Monitor - Quit request from BOINC.No Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=4152, selfPID=4152, iMonCtr=1 19:57:27 (3220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BONo Process Handle Worker:: CPDN process is not running, exiting, bRetVal = 1, checkPID=6000, selfPID=6000, iMonCtr=1 Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Model crashed: ATM_DYN : INVALID THETA DETECTED. tmp/pipe_dummy Sorry, too many model crashes! :-( 19:57:13 (4964): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Jul 2010 17:39:22 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 711,386 | 411,335 | 0.5782 |
26 Jul 2010 14:23:14 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 702,026 | 405,650 | 0.5778 |
26 Jul 2010 11:09:19 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 692,666 | 400,057 | 0.5776 |
26 Jul 2010 09:25:16 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 683,306 | 394,982 | 0.5780 |
26 Jul 2010 07:41:50 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 673,946 | 389,983 | 0.5787 |
26 Jul 2010 06:06:44 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 664,586 | 384,955 | 0.5792 |
26 Jul 2010 04:15:55 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 655,226 | 379,921 | 0.5798 |
25 Jul 2010 18:47:22 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 645,866 | 374,346 | 0.5796 |
25 Jul 2010 16:04:36 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 636,506 | 369,062 | 0.5798 |
25 Jul 2010 14:24:30 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 627,146 | 363,966 | 0.5804 |
25 Jul 2010 12:42:59 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 617,786 | 358,859 | 0.5809 |
25 Jul 2010 11:19:00 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 608,426 | 353,720 | 0.5814 |
25 Jul 2010 09:25:01 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 599,066 | 348,524 | 0.5818 |
25 Jul 2010 07:34:09 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 589,706 | 343,470 | 0.5824 |
25 Jul 2010 03:58:47 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 580,346 | 338,237 | 0.5828 |
25 Jul 2010 02:41:12 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 570,986 | 333,101 | 0.5834 |
25 Jul 2010 00:40:41 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 561,626 | 327,949 | 0.5839 |
24 Jul 2010 22:19:30 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 552,266 | 322,726 | 0.5844 |
24 Jul 2010 20:28:31 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 542,906 | 317,644 | 0.5851 |
24 Jul 2010 18:29:26 | 1074321 | 11429311 | famous_u27e_1799_200_006635701_2 | 533,546 | 312,544 | 0.5858 |
©2024 cpdn.org