Name | famous_up0m_599_200_006665265_0 |
Workunit | 6868637 |
Created | 10 Jun 2010, 15:44:17 UTC |
Sent | 17 Jun 2010, 12:02:08 UTC |
Report deadline | 16 Sep 2010, 19:29:19 UTC |
Received | 10 Oct 2010, 13:06:02 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 980556 |
Run time | 8 days 14 hours 16 min 48 sec |
CPU time | 8 days 7 hours 5 min 44 sec |
Validate state | Invalid |
Credit | 5,157.32 |
Device peak FLOPS | 2.06 GFLOPS |
Application version | UK Met Office FAMOUS v6.10 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8656, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7592, iMonCtr=1 Model crash detected, will try to restart... 07:15:37 (4544): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:49:13 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:41:07 (7584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:41:13 (7584): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1516, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6984, iMonCtr=1 Model crash detected, will try to restart... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 21:16:39 (4008): No heartbeat from core client for 30 sec - exiting 21:16:40 (4008): No heartbeat from core client for 30 sec - exiting 21:16:41 (4008): No heartbeat from core client for 30 sec - exiting 21:16:42 (4008): No heartbeat from core client for 30 sec - exiting 21:16:43 (4008): No heartbeat from core client for 30 sec - exiting 21:16:44 (4008): No heartbeat from core client for 30 sec - exiting 21:16:45 (4008): No heartbeat from core client for 30 sec - exiting 21:16:46 (4008): No heartbeat from core client for 30 sec - exiting 21:16:47 (4008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:16:48 (4008): No heartbeat from core client for 30 sec - exiting 21:18:22 (5928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1 Model crash detected, will try to restart... 18:54:07 (7316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:54:08 (7316): No heartbeat from core client for 30 sec - exiting 18:54:09 (7316): No heartbeat from core client for 30 sec - exiting 18:54:10 (7316): No heartbeat from core client for 30 sec - exiting 18:54:11 (7316): No heartbeat from core client for 30 sec - exiting 18:54:12 (7316): No heartbeat from core client for 30 sec - exiting 18:54:13 (7316): No heartbeat from core client for 30 sec - exiting 18:54:14 (7316): No heartbeat from core client for 30 sec - exiting 18:54:15 (7316): No heartbeat from core client for 30 sec - exiting 18:54:16 (7316): No heartbeat from core client for 30 sec - exiting 18:54:17 (7316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 60 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 61 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 CPDN Monitor - Quit request from BOINC... 21:09:49 (5460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy Sorry, too many model crashes! :-( 23:04:51 (6896): called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
10 Oct 2010 12:16:51 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,563,146 | 713,756 | 0.4566 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,553,786 | 709,292 | 0.4565 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,544,426 | 704,829 | 0.4564 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,535,066 | 700,359 | 0.4562 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,525,706 | 695,892 | 0.4561 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,516,346 | 691,433 | 0.4560 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,506,986 | 686,978 | 0.4559 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,497,626 | 682,473 | 0.4557 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,488,266 | 677,897 | 0.4555 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,478,906 | 673,290 | 0.4553 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,469,546 | 668,759 | 0.4551 |
10 Oct 2010 11:11:07 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,460,186 | 664,270 | 0.4549 |
06 Oct 2010 12:33:13 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,450,826 | 659,862 | 0.4548 |
03 Oct 2010 12:58:00 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,441,466 | 655,299 | 0.4546 |
03 Oct 2010 03:23:02 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,432,106 | 650,846 | 0.4545 |
30 Sep 2010 11:38:58 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,422,746 | 646,629 | 0.4545 |
30 Sep 2010 10:20:23 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,413,386 | 642,136 | 0.4543 |
30 Sep 2010 04:50:48 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,404,026 | 637,585 | 0.4541 |
30 Sep 2010 04:50:48 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,394,666 | 633,033 | 0.4539 |
25 Sep 2010 14:17:03 | 980556 | 11577187 | famous_up0m_599_200_006665265_0 | 1,385,306 | 629,003 | 0.4541 |
©2024 cpdn.org