climateprediction.net home page
Task 11577187

Task 11577187

Name famous_up0m_599_200_006665265_0
Workunit 6868637
Created 10 Jun 2010, 15:44:17 UTC
Sent 17 Jun 2010, 12:02:08 UTC
Report deadline 16 Sep 2010, 19:29:19 UTC
Received 10 Oct 2010, 13:06:02 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 22 (0x00000016) Unknown error code
Computer ID 980556
Run time 8 days 14 hours 16 min 48 sec
CPU time 8 days 7 hours 5 min 44 sec
Validate state Invalid
Credit 5,157.32
Device peak FLOPS 2.06 GFLOPS
Application version UK Met Office FAMOUS v6.10
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The device does not recognize the command. (0x16) - exit code 22 (0x16)
</message>
<stderr_txt>
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8656, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7592, iMonCtr=1
Model crash detected, will try to restart...
07:15:37 (4544): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5136, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1
Model crash detected, will try to restart...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
CPDN Monitor - Quit request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
10:49:13 (5612): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:41:07 (7584): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
11:41:13 (7584): No heartbeat from core client for 30 sec - exiting
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Suspend request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7380, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1516, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6984, iMonCtr=1
Model crash detected, will try to restart...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
Suspended CPDN Monitor - Suspend request from BOINC...
Suspended CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
21:16:39 (4008): No heartbeat from core client for 30 sec - exiting
21:16:40 (4008): No heartbeat from core client for 30 sec - exiting
21:16:41 (4008): No heartbeat from core client for 30 sec - exiting
21:16:42 (4008): No heartbeat from core client for 30 sec - exiting
21:16:43 (4008): No heartbeat from core client for 30 sec - exiting
21:16:44 (4008): No heartbeat from core client for 30 sec - exiting
21:16:45 (4008): No heartbeat from core client for 30 sec - exiting
21:16:46 (4008): No heartbeat from core client for 30 sec - exiting
21:16:47 (4008): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
21:16:48 (4008): No heartbeat from core client for 30 sec - exiting
21:18:22 (5928): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6728, iMonCtr=1
Model crash detected, will try to restart...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3724, iMonCtr=1
Model crash detected, will try to restart...
18:54:07 (7316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
18:54:08 (7316): No heartbeat from core client for 30 sec - exiting
18:54:09 (7316): No heartbeat from core client for 30 sec - exiting
18:54:10 (7316): No heartbeat from core client for 30 sec - exiting
18:54:11 (7316): No heartbeat from core client for 30 sec - exiting
18:54:12 (7316): No heartbeat from core client for 30 sec - exiting
18:54:13 (7316): No heartbeat from core client for 30 sec - exiting
18:54:14 (7316): No heartbeat from core client for 30 sec - exiting
18:54:15 (7316): No heartbeat from core client for 30 sec - exiting
18:54:16 (7316): No heartbeat from core client for 30 sec - exiting
18:54:17 (7316): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - Quit request from BOINC...

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 60 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 61 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 68 - Return code = 16

BUFFIN: Read Failed: No such file or directory
BUFFIN: C I/O Error feof - Unit 69 - Return code = 16
CPDN Monitor - Quit request from BOINC...
21:09:49 (5460): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  

Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED.                                                                                                                                                                                                                     tmp/pipe_dummy                                                                  
Sorry, too many model crashes! :-(
23:04:51 (6896): called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
10 Oct 2010 12:16:51 980556 11577187 famous_up0m_599_200_006665265_0 1,563,146 713,756 0.4566
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,553,786 709,292 0.4565
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,544,426 704,829 0.4564
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,535,066 700,359 0.4562
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,525,706 695,892 0.4561
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,516,346 691,433 0.4560
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,506,986 686,978 0.4559
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,497,626 682,473 0.4557
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,488,266 677,897 0.4555
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,478,906 673,290 0.4553
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,469,546 668,759 0.4551
10 Oct 2010 11:11:07 980556 11577187 famous_up0m_599_200_006665265_0 1,460,186 664,270 0.4549
06 Oct 2010 12:33:13 980556 11577187 famous_up0m_599_200_006665265_0 1,450,826 659,862 0.4548
03 Oct 2010 12:58:00 980556 11577187 famous_up0m_599_200_006665265_0 1,441,466 655,299 0.4546
03 Oct 2010 03:23:02 980556 11577187 famous_up0m_599_200_006665265_0 1,432,106 650,846 0.4545
30 Sep 2010 11:38:58 980556 11577187 famous_up0m_599_200_006665265_0 1,422,746 646,629 0.4545
30 Sep 2010 10:20:23 980556 11577187 famous_up0m_599_200_006665265_0 1,413,386 642,136 0.4543
30 Sep 2010 04:50:48 980556 11577187 famous_up0m_599_200_006665265_0 1,404,026 637,585 0.4541
30 Sep 2010 04:50:48 980556 11577187 famous_up0m_599_200_006665265_0 1,394,666 633,033 0.4539
25 Sep 2010 14:17:03 980556 11577187 famous_up0m_599_200_006665265_0 1,385,306 629,003 0.4541


©2024 cpdn.org