Name | famous_vj6a_1999_200_006710616_0 |
Workunit | 6913869 |
Created | 26 Aug 2010, 17:12:59 UTC |
Sent | 8 Nov 2010, 23:33:56 UTC |
Report deadline | 8 Feb 2011, 7:01:07 UTC |
Received | 11 Dec 2010, 9:25:59 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1104109 |
Run time | 7 days 3 hours 13 min 7 sec |
CPU time | 6 days 9 hours 15 min 19 sec |
Validate state | Invalid |
Credit | 4,539.69 |
Device peak FLOPS | 2.48 GFLOPS |
Application version | UK Met Office FAMOUS v6.11 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 04:10:33 (4700): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 19:01:59 (4540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:28:53 (4568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Ocean Restart file copy failed on vj6alo#da00000211741+ Ocean Restart file copy failed on vj6alo#da0000021174g+ Ocean Restart file copy failed on vj6alo#da00000211751+ Ocean Restart file copy failed on vj6alo#da0000021175g+ Ocean Restart file copy failed on vj6alo#da00000211761+ Ocean Restart file copy failed on vj6alo#da0000021176g+ Ocean Restart file copy failed on vj6alo#da00000211771+ Ocean Restart file copy failed on vj6alo#da0000021177g+ Ocean Restart file copy failed on vj6alo#da00000211781+ Ocean Restart file copy failed on vj6alo#da0000021178g+ Ocean Restart file copy failed on vj6alo#da00000211791+ Ocean Restart file copy failed on vj6alo#da0000021179g+ Ocean Restart file copy failed on vj6alo#da000002117a1+ Ocean Restart file copy failed on vj6alo#da000002117ag+ Ocean Restart file copy failed on vj6alo#da000002117b1+ Ocean Restart file copy failed on vj6alo#da000002117bg+ Ocean Restart file copy failed on vj6alo#da000002117c1+ Ocean Restart file copy failed on vj6alo#da000002117cg+ Ocean Restart file copy failed on vj6alo#da00000211811+ Ocean Restart file copy failed on vj6alo#da0000021181g+ Ocean Restart file copy failed on vj6alo#da00000211821+ Ocean Restart file copy failed on vj6alo#da0000021182g+ Ocean Restart file copy failed on vj6alo#da00000211831+ Ocean Restart file copy failed on vj6alo#da0000021183g+ Ocean Restart file copy failed on vj6alo#da00000211841+ Ocean Restart file copy failed on vj6alo#da0000021184g+ Ocean Restart file copy failed on vj6alo#da00000211851+ 
Ocean Restart file copy failed on vj6alo#da0000021185g+ Ocean Restart file copy failed on vj6alo#da00000211861+ Ocean Restart file copy failed on vj6alo#da0000021186g+ Ocean Restart file copy failed on vj6alo#da00000211871+ Ocean Restart file copy failed on vj6alo#da0000021187g+ Ocean Restart file copy failed on vj6alo#da00000211881+ Ocean Restart file copy failed on vj6alo#da0000021188g+ Ocean Restart file copy failed on vj6alo#da00000211891+ Ocean Restart file copy failed on vj6alo#da0000021189g+ Ocean Restart file copy failed on vj6alo#da000002118a1+ Ocean Restart file copy failed on vj6alo#da000002118ag+ Ocean Restart file copy failed on vj6alo#da000002118b1+ Ocean Restart file copy failed on vj6alo#da000002118bg+ Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Atmos Hold Restart file rename failed on atmos_restart.hold Suspended CPDN Monitor - Suspend request from BOINC... 04:27:45 (620): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Model crashed: TEMPHIST: Write ERROR on history file for namelistNLIHISTO tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy 19:50:47 (5976): called boinc_finish cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy 19:51:51 (5348): called boinc_finish cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. 
tmp/pipe_dummy 19:52:59 (2604): called boinc_finish 19:53:44 (2176): No heartbeat from core client for 30 sec - exiting 19:53:45 (2176): No heartbeat from core client for 30 sec - exiting 19:53:46 (2176): No heartbeat from core client for 30 sec - exiting 19:53:47 (2176): No heartbeat from core client for 30 sec - exiting 19:53:48 (2176): No heartbeat from core client for 30 sec - exiting 19:53:49 (2176): No heartbeat from core client for 30 sec - exiting 19:53:50 (2176): No heartbeat from core client for 30 sec - exiting 19:53:51 (2176): No heartbeat from core client for 30 sec - exiting 19:53:52 (2176): No heartbeat from core client for 30 sec - exiting 19:53:53 (2176): No heartbeat from core client for 30 sec - exiting 19:53:54 (2176): No heartbeat from core client for 30 sec - exiting 19:53:55 (2176): No heartbeat from core client for 30 sec - exiting 19:53:56 (2176): No heartbeat from core client for 30 sec - exiting 19:53:57 (2176): No heartbeat from core client for 30 sec - exiting 19:53:58 (2176): No heartbeat from core client for 30 sec - exiting 19:53:59 (2176): No heartbeat from core client for 30 sec - exiting 19:54:00 (2176): No heartbeat from core client for 30 sec - exiting 19:54:01 (2176): No heartbeat from core client for 30 sec - exiting 19:54:02 (2176): No heartbeat from core client for 30 sec - exiting 19:54:03 (2176): No heartbeat from core client for 30 sec - exiting 19:54:04 (2176): No heartbeat from core client for 30 sec - exiting cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts CPDN Monitor - No 'heartbeat' from BOINC... cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. 
tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy cpdnmonitor: cannot open input file C:\ProgramData\BOINC/projects/climateprediction.net/famous_vj6a_1999_200_006710616/dataout/atmos_restart.day after 11 attempts Model crashed: DRLANDF1 : Error in FILE_OPEN. tmp/pipe_dummy Sorry, too many model crashes! :-( 21:14:29 (2792): called boinc_finish 21:14:30 (2792): No heartbeat from core client for 30 sec - exiting 21:14:31 (2792): No heartbeat from core client for 30 sec - exiting error: cannot delete old famous_vj6a_1999_200_006710616/jobs/afyel.PRESM_O 21:25:44 (4820): called boinc_finish </stderr_txt> ]]> |
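The "Run time" and "CPU time" fields above are given as day/hour/minute/second strings. As a rough sketch for comparing them, the snippet below converts both to seconds and computes the fraction of wall-clock time spent on the CPU. The helper name `duration_to_seconds` and the unit table are hypothetical, introduced only for this illustration; they are not part of BOINC or the CPDN site.

```python
import re

# Seconds per unit, keyed by the unit stems used on the result page.
UNIT_SECONDS = {"day": 86400, "hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string such as '7 days 3 hours 13 min 7 sec' to seconds."""
    total = 0
    for value, unit in re.findall(r"(\d+)\s*(day|hour|min|sec)", text):
        total += int(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("7 days 3 hours 13 min 7 sec")    # "Run time" field
cpu_time = duration_to_seconds("6 days 9 hours 15 min 19 sec")   # "CPU time" field

# Fraction of elapsed run time that was actual CPU time.
cpu_fraction = cpu_time / run_time
print(run_time, cpu_time, round(cpu_fraction, 3))
```

The gap between the two values is at least consistent with the many suspend requests recorded in the stderr log, although the page itself does not state the cause.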
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 Dec 2010 09:30:50 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,375,946 | 548,968 | 0.3990 |
11 Dec 2010 00:05:59 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,366,586 | 545,222 | 0.3990 |
10 Dec 2010 22:58:47 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,357,226 | 541,482 | 0.3990 |
10 Dec 2010 20:22:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,347,866 | 537,740 | 0.3990 |
10 Dec 2010 19:14:25 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,338,506 | 534,026 | 0.3990 |
10 Dec 2010 18:06:38 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,329,146 | 530,326 | 0.3990 |
10 Dec 2010 16:48:11 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,319,786 | 526,652 | 0.3990 |
10 Dec 2010 15:24:10 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,310,426 | 522,971 | 0.3991 |
10 Dec 2010 14:10:36 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,301,066 | 519,239 | 0.3991 |
10 Dec 2010 12:54:50 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,291,706 | 515,517 | 0.3991 |
10 Dec 2010 10:48:44 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,282,346 | 511,794 | 0.3991 |
10 Dec 2010 10:19:08 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,272,986 | 508,020 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,263,626 | 504,295 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,254,266 | 500,546 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,244,906 | 496,809 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,235,546 | 493,067 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,226,186 | 489,315 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,216,826 | 485,611 | 0.3991 |
10 Dec 2010 10:19:07 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,207,466 | 481,817 | 0.3990 |
09 Dec 2010 23:59:10 | 1104109 | 11812988 | famous_vj6a_1999_200_006710616_0 | 1,198,106 | 478,069 | 0.3990 |
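
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The page does not say so explicitly, so treat that as an assumption; the sketch below simply checks it against three rows copied from the table. The function name `seconds_per_timestep` is hypothetical.

```python
# (timestep, cpu_time_sec) pairs copied from three rows of the trickle table.
trickles = [
    (1_375_946, 548_968),  # 11 Dec 2010 09:30:50, listed average 0.3990
    (1_310_426, 522_971),  # 10 Dec 2010 15:24:10, listed average 0.3991
    (1_198_106, 478_069),  # 09 Dec 2010 23:59:10, listed average 0.3990
]

def seconds_per_timestep(timestep, cpu_seconds):
    """Assumed definition: cumulative CPU seconds per model timestep."""
    return round(cpu_seconds / timestep, 4)

averages = [seconds_per_timestep(ts, cpu) for ts, cpu in trickles]
for (ts, cpu), avg in zip(trickles, averages):
    print(f"{ts:>9,} timesteps  {cpu:>7,} s  {avg:.4f} sec/TS")
```

Under that assumption the computed values match the listed averages for these rows. Consecutive rows also differ by 9,360 timesteps, which suggests one trickle per fixed block of model timesteps.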