Name | hadcm3n_ycfm_2020_40_008364795_0 |
Workunit | 8515654 |
Created | 10 May 2013, 22:11:00 UTC |
Sent | 10 May 2013, 22:18:04 UTC |
Report deadline | 10 Aug 2013, 5:45:15 UTC |
Received | 16 Jun 2013, 15:54:13 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1036109 |
Run time | 12 days 18 hours 58 min 51 sec |
CPU time | 12 days 17 hours 37 min 24 sec |
Validate state | Invalid |
Credit | 10,575.36 |
Device peak FLOPS | 2.87 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> 17:57:03 (4308): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:43 (4372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:45:10 (5536): No heartbeat from core client for 30 sec - exiting 17:45:11 (5536): No heartbeat from core client for 30 sec - exiting 17:45:12 (5536): No heartbeat from core client for 30 sec - exiting 17:45:13 (5536): No heartbeat from core client for 30 sec - exiting 17:45:14 (5536): No heartbeat from core client for 30 sec - exiting 17:45:15 (5536): No heartbeat from core client for 30 sec - exiting 17:45:16 (5536): No heartbeat from core client for 30 sec - exiting 17:45:18 (5536): No heartbeat from core client for 30 sec - exiting 17:45:19 (5536): No heartbeat from core client for 30 sec - exiting 17:45:20 (5536): No heartbeat from core client for 30 sec - exiting 17:45:21 (5536): No heartbeat from core client for 30 sec - exiting 17:45:22 (5536): No heartbeat from core client for 30 sec - exiting 17:45:23 (5536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:45:24 (5536): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 18:10:36 (5424): No heartbeat from core client for 30 sec - exiting 18:10:37 (5424): No heartbeat from core client for 30 sec - exiting 18:10:38 (5424): No heartbeat from core client for 30 sec - exiting 18:10:39 (5424): No heartbeat from core client for 30 sec - exiting 18:10:40 (5424): No heartbeat from core client for 30 sec - exiting 18:10:41 (5424): No heartbeat from core client for 30 sec - exiting 18:10:42 (5424): No heartbeat from core client for 30 sec - exiting 18:10:43 (5424): No heartbeat from core client for 30 sec - exiting 18:10:45 (5424): No heartbeat from core client for 30 sec - exiting 18:10:46 (5424): No heartbeat from core client for 30 sec - exiting 18:10:47 (5424): No heartbeat from core client for 30 sec - exiting 18:10:48 (5424): No heartbeat from core client for 30 sec - exiting 18:10:49 (5424): No heartbeat from core client for 30 sec - exiting 18:10:50 (5424): No heartbeat from core client for 30 sec - exiting 18:10:51 (5424): No heartbeat from core client for 30 sec - exiting 18:10:52 (5424): No heartbeat from core client for 30 sec - exiting 18:10:53 (5424): No heartbeat from core client for 30 sec - exiting 18:10:54 (5424): No heartbeat from core client for 30 sec - exiting 18:10:55 (5424): No heartbeat from core client for 30 sec - exiting 18:10:57 (5424): No heartbeat from core client for 30 sec - exiting 18:10:58 (5424): No heartbeat from core client for 30 sec - exiting 18:10:59 (5424): No heartbeat from core client for 30 sec - exiting 18:11:00 (5424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:40:34 (5892): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:45:36 (4048): No heartbeat from core client for 30 sec - exiting 17:45:37 (4048): No heartbeat from core client for 30 sec - exiting 17:45:38 (4048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:44:54 (5464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:50:28 (5476): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:06:20 (3684): No heartbeat from core client for 30 sec - exiting 17:06:22 (3684): No heartbeat from core client for 30 sec - exiting 17:06:23 (3684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5444, iMonCtr=1 Model crash detected, will try to restart... 17:43:15 (5332): No heartbeat from core client for 30 sec - exiting 17:43:17 (5332): No heartbeat from core client for 30 sec - exiting 17:43:18 (5332): No heartbeat from core client for 30 sec - exiting 17:43:19 (5332): No heartbeat from core client for 30 sec - exiting 17:43:20 (5332): No heartbeat from core client for 30 sec - exiting 17:43:21 (5332): No heartbeat from core client for 30 sec - exiting 17:43:22 (5332): No heartbeat from core client for 30 sec - exiting 17:43:23 (5332): No heartbeat from core client for 30 sec - exiting 17:43:24 (5332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:38:56 (5316): No heartbeat from core client for 30 sec - exiting 17:38:57 (5316): No heartbeat from core client for 30 sec - exiting 17:38:58 (5316): No heartbeat from core client for 30 sec - exiting 17:39:00 (5316): No heartbeat from core client for 30 sec - exiting 17:39:01 (5316): No heartbeat from core client for 30 sec - exiting 17:39:02 (5316): No heartbeat from core client for 30 sec - exiting 17:39:03 (5316): No heartbeat from core client for 30 sec - exiting 17:39:04 (5316): No heartbeat from core client for 30 sec - exiting 17:39:05 (5316): No heartbeat from core client for 30 sec - exiting 17:39:06 (5316): No heartbeat from core client for 30 sec - exiting 17:39:07 (5316): No heartbeat from core client for 30 sec - exiting 17:39:08 (5316): No heartbeat from core client for 30 sec - exiting 17:39:09 (5316): No heartbeat from core client for 30 sec - exiting 17:39:10 (5316): No heartbeat from core client for 30 sec - exiting 17:39:12 (5316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:00:54 (5440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:30:19 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:15:47 (5612): No heartbeat from core client for 30 sec - exiting 18:15:48 (5612): No heartbeat from core client for 30 sec - exiting 18:15:49 (5612): No heartbeat from core client for 30 sec - exiting 18:15:50 (5612): No heartbeat from core client for 30 sec - exiting 18:15:51 (5612): No heartbeat from core client for 30 sec - exiting 18:15:52 (5612): No heartbeat from core client for 30 sec - exiting 18:15:54 (5612): No heartbeat from core client for 30 sec - exiting 18:15:55 (5612): No heartbeat from core client for 30 sec - exiting 18:15:56 (5612): No heartbeat from core client for 30 sec - exiting 18:15:57 (5612): No heartbeat from core client for 30 sec - exiting 18:15:58 (5612): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:51:20 (5980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:00:48 (6092): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Jun 2013 18:05:30 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 881,280 | 1,100,282 | 1.2485 |
15 Jun 2013 07:44:16 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 855,360 | 1,067,424 | 1.2479 |
14 Jun 2013 21:54:03 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 829,440 | 1,034,519 | 1.2472 |
11 Jun 2013 16:47:11 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 803,520 | 1,002,422 | 1.2475 |
09 Jun 2013 18:13:48 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 777,600 | 969,912 | 1.2473 |
09 Jun 2013 09:24:56 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 751,680 | 938,438 | 1.2485 |
08 Jun 2013 21:29:52 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 725,760 | 904,456 | 1.2462 |
08 Jun 2013 13:00:28 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 699,840 | 873,898 | 1.2487 |
08 Jun 2013 04:21:49 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 673,920 | 842,963 | 1.2508 |
07 Jun 2013 19:40:16 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 648,000 | 811,713 | 1.2526 |
06 Jun 2013 16:23:14 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 622,080 | 779,586 | 1.2532 |
04 Jun 2013 19:47:45 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 596,160 | 747,138 | 1.2533 |
02 Jun 2013 18:45:30 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 570,240 | 714,741 | 1.2534 |
02 Jun 2013 10:24:18 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 544,320 | 684,248 | 1.2571 |
02 Jun 2013 01:10:55 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 518,400 | 651,201 | 1.2562 |
01 Jun 2013 16:22:44 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 492,480 | 617,594 | 1.2540 |
01 Jun 2013 07:17:52 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 466,560 | 587,115 | 1.2584 |
31 May 2013 23:05:34 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 440,640 | 556,436 | 1.2628 |
30 May 2013 23:07:29 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 414,720 | 524,611 | 1.2650 |
29 May 2013 19:58:28 | 1036109 | 15774436 | hadcm3n_ycfm_2020_40_008364795_0 | 388,800 | 493,115 | 1.2683 |
©2024 cpdn.org