Name | hadcm3n_o1w1_1980_40_008384612_1 |
Workunit | 8535471 |
Created | 18 Aug 2013, 8:28:39 UTC |
Sent | 18 Aug 2013, 8:28:54 UTC |
Report deadline | 17 Nov 2013, 15:56:05 UTC |
Received | 25 Sep 2013, 15:07:38 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1016016 |
Run time | 27 days 11 hours 55 min 12 sec |
CPU time | 25 days 20 hours 45 min |
Validate state | Invalid |
Credit | 10,264.32 |
Device peak FLOPS | 1.46 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.56</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 21:07:56 (1184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:07:57 (1184): No heartbeat from core client for 30 sec - exiting 21:07:58 (1184): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=21464, iMonCtr=1 Model crash detected, will try to restart... 06:49:10 (748): No heartbeat from core client for 30 sec - exiting 06:49:12 (748): No heartbeat from core client for 30 sec - exiting 06:49:13 (748): No heartbeat from core client for 30 sec - exiting 06:49:14 (748): No heartbeat from core client for 30 sec - exiting 06:49:15 (748): No heartbeat from core client for 30 sec - exiting 06:49:16 (748): No heartbeat from core client for 30 sec - exiting 06:49:17 (748): No heartbeat from core client for 30 sec - exiting 06:49:18 (748): No heartbeat from core client for 30 sec - exiting 06:49:19 (748): No heartbeat from core client for 30 sec - exiting 06:49:20 (748): No heartbeat from core client for 30 sec - exiting 06:49:21 (748): No heartbeat from core client for 30 sec - exiting 06:49:22 (748): No heartbeat from core client for 30 sec - exiting 06:49:24 (748): No heartbeat from core client for 30 sec - exiting 06:49:25 (748): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1892, iMonCtr=1 Model crash detected, will try to restart... 
00:33:27 (3664): No heartbeat from core client for 30 sec - exiting 00:33:28 (3664): No heartbeat from core client for 30 sec - exiting 00:33:29 (3664): No heartbeat from core client for 30 sec - exiting 00:33:30 (3664): No heartbeat from core client for 30 sec - exiting 00:33:32 (3664): No heartbeat from core client for 30 sec - exiting 00:33:33 (3664): No heartbeat from core client for 30 sec - exiting 00:33:34 (3664): No heartbeat from core client for 30 sec - exiting 00:33:35 (3664): No heartbeat from core client for 30 sec - exiting 00:33:36 (3664): No heartbeat from core client for 30 sec - exiting 00:33:37 (3664): No heartbeat from core client for 30 sec - exiting 00:33:38 (3664): No heartbeat from core client for 30 sec - exiting 00:33:39 (3664): No heartbeat from core client for 30 sec - exiting 00:33:40 (3664): No heartbeat from core client for 30 sec - exiting 00:33:41 (3664): No heartbeat from core client for 30 sec - exiting 00:33:42 (3664): No heartbeat from core client for 30 sec - exiting 00:33:44 (3664): No heartbeat from core client for 30 sec - exiting 00:33:45 (3664): No heartbeat from core client for 30 sec - exiting 00:33:46 (3664): No heartbeat from core client for 30 sec - exiting 00:33:47 (3664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3376, iMonCtr=1 Model crash detected, will try to restart... 
22:12:49 (3344): No heartbeat from core client for 30 sec - exiting 22:12:51 (3344): No heartbeat from core client for 30 sec - exiting 22:12:52 (3344): No heartbeat from core client for 30 sec - exiting 22:12:53 (3344): No heartbeat from core client for 30 sec - exiting 22:13:24 (3344): No heartbeat from core client for 30 sec - exiting 22:13:25 (3344): No heartbeat from core client for 30 sec - exiting 22:13:26 (3344): No heartbeat from core client for 30 sec - exiting 22:13:27 (3344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=1760, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 07:48:49 (4704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:05:39 (2376): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5016, iMonCtr=1 Model crash detected, will try to restart... 08:17:26 (4444): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:01:28 (14644): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:01:29 (14644): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11032, iMonCtr=1 Model crash detected, will try to restart... 08:42:56 (3312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 03:05:06 (17900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
23 Sep 2013 22:33:52 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 855,360 | 2,186,679 | 2.5564 |
23 Sep 2013 11:16:55 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 829,440 | 2,120,444 | 2.5565 |
23 Sep 2013 11:16:55 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 803,520 | 2,054,543 | 2.5569 |
21 Sep 2013 09:35:44 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 777,600 | 1,988,545 | 2.5573 |
20 Sep 2013 14:06:33 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 751,680 | 1,923,059 | 2.5583 |
19 Sep 2013 15:35:36 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 725,760 | 1,856,578 | 2.5581 |
18 Sep 2013 18:53:13 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 699,840 | 1,788,983 | 2.5563 |
17 Sep 2013 22:30:32 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 673,920 | 1,722,450 | 2.5559 |
17 Sep 2013 00:55:54 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 648,000 | 1,656,956 | 2.5570 |
16 Sep 2013 05:00:11 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 622,080 | 1,591,910 | 2.5590 |
15 Sep 2013 08:28:43 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 596,160 | 1,526,153 | 2.5600 |
14 Sep 2013 12:23:59 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 570,240 | 1,460,604 | 2.5614 |
13 Sep 2013 15:12:32 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 544,320 | 1,394,325 | 2.5616 |
12 Sep 2013 18:17:33 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 518,400 | 1,327,746 | 2.5612 |
11 Sep 2013 17:33:25 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 492,480 | 1,262,212 | 2.5630 |
10 Sep 2013 21:07:41 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 466,560 | 1,197,279 | 2.5662 |
10 Sep 2013 00:05:25 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 440,640 | 1,131,560 | 2.5680 |
09 Sep 2013 04:32:05 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 414,720 | 1,067,293 | 2.5735 |
07 Sep 2013 23:38:33 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 388,800 | 1,001,014 | 2.5746 |
07 Sep 2013 04:07:55 | 1016016 | 15925081 | hadcm3n_o1w1_1980_40_008384612_1 | 362,880 | 935,898 | 2.5791 |
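
The Average (sec/TS) column in the trickle table appears to be the cumulative CPU time divided by the timestep count. That relationship is inferred from the figures themselves, not stated anywhere on this page, so the sketch below only checks it against the three most recent rows. The names `ROWS`, `sec_per_timestep`, and `check_rows` are illustrative and not part of any CPDN or BOINC tool.

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
ROWS = [
    (855_360, 2_186_679, 2.5564),
    (829_440, 2_120_444, 2.5565),
    (803_520, 2_054_543, 2.5569),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds divided by cumulative timesteps (assumed formula)."""
    return cpu_time_sec / timestep


def check_rows(rows, tol=1e-4):
    """Return one bool per row: does the assumed formula match the reported value?"""
    return [abs(sec_per_timestep(cpu, ts) - avg) < tol for ts, cpu, avg in rows]


if __name__ == "__main__":
    for ts, cpu, avg in ROWS:
        print(ts, round(sec_per_timestep(cpu, ts), 4), avg)
```

In these rows the timestep count also rises by 25,920 from one trickle to the next, which suggests each trickle is sent after a fixed block of model timesteps.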