Name | hadcm3n_ykjm_1900_40_007514315_3 |
Workunit | 7711790 |
Created | 25 Nov 2011, 5:42:20 UTC |
Sent | 25 Nov 2011, 6:00:48 UTC |
Report deadline | 24 Feb 2012, 13:27:59 UTC |
Received | 3 Jan 2012, 1:54:55 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1051495 |
Run time | 15 days 6 hours 55 min 35 sec |
CPU time | 14 days 22 hours 21 min 56 sec |
Validate state | Invalid |
Credit | 8,709.12 |
Device peak FLOPS | 2.47 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4292, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4016, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2320, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5116, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4280, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4800, iMonCtr=1 Model crash detected, will try to restart... 
11:31:41 (5524): No heartbeat from core client for 30 sec - exiting 11:31:42 (5524): No heartbeat from core client for 30 sec - exiting 11:31:43 (5524): No heartbeat from core client for 30 sec - exiting 11:31:44 (5524): No heartbeat from core client for 30 sec - exiting 11:31:45 (5524): No heartbeat from core client for 30 sec - exiting 11:31:46 (5524): No heartbeat from core client for 30 sec - exiting 11:31:47 (5524): No heartbeat from core client for 30 sec - exiting 11:31:48 (5524): No heartbeat from core client for 30 sec - exiting 11:31:49 (5524): No heartbeat from core client for 30 sec - exiting 11:31:50 (5524): No heartbeat from core client for 30 sec - exiting 11:31:51 (5524): No heartbeat from core client for 30 sec - exiting 11:31:52 (5524): No heartbeat from core client for 30 sec - exiting 11:31:53 (5524): No heartbeat from core client for 30 sec - exiting 11:31:54 (5524): No heartbeat from core client for 30 sec - exiting 11:31:55 (5524): No heartbeat from core client for 30 sec - exiting 11:31:56 (5524): No heartbeat from core client for 30 sec - exiting 11:31:57 (5524): No heartbeat from core client for 30 sec - exiting 11:31:58 (5524): No heartbeat from core client for 30 sec - exiting 11:31:59 (5524): No heartbeat from core client for 30 sec - exiting 11:32:00 (5524): No heartbeat from core client for 30 sec - exiting 11:32:01 (5524): No heartbeat from core client for 30 sec - exiting 11:32:02 (5524): No heartbeat from core client for 30 sec - exiting 11:32:03 (5524): No heartbeat from core client for 30 sec - exiting 11:32:04 (5524): No heartbeat from core client for 30 sec - exiting 11:32:05 (5524): No heartbeat from core client for 30 sec - exiting 11:32:06 (5524): No heartbeat from core client for 30 sec - exiting 11:32:07 (5524): No heartbeat from core client for 30 sec - exiting 11:32:08 (5524): No heartbeat from core client for 30 sec - exiting 11:32:09 (5524): No heartbeat from core client for 30 sec - exiting 11:32:10 (5524): No 
heartbeat from core client for 30 sec - exiting 11:32:11 (5524): No heartbeat from core client for 30 sec - exiting 11:32:12 (5524): No heartbeat from core client for 30 sec - exiting 11:32:13 (5524): No heartbeat from core client for 30 sec - exiting 11:32:14 (5524): No heartbeat from core client for 30 sec - exiting 11:32:15 (5524): No heartbeat from core client for 30 sec - exiting 11:32:16 (5524): No heartbeat from core client for 30 sec - exiting 11:32:17 (5524): No heartbeat from core client for 30 sec - exiting 11:32:18 (5524): No heartbeat from core client for 30 sec - exiting 11:32:19 (5524): No heartbeat from core client for 30 sec - exiting 11:32:20 (5524): No heartbeat from core client for 30 sec - exiting 11:32:21 (5524): No heartbeat from core client for 30 sec - exiting 11:32:22 (5524): No heartbeat from core client for 30 sec - exiting 11:32:23 (5524): No heartbeat from core client for 30 sec - exiting 11:32:24 (5524): No heartbeat from core client for 30 sec - exiting 11:32:25 (5524): No heartbeat from core client for 30 sec - exiting 11:32:26 (5524): No heartbeat from core client for 30 sec - exiting 11:32:27 (5524): No heartbeat from core client for 30 sec - exiting 11:32:28 (5524): No heartbeat from core client for 30 sec - exiting 11:32:29 (5524): No heartbeat from core client for 30 sec - exiting 11:32:30 (5524): No heartbeat from core client for 30 sec - exiting 11:32:31 (5524): No heartbeat from core client for 30 sec - exiting 11:32:32 (5524): No heartbeat from core client for 30 sec - exiting 11:32:33 (5524): No heartbeat from core client for 30 sec - exiting 11:32:34 (5524): No heartbeat from core client for 30 sec - exiting 11:32:35 (5524): No heartbeat from core client for 30 sec - exiting 11:32:36 (5524): No heartbeat from core client for 30 sec - exiting 11:32:37 (5524): No heartbeat from core client for 30 sec - exiting 11:32:38 (5524): No heartbeat from core client for 30 sec - exiting 11:32:39 (5524): No heartbeat from core client 
for 30 sec - exiting 11:32:40 (5524): No heartbeat from core client for 30 sec - exiting 11:32:41 (5524): No heartbeat from core client for 30 sec - exiting 11:32:42 (5524): No heartbeat from core client for 30 sec - exiting 11:32:43 (5524): No heartbeat from core client for 30 sec - exiting 11:32:44 (5524): No heartbeat from core client for 30 sec - exiting 11:32:45 (5524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:35:30 (4448): No heartbeat from core client for 30 sec - exiting 11:35:31 (4448): No heartbeat from core client for 30 sec - exiting 11:35:32 (4448): No heartbeat from core client for 30 sec - exiting 11:35:33 (4448): No heartbeat from core client for 30 sec - exiting 11:36:05 (4448): No heartbeat from core client for 30 sec - exiting 11:36:06 (4448): No heartbeat from core client for 30 sec - exiting 11:36:07 (4448): No heartbeat from core client for 30 sec - exiting 11:36:08 (4448): No heartbeat from core client for 30 sec - exiting 11:36:09 (4448): No heartbeat from core client for 30 sec - exiting 11:36:10 (4448): No heartbeat from core client for 30 sec - exiting 11:36:11 (4448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:37:43 (5680): No heartbeat from core client for 30 sec - exiting 11:37:44 (5680): No heartbeat from core client for 30 sec - exiting 11:37:45 (5680): No heartbeat from core client for 30 sec - exiting 11:37:46 (5680): No heartbeat from core client for 30 sec - exiting 11:38:19 (5680): No heartbeat from core client for 30 sec - exiting 11:38:20 (5680): No heartbeat from core client for 30 sec - exiting 11:38:21 (5680): No heartbeat from core client for 30 sec - exiting 11:38:22 (5680): No heartbeat from core client for 30 sec - exiting 11:38:23 (5680): No heartbeat from core client for 30 sec - exiting 11:38:24 (5680): No heartbeat from core client for 30 sec - exiting 11:38:25 (5680): No heartbeat from core client for 30 sec - exiting 11:38:26 (5680): No heartbeat from core client for 30 sec - exiting 11:38:27 (5680): No heartbeat from core client for 30 sec - exiting 11:38:28 (5680): No heartbeat from core client for 30 sec - exiting 11:38:29 (5680): No heartbeat from core client for 30 sec - exiting 11:38:30 (5680): No heartbeat from core client for 30 sec - exiting 11:38:31 (5680): No heartbeat from core client for 30 sec - exiting 11:38:32 (5680): No heartbeat from core client for 30 sec - exiting 11:38:33 (5680): No heartbeat from core client for 30 sec - exiting 11:38:34 (5680): No heartbeat from core client for 30 sec - exiting 11:38:35 (5680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
11:40:43 (5944): No heartbeat from core client for 30 sec - exiting 11:40:44 (5944): No heartbeat from core client for 30 sec - exiting 11:40:45 (5944): No heartbeat from core client for 30 sec - exiting 11:40:46 (5944): No heartbeat from core client for 30 sec - exiting 11:40:47 (5944): No heartbeat from core client for 30 sec - exiting 11:40:48 (5944): No heartbeat from core client for 30 sec - exiting 11:40:49 (5944): No heartbeat from core client for 30 sec - exiting 11:40:50 (5944): No heartbeat from core client for 30 sec - exiting 11:40:51 (5944): No heartbeat from core client for 30 sec - exiting 11:40:52 (5944): No heartbeat from core client for 30 sec - exiting 11:40:53 (5944): No heartbeat from core client for 30 sec - exiting 11:40:54 (5944): No heartbeat from core client for 30 sec - exiting 11:40:55 (5944): No heartbeat from core client for 30 sec - exiting 11:40:56 (5944): No heartbeat from core client for 30 sec - exiting 11:40:57 (5944): No heartbeat from core client for 30 sec - exiting 11:40:58 (5944): No heartbeat from core client for 30 sec - exiting 11:40:59 (5944): No heartbeat from core client for 30 sec - exiting 11:41:00 (5944): No heartbeat from core client for 30 sec - exiting 11:41:01 (5944): No heartbeat from core client for 30 sec - exiting 11:41:02 (5944): No heartbeat from core client for 30 sec - exiting 11:41:03 (5944): No heartbeat from core client for 30 sec - exiting 11:41:04 (5944): No heartbeat from core client for 30 sec - exiting 11:41:05 (5944): No heartbeat from core client for 30 sec - exiting 11:41:06 (5944): No heartbeat from core client for 30 sec - exiting 11:41:07 (5944): No heartbeat from core client for 30 sec - exiting 11:41:08 (5944): No heartbeat from core client for 30 sec - exiting 11:41:09 (5944): No heartbeat from core client for 30 sec - exiting 11:41:10 (5944): No heartbeat from core client for 30 sec - exiting 11:41:11 (5944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 
'heartbeat' from BOINC... 11:44:31 (5740): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4192, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1 Model crash detected, will try to restart... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
02 Jan 2012 11:40:28 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 725,760 | 1,290,120 | 1.7776 |
01 Jan 2012 23:52:44 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 699,840 | 1,247,759 | 1.7829 |
01 Jan 2012 01:35:51 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 673,920 | 1,204,771 | 1.7877 |
30 Dec 2011 20:06:58 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 648,000 | 1,162,192 | 1.7935 |
30 Dec 2011 08:24:54 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 622,080 | 1,120,132 | 1.8006 |
28 Dec 2011 23:56:18 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 596,160 | 1,078,078 | 1.8084 |
27 Dec 2011 04:56:42 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 570,240 | 1,034,882 | 1.8148 |
25 Dec 2011 23:58:24 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 544,320 | 991,419 | 1.8214 |
24 Dec 2011 21:35:53 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 518,400 | 948,921 | 1.8305 |
22 Dec 2011 10:29:03 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 492,480 | 902,239 | 1.8320 |
22 Dec 2011 04:57:18 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 466,560 | 854,770 | 1.8321 |
20 Dec 2011 08:57:51 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 440,640 | 807,822 | 1.8333 |
19 Dec 2011 05:26:54 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 414,720 | 761,042 | 1.8351 |
17 Dec 2011 17:38:54 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 388,800 | 726,695 | 1.8691 |
17 Dec 2011 03:47:43 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 362,880 | 677,602 | 1.8673 |
16 Dec 2011 09:06:37 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 336,960 | 629,353 | 1.8677 |
15 Dec 2011 19:46:11 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 311,040 | 581,519 | 1.8696 |
15 Dec 2011 06:43:48 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 285,120 | 534,858 | 1.8759 |
14 Dec 2011 12:13:47 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 259,200 | 488,129 | 1.8832 |
07 Dec 2011 04:39:33 | 1051495 | 13660465 | hadcm3n_ykjm_1900_40_007514315_3 | 233,280 | 440,206 | 1.8870 |
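
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the cumulative timestep count. The sketch below checks that reading against the three most recent trickles. It is an assumption inferred from the numbers in the table, not a documented CPDN formula, and the function name `sec_per_timestep` is hypothetical.

```python
# Rows copied from the trickle table above:
# (timestep, cpu_time_sec, reported_average_sec_per_ts)
rows = [
    (725_760, 1_290_120, 1.7776),
    (699_840, 1_247_759, 1.7829),
    (673_920, 1_204_771, 1.7877),
]


def sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Cumulative CPU seconds per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


for timestep, cpu_time_sec, reported in rows:
    # Round to four decimals to match the precision shown in the table.
    computed = round(sec_per_timestep(cpu_time_sec, timestep), 4)
    print(f"timestep={timestep:>7}  computed={computed:.4f}  reported={reported:.4f}")
```

For these rows the computed values agree with the reported column to four decimal places, which is consistent with a running average over the whole task rather than a per-trickle rate.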
©2024 cpdn.org