climateprediction.net home page
Task 14102676

Task 14102676

Name hadcm3n_yl7j_1980_40_007752993_1
Workunit 7908102
Created 16 Feb 2012, 22:30:34 UTC
Sent 16 Feb 2012, 22:30:43 UTC
Report deadline 18 May 2012, 5:57:54 UTC
Received 16 Mar 2012, 23:53:32 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 25 (0x00000019) Unknown error code
Computer ID 1038505
Run time 18 days 9 hours 6 min 43 sec
CPU time 14 days 0 hours 46 min 48 sec
Validate state Invalid
Credit 4,354.56
Device peak FLOPS 2.36 GFLOPS
Application version UK Met Office Coupled Model Full Resolution Ocean v6.07
windows_intelx86
Stderr
<core_client_version>6.10.18</core_client_version>
<![CDATA[
<message>
The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19)
</message>
<stderr_txt>
CPDN Monitor - Quit request from BOINC...
13:06:43 (6412): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
12:06:05 (6736): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
CPDN Monitor - Quit request from BOINC...
17:04:02 (6504): No heartbeat from core client for 30 sec - exiting
17:04:03 (6504): No heartbeat from core client for 30 sec - exiting
17:04:04 (6504): No heartbeat from core client for 30 sec - exiting
17:04:05 (6504): No heartbeat from core client for 30 sec - exiting
17:04:06 (6504): No heartbeat from core client for 30 sec - exiting
17:04:07 (6504): No heartbeat from core client for 30 sec - exiting
17:04:08 (6504): No heartbeat from core client for 30 sec - exiting
17:04:09 (6504): No heartbeat from core client for 30 sec - exiting
17:04:10 (6504): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Signal 22 received, exiting...
Called boinc_finish
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6796, iMonCtr=1
Model crash detected, will try to restart...
Sorry, too many model crashes! :-(
Called boinc_finish
22:31:12 (2728): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...

Model crashed: ATM_DYN : INVALID THETA DETECTED.                                                                                                                                                                                                                               tmp/pipe_dummy                                                                  2048    
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6644, iMonCtr=1
Model crash detected, will try to restart...
20:51:30 (6632): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6944, iMonCtr=1
Model crash detected, will try to restart...
13:36:50 (5796): No heartbeat from core client for 30 sec - exiting
13:36:51 (5796): No heartbeat from core client for 30 sec - exiting
13:36:52 (5796): No heartbeat from core client for 30 sec - exiting
13:36:53 (5796): No heartbeat from core client for 30 sec - exiting
13:36:54 (5796): No heartbeat from core client for 30 sec - exiting
13:36:55 (5796): No heartbeat from core client for 30 sec - exiting
13:36:56 (5796): No heartbeat from core client for 30 sec - exiting
13:36:57 (5796): No heartbeat from core client for 30 sec - exiting
13:36:58 (5796): No heartbeat from core client for 30 sec - exiting
13:36:59 (5796): No heartbeat from core client for 30 sec - exiting
13:37:00 (5796): No heartbeat from core client for 30 sec - exiting
13:37:01 (5796): No heartbeat from core client for 30 sec - exiting
13:37:02 (5796): No heartbeat from core client for 30 sec - exiting
13:37:03 (5796): No heartbeat from core client for 30 sec - exiting
13:37:04 (5796): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
CPDN Monitor - Quit request from BOINC...
01:12:31 (6636): No heartbeat from core client for 30 sec - exiting
01:12:32 (6636): No heartbeat from core client for 30 sec - exiting
01:12:33 (6636): No heartbeat from core client for 30 sec - exiting
01:12:34 (6636): No heartbeat from core client for 30 sec - exiting
01:12:35 (6636): No heartbeat from core client for 30 sec - exiting
01:12:36 (6636): No heartbeat from core client for 30 sec - exiting
01:12:37 (6636): No heartbeat from core client for 30 sec - exiting
01:12:38 (6636): No heartbeat from core client for 30 sec - exiting
01:12:39 (6636): No heartbeat from core client for 30 sec - exiting
01:12:40 (6636): No heartbeat from core client for 30 sec - exiting
01:12:41 (6636): No heartbeat from core client for 30 sec - exiting
01:12:42 (6636): No heartbeat from core client for 30 sec - exiting
01:12:43 (6636): No heartbeat from core client for 30 sec - exiting
01:12:44 (6636): No heartbeat from core client for 30 sec - exiting
01:12:45 (6636): No heartbeat from core client for 30 sec - exiting
01:12:46 (6636): No heartbeat from core client for 30 sec - exiting
01:12:47 (6636): No heartbeat from core client for 30 sec - exiting
01:12:48 (6636): No heartbeat from core client for 30 sec - exiting
01:12:49 (6636): No heartbeat from core client for 30 sec - exiting
01:12:50 (6636): No heartbeat from core client for 30 sec - exiting
01:12:51 (6636): No heartbeat from core client for 30 sec - exiting
01:12:52 (6636): No heartbeat from core client for 30 sec - exiting
01:12:53 (6636): No heartbeat from core client for 30 sec - exiting
01:12:54 (6636): No heartbeat from core client for 30 sec - exiting
01:12:55 (6636): No heartbeat from core client for 30 sec - exiting
01:12:56 (6636): No heartbeat from core client for 30 sec - exiting
01:12:57 (6636): No heartbeat from core client for 30 sec - exiting
01:12:58 (6636): No heartbeat from core client for 30 sec - exiting
01:12:59 (6636): No heartbeat from core client for 30 sec - exiting
01:13:00 (6636): No heartbeat from core client for 30 sec - exiting
01:13:01 (6636): No heartbeat from core client for 30 sec - exiting
01:13:02 (6636): No heartbeat from core client for 30 sec - exiting
01:13:03 (6636): No heartbeat from core client for 30 sec - exiting
01:13:04 (6636): No heartbeat from core client for 30 sec - exiting
01:13:05 (6636): No heartbeat from core client for 30 sec - exiting
01:13:06 (6636): No heartbeat from core client for 30 sec - exiting
01:13:07 (6636): No heartbeat from core client for 30 sec - exiting
01:13:08 (6636): No heartbeat from core client for 30 sec - exiting
01:13:09 (6636): No heartbeat from core client for 30 sec - exiting
01:13:10 (6636): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
03:35:22 (1424): No heartbeat from core client for 30 sec - exiting
03:35:23 (1424): No heartbeat from core client for 30 sec - exiting
03:35:24 (1424): No heartbeat from core client for 30 sec - exiting
03:35:25 (1424): No heartbeat from core client for 30 sec - exiting
03:35:26 (1424): No heartbeat from core client for 30 sec - exiting
03:35:27 (1424): No heartbeat from core client for 30 sec - exiting
03:35:28 (1424): No heartbeat from core client for 30 sec - exiting
03:35:29 (1424): No heartbeat from core client for 30 sec - exiting
03:35:30 (1424): No heartbeat from core client for 30 sec - exiting
03:35:31 (1424): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
09:47:10 (1256): No heartbeat from core client for 30 sec - exiting
09:47:11 (1256): No heartbeat from core client for 30 sec - exiting
09:47:12 (1256): No heartbeat from core client for 30 sec - exiting
09:47:13 (1256): No heartbeat from core client for 30 sec - exiting
09:47:14 (1256): No heartbeat from core client for 30 sec - exiting
09:47:15 (1256): No heartbeat from core client for 30 sec - exiting
09:47:16 (1256): No heartbeat from core client for 30 sec - exiting
09:47:17 (1256): No heartbeat from core client for 30 sec - exiting
09:47:18 (1256): No heartbeat from core client for 30 sec - exiting
09:47:19 (1256): No heartbeat from core client for 30 sec - exiting
09:47:20 (1256): No heartbeat from core client for 30 sec - exiting
09:47:21 (1256): No heartbeat from core client for 30 sec - exiting
09:47:22 (1256): No heartbeat from core client for 30 sec - exiting
09:47:23 (1256): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
13:18:32 (6744): No heartbeat from core client for 30 sec - exiting
CPDN Monitor - No 'heartbeat' from BOINC...
Called boinc_finish

</stderr_txt>
]]>
Latest Trickles Received
Time Sent (UTC) Host ID Result ID Result Name Timestep CPU Time (sec) Average (sec/TS)
01 Mar 2012 06:23:23 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 362,880 679,727 1.8731
29 Feb 2012 14:44:00 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 336,960 633,804 1.8809
28 Feb 2012 19:12:20 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 311,040 586,480 1.8855
28 Feb 2012 00:51:33 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 285,120 540,767 1.8966
26 Feb 2012 22:55:48 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 259,200 492,113 1.8986
25 Feb 2012 22:00:23 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 233,280 442,353 1.8962
25 Feb 2012 04:15:42 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 207,360 393,138 1.8959
24 Feb 2012 03:22:39 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 181,440 341,754 1.8836
23 Feb 2012 10:11:29 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 155,520 288,650 1.8560
22 Feb 2012 06:20:11 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 129,600 236,329 1.8235
21 Feb 2012 13:59:20 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 103,680 187,432 1.8078
20 Feb 2012 20:02:25 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 77,760 138,293 1.7785
18 Feb 2012 17:54:30 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 51,840 93,030 1.7946
18 Feb 2012 00:01:51 1038505 14102676 hadcm3n_yl7j_1980_40_007752993_1 25,920 46,797 1.8054


©2024 climateprediction.net