Name | hadcm3n_yak3_1900_40_007520769_3 |
Workunit | 7718244 |
Created | 3 Nov 2011, 11:54:23 UTC |
Sent | 3 Nov 2011, 14:56:28 UTC |
Report deadline | 2 Feb 2012, 22:23:39 UTC |
Received | 3 Dec 2011, 21:14:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1170512 |
Run time | 26 days 5 hours 12 min 6 sec |
CPU time | 22 days 5 hours 47 min 5 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.70 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.12.34</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CreateFile error 32 when trying set file time CPDN Monitor - Quit request from BOINC... 17:04:29 (20512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:05:24 (20512): No heartbeat from core client for 30 sec - exiting 17:05:25 (20512): No heartbeat from core client for 30 sec - exiting 17:05:27 (20512): No heartbeat from core client for 30 sec - exiting 17:05:28 (20512): No heartbeat from core client for 30 sec - exiting 17:05:29 (20512): No heartbeat from core client for 30 sec - exiting 17:05:30 (20512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 10:22:41 (24160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:43 (24160): No heartbeat from core client for 30 sec - exiting 10:22:44 (24160): No heartbeat from core client for 30 sec - exiting 10:22:45 (24160): No heartbeat from core client for 30 sec - exiting 11:47:18 (38836): No heartbeat from core client for 30 sec - exiting 11:47:19 (38836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:47:20 (38836): No heartbeat from core client for 30 sec - exiting 11:47:21 (38836): No heartbeat from core client for 30 sec - exiting 11:47:22 (38836): No heartbeat from core client for 30 sec - exiting 11:49:56 (37292): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:49:57 (37292): No heartbeat from core client for 30 sec - exiting 11:49:58 (37292): No heartbeat from core client for 30 sec - exiting 11:49:59 (37292): No heartbeat from core client for 30 sec - exiting 11:50:00 (37292): No heartbeat from core client for 30 sec - exiting 11:50:01 (37292): No heartbeat from core client for 30 sec - exiting 11:50:02 (37292): No heartbeat from core client for 30 sec - exiting 11:50:03 (37292): No heartbeat from core client for 30 sec - exiting 11:50:04 (37292): No heartbeat from core client for 30 sec - exiting 11:50:05 (37292): No heartbeat from core client for 30 sec - exiting 11:50:06 (37292): No heartbeat from core client for 30 sec - exiting 11:58:17 (38708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:58:18 (38708): No heartbeat from core client for 30 sec - exiting 12:23:31 (36132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:23:41 (37788): Can't acquire lockfile (32) - waiting 35s 12:36:45 (37788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:41:26 (35124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:59:11 (38552): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:59:12 (38552): No heartbeat from core client for 30 sec - exiting 12:59:13 (38552): No heartbeat from core client for 30 sec - exiting 13:10:29 (37752): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:10:30 (37752): No heartbeat from core client for 30 sec - exiting 13:34:56 (37904): No heartbeat from core client for 30 sec - exiting 13:34:58 (37904): No heartbeat from core client for 30 sec - exiting 13:35:02 (37904): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:53:54 (35188): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:54:25 (35188): No heartbeat from core client for 30 sec - exiting 22:14:37 (39152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:14:39 (39152): No heartbeat from core client for 30 sec - exiting 22:14:40 (39152): No heartbeat from core client for 30 sec - exiting 22:14:41 (39152): No heartbeat from core client for 30 sec - exiting 22:14:42 (39152): No heartbeat from core client for 30 sec - exiting 22:14:43 (39152): No heartbeat from core client for 30 sec - exiting 22:14:44 (39152): No heartbeat from core client for 30 sec - exiting 22:14:46 (39152): No heartbeat from core client for 30 sec - exiting 22:14:47 (39152): No heartbeat from core client for 30 sec - exiting 22:14:49 (39152): No heartbeat from core client for 30 sec - exiting 22:14:50 (39152): No heartbeat from core client for 30 sec - exiting 22:14:51 (39152): No heartbeat from core client for 30 sec - exiting 22:18:13 (37260): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:36:09 (38524): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:51:57 (3272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:52:05 (3272): No heartbeat from core client for 30 sec - exiting 01:52:06 (3272): No heartbeat from core client for 30 sec - exiting 01:52:07 (3272): No heartbeat from core client for 30 sec - exiting 01:52:09 (3272): No heartbeat from core client for 30 sec - exiting 01:52:10 (3272): No heartbeat from core client for 30 sec - exiting 01:52:11 (3272): No heartbeat from core client for 30 sec - exiting 01:52:12 (3272): No heartbeat from core client for 30 sec - exiting 01:52:13 (3272): No heartbeat from core client for 30 sec - exiting 01:56:28 (12880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:57:53 (13088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:58:59 (10452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:59:56 (12672): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3200, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3200, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2768, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2768, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3560, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3696, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3596, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
03 Dec 2011 21:06:43 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 1,036,800 | 1,921,616 | 1.8534 |
03 Dec 2011 06:10:40 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 1,010,880 | 1,872,007 | 1.8519 |
02 Dec 2011 13:28:14 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 984,960 | 1,822,357 | 1.8502 |
01 Dec 2011 14:42:57 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 959,040 | 1,772,770 | 1.8485 |
30 Nov 2011 20:56:00 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 933,120 | 1,721,489 | 1.8449 |
30 Nov 2011 03:24:52 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 907,200 | 1,670,693 | 1.8416 |
29 Nov 2011 10:17:06 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 881,280 | 1,620,203 | 1.8385 |
28 Nov 2011 18:23:52 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 855,360 | 1,570,621 | 1.8362 |
28 Nov 2011 04:44:25 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 829,440 | 1,522,335 | 1.8354 |
27 Nov 2011 09:10:49 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 803,520 | 1,475,246 | 1.8360 |
26 Nov 2011 11:55:22 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 777,600 | 1,426,745 | 1.8348 |
25 Nov 2011 14:12:20 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 751,680 | 1,378,227 | 1.8335 |
24 Nov 2011 22:53:08 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 725,760 | 1,328,882 | 1.8310 |
24 Nov 2011 07:56:22 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 699,840 | 1,279,383 | 1.8281 |
23 Nov 2011 16:14:05 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 673,920 | 1,229,687 | 1.8247 |
23 Nov 2011 00:55:40 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 648,000 | 1,180,376 | 1.8216 |
22 Nov 2011 07:08:20 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 622,080 | 1,130,105 | 1.8167 |
21 Nov 2011 14:29:58 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 596,160 | 1,079,853 | 1.8113 |
20 Nov 2011 22:45:56 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 570,240 | 1,030,382 | 1.8069 |
19 Nov 2011 07:48:23 | 1170512 | 13590050 | hadcm3n_yak3_1900_40_007520769_3 | 544,320 | 981,957 | 1.8040 |
©2024 climateprediction.net