Name | hadcm3n_t0vi_1940_40_007442790_0 |
Workunit | 7640293 |
Created | 8 Sep 2011, 22:14:39 UTC |
Sent | 8 Sep 2011, 22:15:56 UTC |
Report deadline | 9 Dec 2011, 5:43:07 UTC |
Received | 7 Oct 2011, 22:03:18 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1161345 |
Run time | 13 days 16 hours 47 min 22 sec |
CPU time | 13 days 9 hours 19 min 38 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.65 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 10:06:53 (3584): No heartbeat from core client for 30 sec - exiting 10:06:54 (3584): No heartbeat from core client for 30 sec - exiting 10:06:55 (3584): No heartbeat from core client for 30 sec - exiting 10:06:56 (3584): No heartbeat from core client for 30 sec - exiting 10:06:57 (3584): No heartbeat from core client for 30 sec - exiting 10:06:58 (3584): No heartbeat from core client for 30 sec - exiting 10:06:59 (3584): No heartbeat from core client for 30 sec - exiting 10:07:00 (3584): No heartbeat from core client for 30 sec - exiting 10:07:01 (3584): No heartbeat from core client for 30 sec - exiting 10:07:03 (3584): No heartbeat from core client for 30 sec - exiting 10:07:04 (3584): No heartbeat from core client for 30 sec - exiting 10:07:05 (3584): No heartbeat from core client for 30 sec - exiting 10:07:06 (3584): No heartbeat from core client for 30 sec - exiting 10:07:07 (3584): No heartbeat from core client for 30 sec - exiting 10:07:08 (3584): No heartbeat from core client for 30 sec - exiting 10:07:09 (3584): No heartbeat from core client for 30 sec - exiting 10:07:10 (3584): No heartbeat from core client for 30 sec - exiting 10:07:11 (3584): No heartbeat from core client for 30 sec - exiting 10:07:12 (3584): No heartbeat from core client for 30 sec - exiting 10:07:13 (3584): No heartbeat from core client for 30 sec - exiting 10:07:15 (3584): No heartbeat from core client for 30 sec - exiting 10:07:16 (3584): No heartbeat from core client for 30 sec - exiting 10:07:17 (3584): No heartbeat from core client for 30 sec - exiting 10:07:18 (3584): No heartbeat from core client for 30 sec - exiting 10:07:19 (3584): No heartbeat from core client for 30 sec - exiting 10:07:20 (3584): No heartbeat from core client for 30 sec - exiting 10:07:21 (3584): No heartbeat from core client for 30 sec - 
exiting 10:07:22 (3584): No heartbeat from core client for 30 sec - exiting 10:07:23 (3584): No heartbeat from core client for 30 sec - exiting 10:07:24 (3584): No heartbeat from core client for 30 sec - exiting 10:07:25 (3584): No heartbeat from core client for 30 sec - exiting 10:07:27 (3584): No heartbeat from core client for 30 sec - exiting 10:07:28 (3584): No heartbeat from core client for 30 sec - exiting 10:07:29 (3584): No heartbeat from core client for 30 sec - exiting 10:07:30 (3584): No heartbeat from core client for 30 sec - exiting 10:07:31 (3584): No heartbeat from core client for 30 sec - exiting 10:07:32 (3584): No heartbeat from core client for 30 sec - exiting 10:07:33 (3584): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2088, iMonCtr=1 Model crash detected, will try to restart... CController:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4488, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3692, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4828, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=516, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=736, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 11:58:12 (3868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4232, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3704, iMonCtr=1 Model crash detected, will try to restart... 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3704, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4912, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
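The stderr text above is long and repetitive, so a tally of its message types can make it easier to read. The following is a minimal sketch, not part of the CPDN or BOINC tooling: the function name `summarize_stderr` and the labels in `PATTERNS` are hypothetical, and the matched fragments are copied from the log above. The `sample` string is a shortened excerpt used only for illustration.

```python
import re

# Message fragments copied from the stderr text above.
# The dictionary keys are illustrative labels, not BOINC terminology.
PATTERNS = {
    "no_heartbeat": r"No heartbeat from core client",
    "quit_request": r"Quit request from BOINC",
    "suspend_request": r"Suspend request from BOINC",
    "model_crash": r"Model crash detected",
}


def summarize_stderr(text):
    """Count each known message type and extract the signal number, if any."""
    counts = {label: len(re.findall(pattern, text)) for label, pattern in PATTERNS.items()}
    match = re.search(r"Signal (\d+) received", text)
    signal = int(match.group(1)) if match else None
    return counts, signal


# Shortened excerpt of the log above, for illustration only.
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "10:06:53 (3584): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "Signal 11 received, exiting... Called boinc_finish"
)

counts, signal = summarize_stderr(sample)
print(counts, signal)
```

Running the same function over the full stderr text would show how many restarts preceded the final "Signal 11 received" line.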
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
07 Oct 2011 22:02:29 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 777,600 | 1,156,774 | 1.4876 |
07 Oct 2011 03:29:44 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 751,680 | 1,119,232 | 1.4890 |
06 Oct 2011 08:27:53 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 725,760 | 1,081,944 | 1.4908 |
04 Oct 2011 11:19:08 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 699,840 | 1,044,525 | 1.4925 |
04 Oct 2011 00:08:34 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 673,920 | 1,007,043 | 1.4943 |
03 Oct 2011 04:58:24 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 648,000 | 969,453 | 1.4961 |
02 Oct 2011 12:03:27 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 622,080 | 932,482 | 1.4990 |
02 Oct 2011 01:50:17 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 596,160 | 895,322 | 1.5018 |
01 Oct 2011 04:09:03 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 570,240 | 857,963 | 1.5046 |
30 Sep 2011 09:11:35 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 544,320 | 820,528 | 1.5074 |
29 Sep 2011 04:04:11 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 518,400 | 782,735 | 1.5099 |
27 Sep 2011 09:08:19 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 492,480 | 744,946 | 1.5126 |
26 Sep 2011 22:05:31 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 466,560 | 705,831 | 1.5128 |
26 Sep 2011 03:41:34 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 440,640 | 666,534 | 1.5126 |
25 Sep 2011 06:14:08 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 414,720 | 627,070 | 1.5120 |
24 Sep 2011 08:11:09 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 388,800 | 587,492 | 1.5110 |
23 Sep 2011 10:15:13 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 362,880 | 548,412 | 1.5113 |
22 Sep 2011 04:50:20 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 336,960 | 509,432 | 1.5118 |
21 Sep 2011 10:02:23 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 311,040 | 470,116 | 1.5114 |
20 Sep 2011 23:00:03 | 1161345 | 13347343 | hadcm3n_t0vi_1940_40_007442790_0 | 285,120 | 430,867 | 1.5112 |
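The Average (sec/TS) column appears to be the CPU Time (sec) value divided by the Timestep value, rounded to four decimal places. This is inferred from the figures in the table, not stated by the page. A minimal sketch that checks this against three rows copied from the table (the helper names `parse_int` and `sec_per_timestep` are hypothetical):

```python
def parse_int(field):
    """Convert a comma-grouped table field such as '777,600' to an int."""
    return int(field.replace(",", ""))


def sec_per_timestep(cpu_time_sec, timestep):
    """Average CPU seconds per model timestep, rounded as in the table."""
    return round(cpu_time_sec / timestep, 4)


# (Timestep, CPU Time (sec), Average (sec/TS)) copied from the table above.
rows = [
    ("777,600", "1,156,774", 1.4876),
    ("751,680", "1,119,232", 1.4890),
    ("285,120", "430,867", 1.5112),
]

for timestep, cpu_time, reported in rows:
    computed = sec_per_timestep(parse_int(cpu_time), parse_int(timestep))
    print(timestep, cpu_time, computed, computed == reported)
```

For these three rows the computed value matches the reported average.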