Name | hadcm3n_zl69_1960_40_008393601_4 |
Workunit | 8544460 |
Created | 9 Oct 2013, 20:08:21 UTC |
Sent | 9 Oct 2013, 20:34:04 UTC |
Report deadline | 9 Jan 2014, 4:01:15 UTC |
Received | 13 Nov 2013, 6:36:17 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1310502 |
Run time | 19 days 18 hours 21 min 42 sec |
CPU time | 17 days 14 hours 48 min 16 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.60 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=17712, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:36:45 (5628): No heartbeat from core client for 30 sec - exiting 18:36:46 (5628): No heartbeat from core client for 30 sec - exiting 18:36:47 (5628): No heartbeat from core client for 30 sec - exiting 18:36:48 (5628): No heartbeat from core client for 30 sec - exiting 18:36:49 (5628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2812, iMonCtr=1 Model crash detected, will try to restart... 00:47:53 (4336): No heartbeat from core client for 30 sec - exiting 00:47:54 (4336): No heartbeat from core client for 30 sec - exiting 00:47:55 (4336): No heartbeat from core client for 30 sec - exiting 00:47:56 (4336): No heartbeat from core client for 30 sec - exiting 00:47:58 (4336): No heartbeat from core client for 30 sec - exiting 00:47:59 (4336): No heartbeat from core client for 30 sec - exiting 00:48:00 (4336): No heartbeat from core client for 30 sec - exiting 00:48:01 (4336): No heartbeat from core client for 30 sec - exiting 00:48:02 (4336): No heartbeat from core client for 30 sec - exiting 00:48:03 (4336): No heartbeat from core client for 30 sec - exiting 00:48:04 (4336): No heartbeat from core client for 30 sec - exiting 00:48:06 (4336): No heartbeat from core client for 30 sec - exiting 00:48:07 (4336): No heartbeat from core client for 30 sec - exiting 00:48:08 (4336): No heartbeat from core client for 30 sec - exiting 00:48:09 (4336): No heartbeat from core client for 30 sec - exiting 00:48:10 (4336): No heartbeat from core client for 30 sec - exiting 00:48:11 (4336): No heartbeat from core client for 30 sec - exiting 00:48:12 (4336): No heartbeat from core client for 30 sec - exiting 00:48:13 (4336): No heartbeat from core client for 30 sec - exiting 00:48:14 (4336): No heartbeat from core client for 30 sec - exiting 00:48:16 (4336): No heartbeat from core client for 30 sec - exiting 00:48:17 (4336): No heartbeat from core client for 30 sec - exiting 00:48:18 (4336): No heartbeat from core client for 30 sec - exiting 00:48:19 (4336): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5744, iMonCtr=1 Model crash detected, will try to restart... 18:07:55 (4800): No heartbeat from core client for 30 sec - exiting 18:07:56 (4800): No heartbeat from core client for 30 sec - exiting 18:07:57 (4800): No heartbeat from core client for 30 sec - exiting 18:07:59 (4800): No heartbeat from core client for 30 sec - exiting 18:08:00 (4800): No heartbeat from core client for 30 sec - exiting 18:08:01 (4800): No heartbeat from core client for 30 sec - exiting 18:08:02 (4800): No heartbeat from core client for 30 sec - exiting 18:08:03 (4800): No heartbeat from core client for 30 sec - exiting 18:08:04 (4800): No heartbeat from core client for 30 sec - exiting 18:08:05 (4800): No heartbeat from core client for 30 sec - exiting 18:08:06 (4800): No heartbeat from core client for 30 sec - exiting 18:08:07 (4800): No heartbeat from core client for 30 sec - exiting 18:08:08 (4800): No heartbeat from core client for 30 sec - exiting 18:08:09 (4800): No heartbeat from core client for 30 sec - exiting 18:08:11 (4800): No heartbeat from core client for 30 sec - exiting 18:08:12 (4800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:08:13 (4800): No heartbeat from core client for 30 sec - exiting 01:54:31 (3232): No heartbeat from core client for 30 sec - exiting 01:54:32 (3232): No heartbeat from core client for 30 sec - exiting 01:54:33 (3232): No heartbeat from core client for 30 sec - exiting 01:54:35 (3232): No heartbeat from core client for 30 sec - exiting 01:54:36 (3232): No heartbeat from core client for 30 sec - exiting 01:54:37 (3232): No heartbeat from core client for 30 sec - exiting 01:54:38 (3232): No heartbeat from core client for 30 sec - exiting 01:54:39 (3232): No heartbeat from core client for 30 sec - exiting 01:54:40 (3232): No heartbeat from core client for 30 sec - exiting 01:54:41 (3232): No heartbeat from core client for 30 sec - exiting 01:54:42 (3232): No heartbeat from core client for 30 sec - exiting 01:54:43 (3232): No heartbeat from core client for 30 sec - exiting 01:54:44 (3232): No heartbeat from core client for 30 sec - exiting 01:54:45 (3232): No heartbeat from core client for 30 sec - exiting 01:54:47 (3232): No heartbeat from core client for 30 sec - exiting 01:54:48 (3232): No heartbeat from core client for 30 sec - exiting 01:54:49 (3232): No heartbeat from core client for 30 sec - exiting 01:54:50 (3232): No heartbeat from core client for 30 sec - exiting 01:54:51 (3232): No heartbeat from core client for 30 sec - exiting 01:54:55 (3232): No heartbeat from core client for 30 sec - exiting 01:54:56 (3232): No heartbeat from core client for 30 sec - exiting 01:54:58 (3232): No heartbeat from core client for 30 sec - exiting 01:54:59 (3232): No heartbeat from core client for 30 sec - exiting 01:55:00 (3232): No heartbeat from core client for 30 sec - exiting 01:55:01 (3232): No heartbeat from core client for 30 sec - exiting 01:55:02 (3232): No heartbeat from core client for 30 sec - exiting 01:55:03 (3232): No heartbeat from core client for 30 sec - exiting 01:55:04 (3232): No heartbeat from core client for 30 sec - exiting 01:55:06 (3232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3280, iMonCtr=1 Model crash detected, will try to restart... 09:35:16 (5448): No heartbeat from core client for 30 sec - exiting 09:35:18 (5448): No heartbeat from core client for 30 sec - exiting 09:35:19 (5448): No heartbeat from core client for 30 sec - exiting 09:35:20 (5448): No heartbeat from core client for 30 sec - exiting 09:35:21 (5448): No heartbeat from core client for 30 sec - exiting 09:35:22 (5448): No heartbeat from core client for 30 sec - exiting 09:35:23 (5448): No heartbeat from core client for 30 sec - exiting 09:35:24 (5448): No heartbeat from core client for 30 sec - exiting 09:35:25 (5448): No heartbeat from core client for 30 sec - exiting 09:35:26 (5448): No heartbeat from core client for 30 sec - exiting 09:35:27 (5448): No heartbeat from core client for 30 sec - exiting 09:35:29 (5448): No heartbeat from core client for 30 sec - exiting 09:35:30 (5448): No heartbeat from core client for 30 sec - exiting 09:35:31 (5448): No heartbeat from core client for 30 sec - exiting 09:35:32 (5448): No heartbeat from core client for 30 sec - exiting 09:35:33 (5448): No heartbeat from core client for 30 sec - exiting 09:35:34 (5448): No heartbeat from core client for 30 sec - exiting 09:35:35 (5448): No heartbeat from core client for 30 sec - exiting 09:35:36 (5448): No heartbeat from core client for 30 sec - exiting 09:35:37 (5448): No heartbeat from core client for 30 sec - exiting 09:35:38 (5448): No heartbeat from core client for 30 sec - exiting 09:35:40 (5448): No heartbeat from core client for 30 sec - exiting 09:35:41 (5448): No heartbeat from core client for 30 sec - exiting 09:35:42 (5448): No heartbeat from core client for 30 sec - exiting 09:35:43 (5448): No heartbeat from core client for 30 sec - exiting 09:35:44 (5448): No heartbeat from core client for 30 sec - exiting 09:35:45 (5448): No heartbeat from core client for 30 sec - exiting 09:35:46 (5448): No heartbeat from core client for 30 sec - exiting 09:35:47 (5448): No heartbeat from core client for 30 sec - exiting 09:35:48 (5448): No heartbeat from core client for 30 sec - exiting 09:35:49 (5448): No heartbeat from core client for 30 sec - exiting 09:35:51 (5448): No heartbeat from core client for 30 sec - exiting 09:35:52 (5448): No heartbeat from core client for 30 sec - exiting 09:35:53 (5448): No heartbeat from core client for 30 sec - exiting 09:35:54 (5448): No heartbeat from core client for 30 sec - exiting 09:35:55 (5448): No heartbeat from core client for 30 sec - exiting 09:35:56 (5448): No heartbeat from core client for 30 sec - exiting 09:35:57 (5448): No heartbeat from core client for 30 sec - exiting 09:35:58 (5448): No heartbeat from core client for 30 sec - exiting 09:35:59 (5448): No heartbeat from core client for 30 sec - exiting 09:36:00 (5448): No heartbeat from core client for 30 sec - exiting 09:36:01 (5448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5700, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 16:44:09 (5700): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3496, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 01:22:46 (3496): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 14:24:52 (5604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8276, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Nov 2013 06:40:48 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 777,600 | 1,470,568 | 1.8912 |
11 Nov 2013 01:04:50 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 751,680 | 1,418,023 | 1.8865 |
09 Nov 2013 22:35:29 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 725,760 | 1,364,964 | 1.8807 |
09 Nov 2013 06:50:45 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 699,840 | 1,312,227 | 1.8750 |
07 Nov 2013 09:35:49 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 673,920 | 1,270,039 | 1.8846 |
04 Nov 2013 04:17:51 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 648,000 | 1,242,348 | 1.9172 |
03 Nov 2013 05:02:06 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 622,080 | 1,190,903 | 1.9144 |
01 Nov 2013 18:47:47 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 596,160 | 1,142,194 | 1.9159 |
01 Nov 2013 02:04:55 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 570,240 | 1,088,452 | 1.9088 |
31 Oct 2013 08:59:13 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 544,320 | 1,034,512 | 1.9006 |
30 Oct 2013 09:02:58 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 518,400 | 981,695 | 1.8937 |
28 Oct 2013 18:35:00 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 492,480 | 936,620 | 1.9018 |
26 Oct 2013 11:48:22 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 466,560 | 891,599 | 1.9110 |
25 Oct 2013 12:50:46 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 440,640 | 846,099 | 1.9202 |
24 Oct 2013 22:02:49 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 414,720 | 799,776 | 1.9285 |
23 Oct 2013 08:56:45 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 388,800 | 755,237 | 1.9425 |
22 Oct 2013 08:02:41 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 362,880 | 702,734 | 1.9365 |
20 Oct 2013 21:49:17 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 336,960 | 650,637 | 1.9309 |
20 Oct 2013 04:52:32 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 311,040 | 597,235 | 1.9201 |
19 Oct 2013 11:51:19 | 978586 | 16063410 | hadcm3n_zl69_1960_40_008393601_4 | 285,120 | 543,247 | 1.9053 |
©2024 cpdn.org