Name | hadcm3n_4ilh_1940_40_008303361_0 |
Workunit | 8454496 |
Created | 6 Feb 2013, 22:34:00 UTC |
Sent | 6 Feb 2013, 22:34:49 UTC |
Report deadline | 9 May 2013, 6:02:00 UTC |
Received | 18 Mar 2013, 16:40:32 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1187110 |
Run time | 7 days 17 hours 48 min 3 sec |
CPU time | 7 days 3 hours 29 min 4 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 3.31 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5328, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 15:31:17 (5464): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4904, iMonCtr=1 Model crash detected, will try to restart... 15:33:47 (5548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:49:52 (6824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5264, iMonCtr=1 Model crash detected, will try to restart... 19:57:21 (6896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5484, iMonCtr=1 Model crash detected, will try to restart... 21:04:07 (5888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:09:18 (5824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2176, iMonCtr=1 Model crash detected, will try to restart... 19:25:14 (5708): No heartbeat from core client for 30 sec - exiting 19:25:15 (5708): No heartbeat from core client for 30 sec - exiting 19:25:16 (5708): No heartbeat from core client for 30 sec - exiting 19:25:17 (5708): No heartbeat from core client for 30 sec - exiting 19:25:18 (5708): No heartbeat from core client for 30 sec - exiting 19:25:19 (5708): No heartbeat from core client for 30 sec - exiting 19:25:20 (5708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5668, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:15:42 (5512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6444, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5372, iMonCtr=1 Model crash detected, will try to restart... 19:46:19 (5632): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:54:46 (5384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6108, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5556, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5596, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5688, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5592, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5576, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8308, iMonCtr=1 Model crash detected, will try to restart... 20:01:26 (5304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:15:29 (4452): No heartbeat from core client for 30 sec - exiting 22:15:30 (4452): No heartbeat from core client for 30 sec - exiting 22:15:31 (4452): No heartbeat from core client for 30 sec - exiting 22:15:32 (4452): No heartbeat from core client for 30 sec - exiting 22:15:33 (4452): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6580, iMonCtr=1 Model crash detected, will try to restart... 21:18:58 (5588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:57:09 (5648): No heartbeat from core client for 30 sec - exiting 21:57:10 (5648): No heartbeat from core client for 30 sec - exiting 21:57:11 (5648): No heartbeat from core client for 30 sec - exiting 21:57:12 (5648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:16:27 (1932): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:19:17 (5532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:27:29 (5872): No heartbeat from core client for 30 sec - exiting 22:27:30 (5872): No heartbeat from core client for 30 sec - exiting 22:27:31 (5872): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:41:31 (5208): No heartbeat from core client for 30 sec - exiting 00:41:32 (5208): No heartbeat from core client for 30 sec - exiting 00:41:33 (5208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:12:49 (6356): No heartbeat from core client for 30 sec - exiting 13:12:50 (6356): No heartbeat from core client for 30 sec - exiting 13:12:51 (6356): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6136, iMonCtr=1 Model crash detected, will try to restart... 20:51:09 (5728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4332, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5904, iMonCtr=1 Model crash detected, will try to restart... 19:56:17 (6864): No heartbeat from core client for 30 sec - exiting 19:56:18 (6864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:50:12 (5676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:46:05 (5976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:12:07 (6960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=828, iMonCtr=1 Model crash detected, will try to restart... 20:16:53 (6088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5708, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 00:37:00 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:05:55 (6340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 23:27:55 (6588): No heartbeat from core client for 30 sec - exiting 23:27:56 (6588): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Mar 2013 15:40:32 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 518,400 | 617,340 | 1.1909 |
15 Mar 2013 20:52:52 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 492,480 | 586,875 | 1.1917 |
14 Mar 2013 19:57:27 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 466,560 | 555,979 | 1.1917 |
11 Mar 2013 22:03:06 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 440,640 | 524,936 | 1.1913 |
09 Mar 2013 17:00:46 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 414,720 | 494,175 | 1.1916 |
06 Mar 2013 20:29:11 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 388,800 | 463,221 | 1.1914 |
02 Mar 2013 23:58:42 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 362,880 | 432,219 | 1.1911 |
01 Mar 2013 01:15:39 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 336,960 | 400,632 | 1.1890 |
24 Feb 2013 20:00:07 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 311,040 | 368,894 | 1.1860 |
23 Feb 2013 16:00:47 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 285,120 | 337,379 | 1.1833 |
22 Feb 2013 18:47:37 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 259,200 | 306,439 | 1.1822 |
20 Feb 2013 21:06:20 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 233,280 | 275,695 | 1.1818 |
17 Feb 2013 21:40:20 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 207,360 | 244,557 | 1.1794 |
17 Feb 2013 12:27:43 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 181,440 | 214,142 | 1.1802 |
16 Feb 2013 17:46:25 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 155,520 | 183,379 | 1.1791 |
14 Feb 2013 19:42:03 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 129,600 | 152,391 | 1.1759 |
12 Feb 2013 19:57:38 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 103,680 | 121,523 | 1.1721 |
11 Feb 2013 21:00:36 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 77,760 | 91,004 | 1.1703 |
10 Feb 2013 20:00:44 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 51,840 | 60,515 | 1.1673 |
08 Feb 2013 22:29:02 | 1187110 | 15588949 | hadcm3n_4ilh_1940_40_008303361_0 | 25,920 | 29,802 | 1.1498 |
©2024 cpdn.org