Name | hadcm3n_u2bz_2020_40_008339756_0 |
Workunit | 8490617 |
Created | 28 Mar 2013, 19:07:25 UTC |
Sent | 28 Mar 2013, 20:24:01 UTC |
Report deadline | 28 Jun 2013, 3:51:12 UTC |
Received | 26 May 2013, 11:23:16 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1164908 |
Run time | 14 days 23 hours 18 min 19 sec |
CPU time | 14 days 14 hours 26 min 57 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> 10:06:38 (6164): No heartbeat from core client for 30 sec - exiting 10:06:39 (6164): No heartbeat from core client for 30 sec - exiting 10:06:40 (6164): No heartbeat from core client for 30 sec - exiting 10:06:41 (6164): No heartbeat from core client for 30 sec - exiting 10:06:42 (6164): No heartbeat from core client for 30 sec - exiting 10:06:43 (6164): No heartbeat from core client for 30 sec - exiting 10:06:44 (6164): No heartbeat from core client for 30 sec - exiting 10:06:45 (6164): No heartbeat from core client for 30 sec - exiting 10:06:46 (6164): No heartbeat from core client for 30 sec - exiting 10:06:47 (6164): No heartbeat from core client for 30 sec - exiting 10:06:48 (6164): No heartbeat from core client for 30 sec - exiting 10:06:49 (6164): No heartbeat from core client for 30 sec - exiting 10:06:50 (6164): No heartbeat from core client for 30 sec - exiting 10:06:51 (6164): No heartbeat from core client for 30 sec - exiting 10:06:52 (6164): No heartbeat from core client for 30 sec - exiting 10:06:53 (6164): No heartbeat from core client for 30 sec - exiting 10:06:54 (6164): No heartbeat from core client for 30 sec - exiting 10:06:55 (6164): No heartbeat from core client for 30 sec - exiting 10:06:56 (6164): No heartbeat from core client for 30 sec - exiting 10:06:57 (6164): No heartbeat from core client for 30 sec - exiting 10:06:58 (6164): No heartbeat from core client for 30 sec - exiting 10:06:59 (6164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:00:18 (7420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:02 (6784): No heartbeat from core client for 30 sec - exiting 11:42:03 (6784): No heartbeat from core client for 30 sec - exiting 11:42:04 (6784): No heartbeat from core client for 30 sec - exiting 11:42:05 (6784): No heartbeat from core client for 30 sec - exiting 11:42:06 (6784): No heartbeat from core client for 30 sec - exiting 11:42:07 (6784): No heartbeat from core client for 30 sec - exiting 11:42:08 (6784): No heartbeat from core client for 30 sec - exiting 11:42:09 (6784): No heartbeat from core client for 30 sec - exiting 11:42:10 (6784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3108, iMonCtr=1 Model crash detected, will try to restart... 10:57:31 (6268): No heartbeat from core client for 30 sec - exiting 10:57:32 (6268): No heartbeat from core client for 30 sec - exiting 10:57:33 (6268): No heartbeat from core client for 30 sec - exiting 10:57:34 (6268): No heartbeat from core client for 30 sec - exiting 10:57:35 (6268): No heartbeat from core client for 30 sec - exiting 10:57:36 (6268): No heartbeat from core client for 30 sec - exiting 10:57:37 (6268): No heartbeat from core client for 30 sec - exiting 10:57:38 (6268): No heartbeat from core client for 30 sec - exiting 10:57:39 (6268): No heartbeat from core client for 30 sec - exiting 10:57:40 (6268): No heartbeat from core client for 30 sec - exiting 10:57:41 (6268): No heartbeat from core client for 30 sec - exiting 10:57:42 (6268): No heartbeat from core client for 30 sec - exiting 10:57:43 (6268): No heartbeat from core client for 30 sec - exiting 10:57:44 (6268): No heartbeat from core client for 30 sec - exiting 10:57:45 (6268): No heartbeat from core client for 30 sec - exiting 10:57:46 (6268): No heartbeat from core client for 30 sec - exiting 10:57:47 (6268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C10:30:55 (4944): No heartbeat from core client for 30 sec - exiting 10:30:57 (4944): No heartbeat from core client for 30 sec - exiting 10:30:58 (4944): No heartbeat from core client for 30 sec - exiting 10:30:59 (4944): No heartbeat from core client for 30 sec - exiting 10:31:00 (4944): No heartbeat from core client for 30 sec - exiting 10:31:01 (4944): No heartbeat from core client for 30 sec - exiting 10:31:02 (4944): No heartbeat from core client for 30 sec - exiting 10:31:03 (4944): No heartbeat from core client for 30 sec - exiting 10:31:04 (4944): No heartbeat from core client for 30 sec - exiting 10:31:05 (4944): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:52:53 (8488): No heartbeat from core client for 30 sec - exiting 18:52:54 (8488): No heartbeat from core client for 30 sec - exiting 18:52:55 (8488): No heartbeat from core client for 30 sec - exiting 18:52:56 (8488): No heartbeat from core client for 30 sec - exiting 18:52:57 (8488): No heartbeat from core client for 30 sec - exiting 18:52:58 (8488): No heartbeat from core client for 30 sec - exiting 18:52:59 (8488): No heartbeat from core client for 30 sec - exiting 18:53:00 (8488): No heartbeat from core client for 30 sec - exiting 18:53:01 (8488): No heartbeat from core client for 30 sec - exiting 18:53:02 (8488): No heartbeat from core client for 30 sec - exiting 18:53:03 (8488): No heartbeat from core client for 30 sec - exiting 18:53:04 (8488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:12:26 (4532): No heartbeat from core client for 30 sec - exiting 11:12:28 (4532): No heartbeat from core client for 30 sec - exiting 11:12:29 (4532): No heartbeat from core client for 30 sec - exiting 11:12:30 (4532): No heartbeat from core client for 30 sec - exiting 11:12:31 (4532): No heartbeat from core client for 30 sec - exiting 11:12:32 (4532): No heartbeat from core client for 30 sec - exiting 11:12:33 (4532): No heartbeat from core client for 30 sec - exiting 11:12:34 (4532): No heartbeat from core client for 30 sec - exiting 11:12:35 (4532): No heartbeat from core client for 30 sec - exiting 11:12:36 (4532): No heartbeat from core client for 30 sec - exiting 11:12:37 (4532): No heartbeat from core client for 30 sec - exiting 11:12:38 (4532): No heartbeat from core client for 30 sec - exiting 11:12:39 (4532): No heartbeat from core client for 30 sec - exiting 11:12:40 (4532): No heartbeat from core client for 30 sec - exiting 11:12:41 (4532): No heartbeat from core client for 30 sec - exiting 11:12:42 (4532): No heartbeat from core client for 30 sec - exiting 11:12:43 (4532): No heartbeat from core client for 30 sec - exiting 11:12:44 (4532): No heartbeat from core client for 30 sec - exiting 11:12:45 (4532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:43:54 (6204): No heartbeat from core client for 30 sec - exiting 13:43:55 (6204): No heartbeat from core client for 30 sec - exiting 13:43:56 (6204): No heartbeat from core client for 30 sec - exiting 13:43:57 (6204): No heartbeat from core client for 30 sec - exiting 13:43:58 (6204): No heartbeat from core client for 30 sec - exiting 13:43:59 (6204): No heartbeat from core client for 30 sec - exiting 13:44:00 (6204): No heartbeat from core client for 30 sec - exiting 13:44:01 (6204): No heartbeat from core client for 30 sec - exiting 13:44:02 (6204): No heartbeat from core client for 30 sec - exiting 13:44:03 (6204): No heartbeat from core client for 30 sec - exiting 13:44:04 (6204): No heartbeat from core client for 30 sec - exiting 13:44:05 (6204): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:09:12 (5428): No heartbeat from core client for 30 sec - exiting 15:09:13 (5428): No heartbeat from core client for 30 sec - exiting 15:09:14 (5428): No heartbeat from core client for 30 sec - exiting 15:09:15 (5428): No heartbeat from core client for 30 sec - exiting 15:09:16 (5428): No heartbeat from core client for 30 sec - exiting 15:09:17 (5428): No heartbeat from core client for 30 sec - exiting 15:09:18 (5428): No heartbeat from core client for 30 sec - exiting 15:09:19 (5428): No heartbeat from core client for 30 sec - exiting 15:09:20 (5428): No heartbeat from core client for 30 sec - exiting 15:09:21 (5428): No heartbeat from core client for 30 sec - exiting 15:09:22 (5428): No heartbeat from core client for 30 sec - exiting 15:09:23 (5428): No heartbeat from core client for 30 sec - exiting 15:09:24 (5428): No heartbeat from core client for 30 sec - exiting 15:09:25 (5428): No heartbeat from core client for 30 sec - exiting 15:09:26 (5428): No heartbeat from core client for 30 sec - exiting 15:09:27 (5428): No heartbeat from core client for 30 sec - exiting 15:09:28 (5428): No heartbeat from core client for 30 sec - exiting 15:09:29 (5428): No heartbeat from core client for 30 sec - exiting 15:09:30 (5428): No heartbeat from core client for 30 sec - exiting 15:09:32 (5428): No heartbeat from core client for 30 sec - exiting 15:09:33 (5428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5836, iMonCtr=1 Model crash detected, will try to restart... 15:26:29 (1920): No heartbeat from core client for 30 sec - exiting 15:26:30 (1920): No heartbeat from core client for 30 sec - exiting 15:26:31 (1920): No heartbeat from core client for 30 sec - exiting 15:26:32 (1920): No heartbeat from core client for 30 sec - exiting 15:26:33 (1920): No heartbeat from core client for 30 sec - exiting 15:26:34 (1920): No heartbeat from core client for 30 sec - exiting 15:26:35 (1920): No heartbeat from core client for 30 sec - exiting 15:26:36 (1920): No heartbeat from core client for 30 sec - exiting 15:26:37 (1920): No heartbeat from core client for 30 sec - exiting 15:26:38 (1920): No heartbeat from core client for 30 sec - exiting 15:26:39 (1920): No heartbeat from core client for 30 sec - exiting 15:26:40 (1920): No heartbeat from core client for 30 sec - exiting 15:26:41 (1920): No heartbeat from core client for 30 sec - exiting 15:26:42 (1920): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7912, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 15:21:10 (8124): No heartbeat from core client for 30 sec - exiting 15:21:11 (8124): No heartbeat from core client for 30 sec - exiting 15:21:12 (8124): No heartbeat from core client for 30 sec - exiting 15:21:13 (8124): No heartbeat from core client for 30 sec - exiting 15:21:14 (8124): No heartbeat from core client for 30 sec - exiting 15:21:15 (8124): No heartbeat from core client for 30 sec - exiting 15:21:16 (8124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3320, iMonCtr=1 Model crash detected, will try to restart... 15:24:34 (5300): No heartbeat from core client for 30 sec - exiting 15:24:35 (5300): No heartbeat from core client for 30 sec - exiting 15:24:36 (5300): No heartbeat from core client for 30 sec - exiting 15:24:37 (5300): No heartbeat from core client for 30 sec - exiting 15:24:38 (5300): No heartbeat from core client for 30 sec - exiting 15:24:39 (5300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:29:32 (5824): No heartbeat from core client for 30 sec - exiting 09:29:33 (5824): No heartbeat from core client for 30 sec - exiting 09:29:34 (5824): No heartbeat from core client for 30 sec - exiting 09:29:35 (5824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:05:45 (6276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:46 (7896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5860, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6356, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6264, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4496, iMonCtr=1 Model crash detected, will try to restart... 11:39:31 (6372): No heartbeat from core client for 30 sec - exiting 11:39:32 (6372): No heartbeat from core client for 30 sec - exiting 11:39:33 (6372): No heartbeat from core client for 30 sec - exiting 11:39:34 (6372): No heartbeat from core client for 30 sec - exiting 11:39:35 (6372): No heartbeat from core client for 30 sec - exiting 11:39:36 (6372): No heartbeat from core client for 30 sec - exiting 11:39:37 (6372): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:19:56 (6148): No heartbeat from core client for 30 sec - exiting 10:19:57 (6148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6496, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6496, iMonCtr=1 Model crash detected, will try to restart... 10:15:09 (6968): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5440, iMonCtr=1 Model crash detected, will try to restart... Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 May 2013 11:25:56 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 777,600 | 1,261,609 | 1.6224 |
25 May 2013 13:13:36 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 751,680 | 1,221,436 | 1.6249 |
24 May 2013 12:19:02 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 725,760 | 1,182,302 | 1.6291 |
20 May 2013 13:17:29 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 699,840 | 1,142,849 | 1.6330 |
19 May 2013 14:29:07 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 673,920 | 1,104,314 | 1.6386 |
18 May 2013 14:30:53 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 648,000 | 1,062,507 | 1.6397 |
17 May 2013 13:47:19 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 622,080 | 1,018,619 | 1.6374 |
11 May 2013 17:41:54 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 596,160 | 975,784 | 1.6368 |
10 May 2013 17:51:21 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 570,240 | 931,804 | 1.6341 |
09 May 2013 14:40:28 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 544,320 | 886,686 | 1.6290 |
05 May 2013 18:46:46 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 518,400 | 845,611 | 1.6312 |
04 May 2013 22:22:00 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 492,480 | 805,458 | 1.6355 |
04 May 2013 10:16:28 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 466,560 | 765,511 | 1.6408 |
01 May 2013 15:41:00 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 440,640 | 724,493 | 1.6442 |
30 Apr 2013 18:37:17 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 414,720 | 688,482 | 1.6601 |
28 Apr 2013 11:52:06 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 388,800 | 655,690 | 1.6864 |
27 Apr 2013 10:53:35 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 362,880 | 617,276 | 1.7010 |
26 Apr 2013 12:49:11 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 336,960 | 577,455 | 1.7137 |
21 Apr 2013 11:02:45 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 311,040 | 537,461 | 1.7279 |
20 Apr 2013 08:59:00 | 1164908 | 15690371 | hadcm3n_u2bz_2020_40_008339756_0 | 285,120 | 495,949 | 1.7394 |
©2024 climateprediction.net