Name | hadcm3n_zcwj_1880_40_008249908_2 |
Workunit | 8405032 |
Created | 22 Nov 2012, 4:09:00 UTC |
Sent | 22 Nov 2012, 4:09:06 UTC |
Report deadline | 21 Feb 2013, 11:36:17 UTC |
Received | 9 Jun 2014, 18:51:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1042736 |
Run time | 29 days 20 hours 37 min 22 sec |
CPU time | 12 days 2 hours 35 min 59 sec |
Validate state | Invalid |
Credit | 6,842.88 |
Device peak FLOPS | 2.08 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.18</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5476, iMonCtr=1 Model crash detected, will try to restart... 12:57:30 (4692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7020, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9992, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9356, iMonCtr=1 Model crash detected, will try to restart... 11:53:47 (180): No heartbeat from core client for 30 sec - exiting 11:53:48 (180): No heartbeat from core client for 30 sec - exiting 11:53:49 (180): No heartbeat from core client for 30 sec - exiting 11:53:50 (180): No heartbeat from core client for 30 sec - exiting 11:53:51 (180): No heartbeat from core client for 30 sec - exiting 11:53:52 (180): No heartbeat from core client for 30 sec - exiting 11:53:53 (180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:38:32 (88428): No heartbeat from core client for 30 sec - exiting 14:38:33 (88428): No heartbeat from core client for 30 sec - exiting 14:38:34 (88428): No heartbeat from core client for 30 sec - exiting 14:38:35 (88428): No heartbeat from core client for 30 sec - exiting 14:38:36 (88428): No heartbeat from core client for 30 sec - exiting 14:38:37 (88428): No heartbeat from core client for 30 sec - exiting 14:38:38 (88428): No heartbeat from core client for 30 sec - exiting 14:38:40 (88428): No heartbeat from core client for 30 sec - exiting 14:38:41 (88428): No heartbeat from core client for 30 sec - exiting 14:38:42 (88428): No heartbeat from core client for 30 sec - exiting 14:38:43 (88428): No heartbeat from core client for 30 sec - exiting 14:38:44 (88428): No heartbeat from core client for 30 sec - exiting 14:38:45 (88428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:15:45 (6096): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:15:46 (6096): No heartbeat from core client for 30 sec - exiting 10:15:47 (6096): No heartbeat from core client for 30 sec - exiting 10:15:48 (6096): No heartbeat from core client for 30 sec - exiting 10:15:49 (6096): No heartbeat from core client for 30 sec - exiting 10:15:50 (6096): No heartbeat from core client for 30 sec - exiting 03:22:54 (22604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:22:55 (22604): No heartbeat from core client for 30 sec - exiting 03:22:56 (22604): No heartbeat from core client for 30 sec - exiting 03:22:57 (22604): No heartbeat from core client for 30 sec - exiting 09:47:24 (74008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:47:25 (74008): No heartbeat from core client for 30 sec - exiting 09:47:26 (74008): No heartbeat from core client for 30 sec - exiting 09:47:27 (74008): No heartbeat from core client for 30 sec - exiting 09:47:28 (74008): No heartbeat from core client for 30 sec - exiting 09:47:29 (74008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 16:17:56 (536388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:17:57 (536388): No heartbeat from core client for 30 sec - exiting 18:38:55 (7144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:40:33 (10332): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:15:23 (6388): No heartbeat from core client for 30 sec - exiting 17:15:24 (6388): No heartbeat from core client for 30 sec - exiting 17:15:25 (6388): No heartbeat from core client for 30 sec - exiting 17:15:26 (6388): No heartbeat from core client for 30 sec - exiting 17:15:27 (6388): No heartbeat from core client for 30 sec - exiting 17:15:28 (6388): No heartbeat from core client for 30 sec - exiting 17:15:29 (6388): No heartbeat from core client for 30 sec - exiting 17:15:30 (6388): No heartbeat from core client for 30 sec - exiting 17:15:31 (6388): No heartbeat from core client for 30 sec - exiting 17:15:32 (6388): No heartbeat from core client for 30 sec - exiting 17:15:34 (6388): No heartbeat from core client for 30 sec - exiting 17:15:35 (6388): No heartbeat from core client for 30 sec - exiting 17:15:36 (6388): No heartbeat from core client for 30 sec - exiting 17:15:37 (6388): No heartbeat from core client for 30 sec - exiting 17:15:38 (6388): No heartbeat from core client for 30 sec - exiting 17:15:39 (6388): No heartbeat from core client for 30 sec - exiting 17:15:40 (6388): No heartbeat from core client for 30 sec - exiting 17:15:41 (6388): No heartbeat from core client for 30 sec - exiting 17:15:42 (6388): No heartbeat from core client for 30 sec - exiting 17:15:45 (6388): No heartbeat from core client for 30 sec - exiting 17:15:46 (6388): No heartbeat from core client for 30 sec - exiting 17:15:48 (6388): No heartbeat from core client for 30 sec - exiting 17:15:49 (6388): No heartbeat from core client for 30 sec - exiting 17:15:50 (6388): No heartbeat from core client for 30 sec - exiting 17:15:51 (6388): No heartbeat from core client for 30 sec - exiting 17:15:52 (6388): No heartbeat from core client for 30 sec - exiting 17:15:53 (6388): No heartbeat from core client for 30 sec - exiting 17:15:54 (6388): No heartbeat from core client for 30 sec - exiting 17:15:55 (6388): No heartbeat from core client for 30 sec - exiting 17:15:56 (6388): No heartbeat from core client for 30 sec - exiting 17:15:57 (6388): No heartbeat from core client for 30 sec - exiting 17:15:58 (6388): No heartbeat from core client for 30 sec - exiting 17:15:59 (6388): No heartbeat from core client for 30 sec - exiting 17:16:00 (6388): No heartbeat from core client for 30 sec - exiting 17:16:01 (6388): No heartbeat from core client for 30 sec - exiting 17:16:02 (6388): No heartbeat from core client for 30 sec - exiting 17:16:03 (6388): No heartbeat from core client for 30 sec - exiting 17:16:04 (6388): No heartbeat from core client for 30 sec - exiting 17:16:05 (6388): No heartbeat from core client for 30 sec - exiting 17:16:06 (6388): No heartbeat from core client for 30 sec - exiting 17:16:07 (6388): No heartbeat from core client for 30 sec - exiting 17:16:08 (6388): No heartbeat from core client for 30 sec - exiting 17:16:09 (6388): No heartbeat from core client for 30 sec - exiting 17:16:10 (6388): No heartbeat from core client for 30 sec - exiting 17:16:11 (6388): No heartbeat from core client for 30 sec - exiting 17:16:12 (6388): No heartbeat from core client for 30 sec - exiting 17:16:14 (6388): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:22:37 (81048): No heartbeat from core client for 30 sec - exiting 11:22:39 (81048): No heartbeat from core client for 30 sec - exiting 11:22:40 (81048): No heartbeat from core client for 30 sec - exiting 11:22:42 (81048): No heartbeat from core client for 30 sec - exiting 11:22:44 (81048): No heartbeat from core client for 30 sec - exiting 11:22:45 (81048): No heartbeat from core client for 30 sec - exiting 11:22:47 (81048): No heartbeat from core client for 30 sec - exiting 11:22:49 (81048): No heartbeat from core client for 30 sec - exiting 11:22:50 (81048): No heartbeat from core client for 30 sec - exiting 11:22:52 (81048): No heartbeat from core client for 30 sec - exiting 11:22:53 (81048): No heartbeat from core client for 30 sec - exiting 11:22:55 (81048): No heartbeat from core client for 30 sec - exiting 11:22:57 (81048): No heartbeat from core client for 30 sec - exiting 11:22:58 (81048): No heartbeat from core client for 30 sec - exiting 11:22:59 (81048): No heartbeat from core client for 30 sec - exiting 11:23:01 (81048): No heartbeat from core client for 30 sec - exiting 11:23:02 (81048): No heartbeat from core client for 30 sec - exiting 11:23:04 (81048): No heartbeat from core client for 30 sec - exiting 11:23:05 (81048): No heartbeat from core client for 30 sec - exiting 11:23:06 (81048): No heartbeat from core client for 30 sec - exiting 11:23:08 (81048): No heartbeat from core client for 30 sec - exiting 11:23:09 (81048): No heartbeat from core client for 30 sec - exiting 11:23:46 (81048): No heartbeat from core client for 30 sec - exiting 11:23:47 (81048): No heartbeat from core client for 30 sec - exiting 11:23:49 (81048): No heartbeat from core client for 30 sec - exiting 11:23:50 (81048): No heartbeat from core client for 30 sec - exiting 11:23:51 (81048): No heartbeat from core client for 30 sec - exiting 11:23:53 (81048): No heartbeat from core client for 30 sec - exiting 11:23:54 (81048): No heartbeat from core client for 30 sec - exiting 11:23:56 (81048): No heartbeat from core client for 30 sec - exiting 11:23:57 (81048): No heartbeat from core client for 30 sec - exiting 11:23:59 (81048): No heartbeat from core client for 30 sec - exiting 11:24:00 (81048): No heartbeat from core client for 30 sec - exiting 11:24:02 (81048): No heartbeat from core client for 30 sec - exiting 11:24:03 (81048): No heartbeat from core client for 30 sec - exiting 11:24:04 (81048): No heartbeat from core client for 30 sec - exiting 11:24:05 (81048): No heartbeat from core client for 30 sec - exiting 11:24:06 (81048): No heartbeat from core client for 30 sec - exiting 11:24:08 (81048): No heartbeat from core client for 30 sec - exiting 11:24:09 (81048): No heartbeat from core client for 30 sec - exiting 11:24:10 (81048): No heartbeat from core client for 30 sec - exiting 11:24:12 (81048): No heartbeat from core client for 30 sec - exiting 11:24:14 (81048): No heartbeat from core client for 30 sec - exiting 11:24:16 (81048): No heartbeat from core client for 30 sec - exiting 11:24:17 (81048): No heartbeat from core client for 30 sec - exiting 11:24:19 (81048): No heartbeat from core client for 30 sec - exiting 11:24:20 (81048): No heartbeat from core client for 30 sec - exiting 11:24:22 (81048): No heartbeat from core client for 30 sec - exiting 11:24:23 (81048): No heartbeat from core client for 30 sec - exiting 11:24:24 (81048): No heartbeat from core client for 30 sec - exiting 11:24:26 (81048): No heartbeat from core client for 30 sec - exiting 11:24:28 (81048): No heartbeat from core client for 30 sec - exiting 11:24:29 (81048): No heartbeat from core client for 30 sec - exiting 11:24:30 (81048): No heartbeat from core client for 30 sec - exiting 11:24:32 (81048): No heartbeat from core client for 30 sec - exiting 11:24:33 (81048): No heartbeat from core client for 30 sec - exiting 11:24:34 (81048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:22:47 (38972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:22:49 (38972): No heartbeat from core client for 30 sec - exiting 13:22:50 (38972): No heartbeat from core client for 30 sec - exiting 13:33:35 (166812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:33:36 (166812): No heartbeat from core client for 30 sec - exiting 13:33:37 (166812): No heartbeat from core client for 30 sec - exiting 13:33:38 (166812): No heartbeat from core client for 30 sec - exiting 13:33:39 (166812): No heartbeat from core client for 30 sec - exiting 13:33:40 (166812): No heartbeat from core client for 30 sec - exiting 13:33:41 (166812): No heartbeat from core client for 30 sec - exiting 13:33:42 (166812):Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=13940, iMonCtr=1 Model crash detected, will try to restart... 12:42:49 (4128): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:15:53 (274312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:15:54 (274312): No heartbeat from core client for 30 sec - exiting 20:15:58 (274312): No heartbeat from core client for 30 sec - exiting 20:15:59 (274312): No heartbeat from core client for 30 sec - exiting 20:16:00 (274312): No heartbeat from core client for 30 sec - exiting 20:16:01 (274312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 14:41:22 (294900): No heartbeat from core client for 30 sec - exiting C14:41:23 (294900): No heartbeat from core client for 30 sec - exiting C15:41:17 (301696): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:42:56 (134792): No heartbeat from core client for 30 sec - exiting 19:42:57 (134792): No heartbeat from core client for 30 sec - exiting 19:42:58 (134792): No heartbeat from core client for 30 sec - exiting 19:42:59 (134792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:43:00 (134792): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=382852, iMonCtr=1 Model crash detected, will try to restart... 16:57:20 (8136): No heartbeat from core client for 30 sec - exiting 16:57:21 (8136): No heartbeat from core client for 30 sec - exiting 16:57:22 (8136): No heartbeat from core client for 30 sec - exiting 16:57:23 (8136): No heartbeat from core client for 30 sec - exiting 16:57:24 (8136): No heartbeat from core client for 30 sec - exiting 16:57:26 (8136): No heartbeat from core client for 30 sec - exiting 16:57:27 (8136): No heartbeat from core client for 30 sec - exiting 16:57:28 (8136): No heartbeat from core client for 30 sec - exiting 16:57:29 (8136): No heartbeat from core client for 30 sec - exiting 16:57:30 (8136): No heartbeat from core client for 30 sec - exiting 16:57:31 (8136): No heartbeat from core client for 30 sec - exiting 16:57:33 (8136): No heartbeat from core client for 30 sec - exiting 16:57:34 (8136): No heartbeat from core client for 30 sec - exiting 16:57:35 (8136): No heartbeat from core client for 30 sec - exiting 16:57:38 (8136): No heartbeat from core client for 30 sec - exiting 16:57:39 (8136): No heartbeat from core client for 30 sec - exiting 16:57:40 (8136): No heartbeat from core client for 30 sec - exiting 16:57:42 (8136): No heartbeat from core client for 30 sec - exiting 16:58:16 (8136): No heartbeat from core client for 30 sec - exiting 16:58:17 (8136): No heartbeat from core client for 30 sec - exiting 16:58:18 (8136): No heartbeat from core client for 30 sec - exiting 16:58:19 (8136): No heartbeat from core client for 30 sec - exiting 16:58:20 (8136): No heartbeat from core client for 30 sec - exiting 16:58:23 (8136): No heartbeat from core client for 30 sec - exiting 16:58:24 (8136): No heartbeat from core client for 30 sec - exiting 16:58:25 (8136): No heartbeat from core client for 30 sec - exiting 16:58:26 (8136): No heartbeat from core client for 30 sec - exiting 16:58:27 (8136): No heartbeat from core client for 30 sec - exiting 16:58:28 (8136): No heartbeat from core client for 30 sec - exiting 16:58:30 (8136): No heartbeat from core client for 30 sec - exiting 16:58:31 (8136): No heartbeat from core client for 30 sec - exiting 16:58:32 (8136): No heartbeat from core client for 30 sec - exiting 16:58:33 (8136): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 10:40:28 (8312): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 10:44:28 (4856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9180, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8480, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=14304, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=15184, iMonCtr=1 Model crash detected, will try to restart... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7928, iMonCtr=1 Model crash detected, will try to restart... 10:45:33 (13064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:45:34 (13064): No heartbeat from core client for 30 sec - exiting Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=12936, iMonCtr=1 Model crash detected, will try to restart... 12:45:09 (7460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:06:24 (8144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:29:08 (4888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:52:09 (7564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:29:33 (8704): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... C08:10:41 (1012): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CCalled boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 May 2014 06:42:39 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 570,240 | 1,025,961 | 1.7992 |
02 Jan 2014 18:42:03 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 544,320 | 980,155 | 1.8007 |
24 Dec 2013 14:20:09 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 518,400 | 933,071 | 1.7999 |
12 Nov 2013 03:20:38 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 492,480 | 884,804 | 1.7966 |
04 Nov 2013 14:56:26 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 466,560 | 836,248 | 1.7924 |
02 Nov 2013 06:32:29 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 440,640 | 788,627 | 1.7897 |
31 Oct 2013 17:10:51 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 414,720 | 741,269 | 1.7874 |
29 Oct 2013 18:27:00 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 388,800 | 693,613 | 1.7840 |
28 Oct 2013 02:02:40 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 362,880 | 646,055 | 1.7804 |
26 Oct 2013 20:48:34 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 336,960 | 599,496 | 1.7791 |
14 Oct 2013 23:15:36 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 311,040 | 547,938 | 1.7616 |
12 Oct 2013 07:39:20 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 285,120 | 501,770 | 1.7599 |
20 Aug 2013 20:51:46 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 259,200 | 454,621 | 1.7539 |
20 Aug 2013 20:51:46 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 233,280 | 408,636 | 1.7517 |
20 Aug 2013 20:51:46 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 207,360 | 362,901 | 1.7501 |
20 Aug 2013 20:51:46 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 181,440 | 316,927 | 1.7467 |
06 Jul 2013 04:43:35 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 155,520 | 269,902 | 1.7355 |
03 Jul 2013 07:45:06 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 129,600 | 222,540 | 1.7171 |
02 Jul 2013 12:08:09 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 103,680 | 176,084 | 1.6983 |
02 Jul 2013 11:13:09 | 1042736 | 15451197 | hadcm3n_zcwj_1880_40_008249908_2 | 77,760 | 128,430 | 1.6516 |
©2024 climateprediction.net