|
Name | hadcm3n_2035_1940_40_007817032_0 |
Workunit | 7972141 |
Created | 28 Feb 2012, 1:10:25 UTC |
Sent | 28 Feb 2012, 6:15:09 UTC |
Report deadline | 29 May 2012, 13:42:20 UTC |
Received | 28 May 2012, 10:05:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 959555 |
Run time | 27 days 1 hours 50 min 41 sec |
CPU time | 26 days 8 hours 40 min 27 sec |
Validate state | Invalid |
Credit | 10,264.32 |
Device peak FLOPS | 2.63 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 14:27:49 (3988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:04:48 (3564): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:53:13 (2820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:03:03 (10664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 07:12:50 (1952): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:11 (2032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 09:00:15 (3888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:00:43 (3888): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 13:14:06 (2928): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:14:07 (2928): No heartbeat from core client for 30 sec - exiting 13:15:24 (4764): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:15:25 (4764): No heartbeat from core client for 30 sec - exiting 13:15:26 (4764): No heartbeat from core client for 30 sec - exiting 13:15:27 (4764): No heartbeat from core client for 30 sec - exiting 13:15:28 (4764): No heartbeat from core client for 30 sec - exiting 13:15:29 (4764): No heartbeat from core client for 30 sec - exiting 13:15:30 (4764): No heartbeat from core client for 30 sec - exiting 13:15:31 (4764): No heartbeat from core client for 30 sec - exiting 13:15:32 (4764): No heartbeat from core client for 30 sec - exiting 13:15:33 (4764): No heartbeat from core client for 30 sec - exiting 13:15:34 (4764): No heartbeat from core client for 30 sec - exiting 13:19:11 (1636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:19:13 (1636): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 08:36:37 (3800): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:36:39 (3800): No heartbeat from core client for 30 sec - exiting 08:36:40 (3800): No heartbeat from core client for 30 sec - exiting 08:36:41 (3800): No heartbeat from core client for 30 sec - exiting 08:36:42 (3800): No heartbeat from core client for 30 sec - exiting 08:37:54 (4912): No heartbeat from core client for 30 sec - exiting 08:37:55 (4912): No heartbeat from core client for 30 sec - exiting 08:37:56 (4912): No heartbeat from core client for 30 sec - exiting 08:37:57 (4912): No heartbeat from core client for 30 sec - exiting 08:37:58 (4912): No heartbeat from core client for 30 sec - exiting 08:37:59 (4912): No heartbeat from core client for 30 sec - exiting 08:38:00 (4912): No heartbeat from core client for 30 sec - exiting 08:38:01 (4912): No heartbeat from core client for 30 sec - exiting 08:38:02 (4912): No heartbeat from core client for 30 sec - exiting 08:38:03 (4912): No heartbeat from core client for 30 sec - exiting 08:38:04 (4912): No heartbeat from core client for 30 sec - exiting 08:38:05 (4912): No heartbeat from core client for 30 sec - exiting 08:38:06 (4912): No heartbeat from core client for 30 sec - exiting 08:38:07 (4912): No heartbeat from core client for 30 sec - exiting 08:38:08 (4912): No heartbeat from core client for 30 sec - exiting 08:38:09 (4912): No heartbeat from core client for 30 sec - exiting 08:38:10 (4912): No heartbeat from core client for 30 sec - exiting 08:38:11 (4912): No heartbeat from core client for 30 sec - exiting 08:38:12 (4912): No heartbeat from core client for 30 sec - exiting 08:38:13 (4912): No heartbeat from core client for 30 sec - exiting 08:38:14 (4912): No heartbeat from core client for 30 sec - exiting 08:38:15 (4912): No heartbeat from core client for 30 sec - exiting 08:38:16 (4912): No heartbeat from core client for 30 sec - exiting 08:38:17 (4912): No heartbeat from core client for 30 sec - exiting 08:38:18 (4912): No heartbeat from core client for 30 sec - exiting 08:38:19 (4912): No heartbeat from core client for 30 sec - exiting 08:38:20 (4912): No heartbeat from core client for 30 sec - exiting 08:38:21 (4912): No heartbeat from core client for 30 sec - exiting 08:38:22 (4912): No heartbeat from core client for 30 sec - exiting 08:38:23 (4912): No heartbeat from core client for 30 sec - exiting 08:38:24 (4912): No heartbeat from core client for 30 sec - exiting 08:38:25 (4912): No heartbeat from core client for 30 sec - exiting 08:38:26 (4912): No heartbeat from core client for 30 sec - exiting 08:38:27 (4912): No heartbeat from core client for 30 sec - exiting 08:38:28 (4912): No heartbeat from core client for 30 sec - exiting 08:38:29 (4912): No heartbeat from core client for 30 sec - exiting 08:38:30 (4912): No heartbeat from core client for 30 sec - exiting 08:38:31 (4912): No heartbeat from core client for 30 sec - exiting 08:38:32 (4912): No heartbeat from core client for 30 sec - exiting 08:38:33 (4912): No heartbeat from core client for 30 sec - exiting 08:38:34 (4912): No heartbeat from core client for 30 sec - exiting 08:38:35 (4912): No heartbeat from core client for 30 sec - exiting 08:38:36 (4912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:41:40 (9924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:41:41 (9924): No heartbeat from core client for 30 sec - exiting 09:42:33 (2072): No heartbeat from core client for 30 sec - exiting 09:42:34 (2072): No heartbeat from core client for 30 sec - exiting 09:42:35 (2072): No heartbeat from core client for 30 sec - exiting 09:42:36 (2072): No heartbeat from core client for 30 sec - exiting 09:42:37 (2072): No heartbeat from core client for 30 sec - exiting 09:42:38 (2072): No heartbeat from core client for 30 sec - exiting 09:42:39 (2072): No heartbeat from core client for 30 sec - exiting 09:42:40 (2072): No heartbeat from core client for 30 sec - exiting 09:42:41 (2072): No heartbeat from core client for 30 sec - exiting 09:42:42 (2072): No heartbeat from core client for 30 sec - exiting 09:42:43 (2072): No heartbeat from core client for 30 sec - exiting 09:42:44 (2072): No heartbeat from core client for 30 sec - exiting 09:42:45 (2072): No heartbeat from core client for 30 sec - exiting 09:42:46 (2072): No heartbeat from core client for 30 sec - exiting 09:42:47 (2072): No heartbeat from core client for 30 sec - exiting 09:42:48 (2072): No heartbeat from core client for 30 sec - exiting 09:42:49 (2072): No heartbeat from core client for 30 sec - exiting 09:42:50 (2072): No heartbeat from core client for 30 sec - exiting 09:42:51 (2072): No heartbeat from core client for 30 sec - exiting 09:42:52 (2072): No heartbeat from core client for 30 sec - exiting 09:42:53 (2072): No heartbeat from core client for 30 sec - exiting 09:42:54 (2072): No heartbeat from core client for 30 sec - exiting 09:42:55 (2072): No heartbeat from core client for 30 sec - exiting 09:42:56 (2072): No heartbeat from core client for 30 sec - exiting 09:42:57 (2072): No heartbeat from core client for 30 sec - exiting 09:42:58 (2072): No heartbeat from core client for 30 sec - exiting 09:42:59 (2072): No heartbeat from core client for 30 sec - exiting 09:43:00 (2072): No heartbeat from core client for 30 sec - exiting 09:43:01 (2072): No heartbeat from core client for 30 sec - exiting 09:43:02 (2072): No heartbeat from core client for 30 sec - exiting 09:43:03 (2072): No heartbeat from core client for 30 sec - exiting 09:43:04 (2072): No heartbeat from core client for 30 sec - exiting 09:43:05 (2072): No heartbeat from core client for 30 sec - exiting 09:43:06 (2072): No heartbeat from core client for 30 sec - exiting 09:43:07 (2072): No heartbeat from core client for 30 sec - exiting 09:43:08 (2072): No heartbeat from core client for 30 sec - exiting 09:43:09 (2072): No heartbeat from core client for 30 sec - exiting 09:43:10 (2072): No heartbeat from core client for 30 sec - exiting 09:43:11 (2072): No heartbeat from core client for 30 sec - exiting 09:43:12 (2072): No heartbeat from core client for 30 sec - exiting 09:43:13 (2072): No heartbeat from core client for 30 sec - exiting 09:43:14 (2072): No heartbeat from core client for 30 sec - exiting 09:43:15 (2072): No heartbeat from core client for 30 sec - exiting 09:43:16 (2072): No heartbeat from core client for 30 sec - exiting 09:43:17 (2072): No heartbeat from core client for 30 sec - exiting 09:43:18 (2072): No heartbeat from core client for 30 sec - exiting 09:43:19 (2072): No heartbeat from core client for 30 sec - exiting 09:43:20 (2072): No heartbeat from core client for 30 sec - exiting 09:43:21 (2072): No heartbeat from core client for 30 sec - exiting 09:43:22 (2072): No heartbeat from core client for 30 sec - exiting 09:43:23 (2072): No heartbeat from core client for 30 sec - exiting 09:43:24 (2072): No heartbeat from core client for 30 sec - exiting 09:43:25 (2072): No heartbeat from core client for 30 sec - exiting 09:43:26 (2072): No heartbeat from core client for 30 sec - exiting 09:43:27 (2072): No heartbeat from core client for 30 sec - exiting 09:43:28 (2072): No heartbeat from core client for 30 sec - exiting 09:43:29 (2072): No heartbeat from core client for 30 sec - exiting 09:43:30 (2072): No heartbeat from core client for 30 sec - exiting 09:43:32 (2072): No heartbeat from core client for 30 sec - exiting 09:43:33 (2072): No heartbeat from core client for 30 sec - exiting 09:43:34 (2072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 06:11:42 (3564): Can't set up shared mem: -1. Will run in standalone mode. 06:12:46 (3896): Can't set up shared mem: -1. Will run in standalone mode. 17:57:12 (2448): Can't acquire lockfile (32) - waiting 35s 17:57:47 (2448): Can't acquire lockfile (32) - exiting 17:57:47 (2448): Error: The process cannot access the file because it is being used by another process. (0x20) 18:58:43 (1096): Can't acquire lockfile (32) - waiting 35s 18:59:18 (1096): Can't acquire lockfile (32) - exiting 18:59:18 (1096): Error: The process cannot access the file because it is being used by another process. (0x20) 19:59:22 (4020): Can't acquire lockfile (32) - waiting 35s 19:59:57 (4020): Can't acquire lockfile (32) - exiting 19:59:57 (4020): Error: The process cannot access the file because it is being used by another process. (0x20) 20:59:58 (8084): Can't acquire lockfile (32) - waiting 35s 21:00:33 (8084): Can't acquire lockfile (32) - exiting 21:00:33 (8084): Error: The process cannot access the file because it is being used by another process. (0x20) 22:00:35 (6344): Can't acquire lockfile (32) - waiting 35s 22:01:10 (6344): Can't acquire lockfile (32) - exiting 22:01:10 (6344): Error: The process cannot access the file because it is being used by another process. (0x20) 23:01:13 (6264): Can't acquire lockfile (32) - waiting 35s 23:01:48 (6264): Can't acquire lockfile (32) - exiting 23:01:48 (6264): Error: The process cannot access the file because it is being used by another process. (0x20) 00:01:50 (6756): Can't acquire lockfile (32) - waiting 35s 00:02:25 (6756): Can't acquire lockfile (32) - exiting 00:02:25 (6756): Error: The process cannot access the file because it is being used by another process. (0x20) 01:02:27 (4772): Can't acquire lockfile (32) - waiting 35s 01:03:02 (4772): Can't acquire lockfile (32) - exiting 01:03:02 (4772): Error: The process cannot access the file because it is being used by another process. (0x20) 02:03:06 (7352): Can't acquire lockfile (32) - waiting 35s 02:03:41 (7352): Can't acquire lockfile (32) - exiting 02:03:41 (7352): Error: The process cannot access the file because it is being used by another process. (0x20) 03:03:43 (7328): Can't acquire lockfile (32) - waiting 35s 03:04:18 (7328): Can't acquire lockfile (32) - exiting 03:04:18 (7328): Error: The process cannot access the file because it is being used by another process. (0x20) 04:04:22 (7352): Can't acquire lockfile (32) - waiting 35s 04:04:57 (7352): Can't acquire lockfile (32) - exiting 04:04:57 (7352): Error: The process cannot access the file because it is being used by another process. (0x20) 05:05:01 (8236): Can't acquire lockfile (32) - waiting 35s 05:05:36 (8236): Can't acquire lockfile (32) - exiting 05:05:36 (8236): Error: The process cannot access the file because it is being used by another process. (0x20) 06:05:39 (8892): Can't acquire lockfile (32) - waiting 35s 06:06:14 (8892): Can't acquire lockfile (32) - exiting 06:06:14 (8892): Error: The process cannot access the file because it is being used by another process. (0x20) 07:06:16 (5364): Can't acquire lockfile (32) - waiting 35s 07:06:51 (5364): Can't acquire lockfile (32) - exiting 07:06:52 (5364): Error: The process cannot access the file because it is being used by another process. (0x20) 08:06:55 (3920): Can't acquire lockfile (32) - waiting 35s 08:07:30 (3920): Can't acquire lockfile (32) - exiting 08:07:30 (3920): Error: The process cannot access the file because it is being used by another process. (0x20) 08:29:02 (4032): Can't acquire lockfile (32) - waiting 35s 08:29:37 (4032): Can't acquire lockfile (32) - exiting 08:29:37 (4032): Error: The process cannot access the file because it is being used by another process. (0x20) 09:29:40 (3716): Can't acquire lockfile (32) - waiting 35s 09:30:15 (3716): Can't acquire lockfile (32) - exiting 09:30:15 (3716): Error: The process cannot access the file because it is being used by another process. (0x20) 10:13:30 (7480): Can't acquire lockfile (32) - waiting 35s 10:14:05 (7480): Can't acquire lockfile (32) - exiting 10:14:05 (7480): Error: The process cannot access the file because it is being used by another process. (0x20) 10:24:09 (9004): Can't acquire lockfile (32) - waiting 35s 10:24:44 (9004): Can't acquire lockfile (32) - exiting 10:24:44 (9004): Error: The process cannot access the file because it is being used by another process. (0x20) 10:34:47 (9556): Can't acquire lockfile (32) - waiting 35s 10:35:22 (9556): Can't acquire lockfile (32) - exiting 10:35:22 (9556): Error: The process cannot access the file because it is being used by another process. (0x20) 10:49:20 (4536): Can't acquire lockfile (32) - waiting 35s 10:49:55 (4536): Can't acquire lockfile (32) - exiting 10:49:55 (4536): Error: The process cannot access the file because it is being used by another process. (0x20) 11:06:59 (952): Can't acquire lockfile (32) - waiting 35s 11:07:34 (952): Can't acquire lockfile (32) - exiting 11:07:34 (952): Error: The process cannot access the file because it is being used by another process. (0x20) 12:13:37 (8508): Can't acquire lockfile (32) - waiting 35s 12:14:12 (8508): Can't acquire lockfile (32) - exiting 12:14:12 (8508): Error: The process cannot access the file because it is being used by another process. (0x20) 13:14:15 (8444): Can't acquire lockfile (32) - waiting 35s 13:14:50 (8444): Can't acquire lockfile (32) - exiting 13:14:50 (8444): Error: The process cannot access the file because it is being used by another process. (0x20) 14:14:53 (9380): Can't acquire lockfile (32) - waiting 35s 14:15:28 (9380): Can't acquire lockfile (32) - exiting 14:15:28 (9380): Error: The process cannot access the file because it is being used by another process. (0x20) 14:34:46 (3340): Can't acquire lockfile (32) - waiting 35s 14:35:21 (3340): Can't acquire lockfile (32) - exiting 14:35:21 (3340): Error: The process cannot access the file because it is being used by another process. (0x20) 14:45:33 (2176): Can't acquire lockfile (32) - waiting 35s 14:46:08 (2176): Can't acquire lockfile (32) - exiting 14:46:08 (2176): Error: The process cannot access the file because it is being used by another process. (0x20) 15:18:10 (5772): Can't acquire lockfile (32) - waiting 35s 15:18:45 (5772): Can't acquire lockfile (32) - exiting 15:18:45 (5772): Error: The process cannot access the file because it is being used by another process. (0x20) 15:28:49 (9536): Can't acquire lockfile (32) - waiting 35s 15:29:24 (9536): Can't acquire lockfile (32) - exiting 15:29:24 (9536): Error: The process cannot access the file because it is being used by another process. (0x20) 16:33:59 (8332): Can't acquire lockfile (32) - waiting 35s 16:34:34 (8332): Can't acquire lockfile (32) - exiting 16:34:34 (8332): Error: The process cannot access the file because it is being used by another process. (0x20) CPDN Monitor - Quit request from BOINC... Model crashed: P_TH_ADJ : NEGATIVE PRESSURE VALUE CREATED. tmp/pipe_dummy 2048 CPDN Monitor - Quit request from BOINC... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 May 2012 10:06:37 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 855,360 | 2,274,175 | 2.6587 |
28 May 2012 10:06:37 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 829,440 | 2,171,878 | 2.6185 |
28 May 2012 10:06:37 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 803,520 | 2,077,732 | 2.5858 |
28 May 2012 10:06:37 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 777,600 | 1,985,110 | 2.5529 |
26 Mar 2012 22:31:32 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 751,680 | 1,890,398 | 2.5149 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 725,760 | 1,799,217 | 2.4791 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 699,840 | 1,706,682 | 2.4387 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 673,920 | 1,608,692 | 2.3871 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 648,000 | 1,515,213 | 2.3383 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 622,080 | 1,423,288 | 2.2880 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 596,160 | 1,333,580 | 2.2369 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 570,240 | 1,250,740 | 2.1934 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 544,320 | 1,194,558 | 2.1946 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 518,400 | 1,138,522 | 2.1962 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 492,480 | 1,082,264 | 2.1976 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 466,560 | 1,024,797 | 2.1965 |
26 Mar 2012 05:01:55 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 440,640 | 967,493 | 2.1957 |
15 Mar 2012 06:18:29 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 414,720 | 910,230 | 2.1948 |
15 Mar 2012 06:18:29 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 388,800 | 852,888 | 2.1936 |
15 Mar 2012 06:18:29 | 959555 | 14198687 | hadcm3n_2035_1940_40_007817032_0 | 362,880 | 795,834 | 2.1931 |
©2024 climateprediction.net