Name | hadcm3n_o5ha_2060_40_008139422_0 |
Workunit | 8294536 |
Created | 13 Aug 2012, 13:02:15 UTC |
Sent | 13 Aug 2012, 13:04:56 UTC |
Report deadline | 12 Nov 2012, 20:32:07 UTC |
Received | 28 Aug 2012, 1:36:56 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -226 (0xFFFFFF1E) ERR_TOO_MANY_EXITS |
Computer ID | 1186899 |
Run time | 11 days 17 hours 24 min 19 sec |
CPU time | 11 days 12 hours 45 min 9 sec |
Validate state | Invalid |
Credit | 6,531.84 |
Device peak FLOPS | 2.82 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <message> too many boinc_temporary_exit()s </message> <stderr_txt> 18:35:54 (6072): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:41:32 (4580): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:01:57 (3480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:47:44 (4340): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:02:18 (2832): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:31:13 (9196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:06:34 (3364): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:28:40 (4784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:00:32 (3988): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:56:14 (4256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:56:39 (5924): Can't acquire lockfile (32) - waiting 35s 12:56:56 (5836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:14 (5924): Can't set up shared mem: -1. Will run in standalone mode. 12:57:14 (5512): Can't set up shared mem: -1. Will run in standalone mode. 08:18:43 (2676): Can't acquire lockfile (32) - waiting 35s 08:19:19 (2676): Can't acquire lockfile (32) - exiting 08:19:19 (2676): Error: The process cannot access the file because it is being used by another process. (0x20) 09:02:42 (5652): Can't acquire lockfile (32) - waiting 35s 09:03:17 (5652): Can't acquire lockfile (32) - exiting 09:03:17 (5652): Error: The process cannot access the file because it is being used by another process. (0x20) 09:19:34 (5144): Can't acquire lockfile (32) - waiting 35s 09:20:09 (5144): Can't acquire lockfile (32) - exiting 09:20:09 (5144): Error: The process cannot access the file because it is being used by another process. (0x20) 09:30:11 (5752): Can't acquire lockfile (32) - waiting 35s 09:30:46 (5752): Can't acquire lockfile (32) - exiting 09:30:46 (5752): Error: The process cannot access the file because it is being used by another process. (0x20) 09:59:57 (3888): Can't acquire lockfile (32) - waiting 35s 10:00:32 (3888): Can't acquire lockfile (32) - exiting 10:00:32 (3888): Error: The process cannot access the file because it is being used by another process. (0x20) 10:10:43 (2688): Can't acquire lockfile (32) - waiting 35s 10:11:18 (2688): Can't acquire lockfile (32) - exiting 10:11:18 (2688): Error: The process cannot access the file because it is being used by another process. (0x20) 10:21:51 (6684): Can't acquire lockfile (32) - waiting 35s 10:22:26 (6684): Can't acquire lockfile (32) - exiting 10:22:26 (6684): Error: The process cannot access the file because it is being used by another process. (0x20) 10:32:29 (1336): Can't acquire lockfile (32) - waiting 35s 10:33:04 (1336): Can't acquire lockfile (32) - exiting 10:33:04 (1336): Error: The process cannot access the file because it is being used by another process. (0x20) 10:43:29 (5200): Can't acquire lockfile (32) - waiting 35s 10:44:04 (5200): Can't acquire lockfile (32) - exiting 10:44:04 (5200): Error: The process cannot access the file because it is being used by another process. (0x20) 10:59:55 (7052): Can't acquire lockfile (32) - waiting 35s 11:00:30 (7052): Can't acquire lockfile (32) - exiting 11:00:30 (7052): Error: The process cannot access the file because it is being used by another process. (0x20) 11:10:41 (5956): Can't acquire lockfile (32) - waiting 35s 11:11:16 (5956): Can't acquire lockfile (32) - exiting 11:11:16 (5956): Error: The process cannot access the file because it is being used by another process. (0x20) 11:21:26 (7388): Can't acquire lockfile (32) - waiting 35s 11:22:01 (7388): Can't acquire lockfile (32) - exiting 11:22:01 (7388): Error: The process cannot access the file because it is being used by another process. (0x20) 11:32:12 (2884): Can't acquire lockfile (32) - waiting 35s 11:32:47 (2884): Can't acquire lockfile (32) - exiting 11:32:47 (2884): Error: The process cannot access the file because it is being used by another process. (0x20) 11:42:58 (4868): Can't acquire lockfile (32) - waiting 35s 11:43:33 (4868): Can't acquire lockfile (32) - exiting 11:43:33 (4868): Error: The process cannot access the file because it is being used by another process. (0x20) 11:53:44 (8092): Can't acquire lockfile (32) - waiting 35s 11:54:19 (8092): Can't acquire lockfile (32) - exiting 11:54:19 (8092): Error: The process cannot access the file because it is being used by another process. (0x20) 12:04:29 (7348): Can't acquire lockfile (32) - waiting 35s 12:05:04 (7348): Can't acquire lockfile (32) - exiting 12:05:04 (7348): Error: The process cannot access the file because it is being used by another process. (0x20) 12:15:32 (8164): Can't acquire lockfile (32) - waiting 35s 12:16:07 (8164): Can't acquire lockfile (32) - exiting 12:16:07 (8164): Error: The process cannot access the file because it is being used by another process. (0x20) 12:26:25 (5748): Can't acquire lockfile (32) - waiting 35s 12:27:00 (5748): Can't acquire lockfile (32) - exiting 12:27:00 (5748): Error: The process cannot access the file because it is being used by another process. (0x20) 12:45:12 (7216): Can't acquire lockfile (32) - waiting 35s 12:45:47 (7216): Can't acquire lockfile (32) - exiting 12:45:47 (7216): Error: The process cannot access the file because it is being used by another process. (0x20) 12:56:11 (8116): Can't acquire lockfile (32) - waiting 35s 12:56:46 (8116): Can't acquire lockfile (32) - exiting 12:56:46 (8116): Error: The process cannot access the file because it is being used by another process. (0x20) 13:07:22 (7064): Can't acquire lockfile (32) - waiting 35s 13:07:57 (7064): Can't acquire lockfile (32) - exiting 13:07:57 (7064): Error: The process cannot access the file because it is being used by another process. (0x20) 13:18:56 (7364): Can't acquire lockfile (32) - waiting 35s 13:19:31 (7364): Can't acquire lockfile (32) - exiting 13:19:31 (7364): Error: The process cannot access the file because it is being used by another process. (0x20) 13:30:18 (5532): Can't acquire lockfile (32) - waiting 35s 13:30:53 (5532): Can't acquire lockfile (32) - exiting 13:30:53 (5532): Error: The process cannot access the file because it is being used by another process. (0x20) 13:47:03 (7504): Can't acquire lockfile (32) - waiting 35s 13:47:38 (7504): Can't acquire lockfile (32) - exiting 13:47:38 (7504): Error: The process cannot access the file because it is being used by another process. (0x20) 13:58:14 (1012): Can't acquire lockfile (32) - waiting 35s 13:58:49 (1012): Can't acquire lockfile (32) - exiting 13:58:49 (1012): Error: The process cannot access the file because it is being used by another process. (0x20) 14:15:17 (5376): Can't acquire lockfile (32) - waiting 35s 14:15:52 (5376): Can't acquire lockfile (32) - exiting 14:15:52 (5376): Error: The process cannot access the file because it is being used by another process. (0x20) 14:31:30 (6356): Can't acquire lockfile (32) - waiting 35s 14:32:05 (6356): Can't acquire lockfile (32) - exiting 14:32:05 (6356): Error: The process cannot access the file because it is being used by another process. (0x20) 14:50:15 (6720): Can't acquire lockfile (32) - waiting 35s 14:50:50 (6720): Can't acquire lockfile (32) - exiting 14:50:50 (6720): Error: The process cannot access the file because it is being used by another process. (0x20) 15:14:15 (608): Can't acquire lockfile (32) - waiting 35s 15:14:50 (608): Can't acquire lockfile (32) - exiting 15:14:50 (608): Error: The process cannot access the file because it is being used by another process. (0x20) 15:25:26 (6748): Can't acquire lockfile (32) - waiting 35s 15:26:01 (6748): Can't acquire lockfile (32) - exiting 15:26:01 (6748): Error: The process cannot access the file because it is being used by another process. (0x20) 15:36:12 (3580): Can't acquire lockfile (32) - waiting 35s 15:36:47 (3580): Can't acquire lockfile (32) - exiting 15:36:47 (3580): Error: The process cannot access the file because it is being used by another process. (0x20) 15:57:48 (8700): Can't acquire lockfile (32) - waiting 35s 15:58:23 (8700): Can't acquire lockfile (32) - exiting 15:58:23 (8700): Error: The process cannot access the file because it is being used by another process. (0x20) 16:09:18 (7392): Can't acquire lockfile (32) - waiting 35s 16:09:53 (7392): Can't acquire lockfile (32) - exiting 16:09:53 (7392): Error: The process cannot access the file because it is being used by another process. (0x20) 16:30:21 (8236): Can't acquire lockfile (32) - waiting 35s 16:30:56 (8236): Can't acquire lockfile (32) - exiting 16:30:56 (8236): Error: The process cannot access the file because it is being used by another process. (0x20) 16:41:05 (8612): Can't acquire lockfile (32) - waiting 35s 16:41:40 (8612): Can't acquire lockfile (32) - exiting 16:41:40 (8612): Error: The process cannot access the file because it is being used by another process. (0x20) 16:59:17 (8256): Can't acquire lockfile (32) - waiting 35s 16:59:52 (8256): Can't acquire lockfile (32) - exiting 16:59:52 (8256): Error: The process cannot access the file because it is being used by another process. (0x20) 17:10:43 (8020): Can't acquire lockfile (32) - waiting 35s 17:11:18 (8020): Can't acquire lockfile (32) - exiting 17:11:18 (8020): Error: The process cannot access the file because it is being used by another process. (0x20) 17:21:54 (8268): Can't acquire lockfile (32) - waiting 35s 17:22:29 (8268): Can't acquire lockfile (32) - exiting 17:22:29 (8268): Error: The process cannot access the file because it is being used by another process. (0x20) 17:33:05 (6080): Can't acquire lockfile (32) - waiting 35s 17:33:40 (6080): Can't acquire lockfile (32) - exiting 17:33:40 (6080): Error: The process cannot access the file because it is being used by another process. (0x20) 17:45:40 (5352): Can't acquire lockfile (32) - waiting 35s 17:46:15 (5352): Can't acquire lockfile (32) - exiting 17:46:15 (5352): Error: The process cannot access the file because it is being used by another process. (0x20) 18:00:52 (8984): Can't acquire lockfile (32) - waiting 35s 18:01:27 (8984): Can't acquire lockfile (32) - exiting 18:01:27 (8984): Error: The process cannot access the file because it is being used by another process. (0x20) 18:12:21 (5612): Can't acquire lockfile (32) - waiting 35s 18:12:56 (5612): Can't acquire lockfile (32) - exiting 18:12:56 (5612): Error: The process cannot access the file because it is being used by another process. (0x20) 18:28:54 (9020): Can't acquire lockfile (32) - waiting 35s 18:29:29 (9020): Can't acquire lockfile (32) - exiting 18:29:29 (9020): Error: The process cannot access the file because it is being used by another process. (0x20) 18:40:06 (9108): Can't acquire lockfile (32) - waiting 35s 18:40:41 (9108): Can't acquire lockfile (32) - exiting 18:40:41 (9108): Error: The process cannot access the file because it is being used by another process. (0x20) 18:51:24 (8992): Can't acquire lockfile (32) - waiting 35s 18:51:59 (8992): Can't acquire lockfile (32) - exiting 18:51:59 (8992): Error: The process cannot access the file because it is being used by another process. (0x20) 19:02:35 (4168): Can't acquire lockfile (32) - waiting 35s 19:03:10 (4168): Can't acquire lockfile (32) - exiting 19:03:10 (4168): Error: The process cannot access the file because it is being used by another process. (0x20) 19:14:41 (9508): Can't acquire lockfile (32) - waiting 35s 19:15:16 (9508): Can't acquire lockfile (32) - exiting 19:15:16 (9508): Error: The process cannot access the file because it is being used by another process. (0x20) 19:41:40 (9360): Can't acquire lockfile (32) - waiting 35s 19:42:15 (9360): Can't acquire lockfile (32) - exiting 19:42:15 (9360): Error: The process cannot access the file because it is being used by another process. (0x20) 19:53:09 (7660): Can't acquire lockfile (32) - waiting 35s 19:53:44 (7660): Can't acquire lockfile (32) - exiting 19:53:44 (7660): Error: The process cannot access the file because it is being used by another process. (0x20) 20:04:20 (9432): Can't acquire lockfile (32) - waiting 35s 20:04:55 (9432): Can't acquire lockfile (32) - exiting 20:04:55 (9432): Error: The process cannot access the file because it is being used by another process. (0x20) 20:14:55 (9424): Can't acquire lockfile (32) - waiting 35s 20:15:30 (9424): Can't acquire lockfile (32) - exiting 20:15:30 (9424): Error: The process cannot access the file because it is being used by another process. (0x20) </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
26 Aug 2012 09:35:24 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 544,320 | 971,001 | 1.7839 |
25 Aug 2012 19:26:05 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 518,400 | 925,051 | 1.7844 |
25 Aug 2012 06:34:18 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 492,480 | 879,036 | 1.7849 |
24 Aug 2012 17:19:53 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 466,560 | 831,590 | 1.7824 |
24 Aug 2012 04:10:14 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 440,640 | 784,638 | 1.7807 |
23 Aug 2012 14:47:45 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 414,720 | 737,129 | 1.7774 |
22 Aug 2012 12:32:16 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 388,800 | 692,335 | 1.7807 |
21 Aug 2012 23:49:13 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 362,880 | 647,225 | 1.7836 |
21 Aug 2012 10:25:22 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 336,960 | 600,266 | 1.7814 |
20 Aug 2012 21:27:04 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 311,040 | 553,615 | 1.7799 |
20 Aug 2012 07:19:13 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 285,120 | 504,576 | 1.7697 |
19 Aug 2012 18:30:57 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 259,200 | 458,035 | 1.7671 |
19 Aug 2012 04:58:29 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 233,280 | 411,024 | 1.7619 |
18 Aug 2012 15:35:21 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 207,360 | 365,117 | 1.7608 |
18 Aug 2012 02:32:59 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 181,440 | 318,462 | 1.7552 |
17 Aug 2012 08:44:51 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 155,520 | 271,550 | 1.7461 |
16 Aug 2012 20:46:59 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 129,600 | 225,769 | 1.7420 |
16 Aug 2012 07:25:38 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 103,680 | 181,731 | 1.7528 |
15 Aug 2012 19:05:46 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 77,760 | 137,693 | 1.7707 |
14 Aug 2012 16:55:44 | 1186899 | 15109877 | hadcm3n_o5ha_2060_40_008139422_0 | 51,840 | 92,566 | 1.7856 |
©2024 cpdn.org