Name | hadcm3n_y9i0_1900_40_007345170_1 |
Workunit | 7542600 |
Created | 6 Jul 2011, 13:29:33 UTC |
Sent | 22 Jul 2011, 10:39:40 UTC |
Report deadline | 21 Oct 2011, 18:06:51 UTC |
Received | 13 Aug 2011, 22:48:22 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1115097 |
Run time | 17 days 21 hours 9 min |
CPU time | 14 days 0 hours 11 min 39 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 2.18 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> - exit code 193 (0xc1) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 23:17:58 (9244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:17:59 (9244): No heartbeat from core client for 30 sec - exiting 11:19:39 (13468): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:25:58 (15896): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:11:08 (16396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:11:09 (16396): No heartbeat from core client for 30 sec - exiting 13:29:20 (12484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:20:52 (17084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:23:54 (13604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:27:07 (15788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:30:10 (16316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:39:14 (16532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:42:17 (16064): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:45:30 (3528): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:35 (9396): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:05:39 (12600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:44:50 (16868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=9276, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... 13:32:06 (12728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:07 (12728): No heartbeat from core client for 30 sec - exiting 13:32:08 (12728): No heartbeat from core client for 30 sec - exiting 13:32:09 (12728): No heartbeat from core client for 30 sec - exiting 13:32:10 (12728): No heartbeat from core client for 30 sec - exiting 13:32:11 (12728): No heartbeat from core client for 30 sec - exiting 13:32:12 (12728): No heartbeat from core client for 30 sec - exiting 14:28:37 (17120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:11:42 (20220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:11:43 (20220): No heartbeat from core client for 30 sec - exiting 06:11:44 (20220): No heartbeat from core client for 30 sec - exiting 06:11:45 (20220): No heartbeat from core client for 30 sec - exiting 06:11:46 (20220): No heartbeat from core client for 30 sec - exiting 06:11:47 (20220): No heartbeat from core client for 30 sec - exiting 06:11:48 (20220): No heartbeat from core client for 30 sec - exiting 18:30:09 (22268): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:30:10 (22268): No heartbeat from core client for 30 sec - exiting 19:21:19 (28856): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:45:27 (26148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:45:28 (26148): No heartbeat from core client for 30 sec - exiting 01:21:26 (29132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:01:21 (26744): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:08 (27092): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:21:09 (27092): No heartbeat from core client for 30 sec - exiting 03:21:10 (27092): No heartbeat from core client for 30 sec - exiting 03:21:11 (27092): No heartbeat from core client for 30 sec - exiting 03:21:12 (27092): No heartbeat from core client for 30 sec - exiting 03:21:13 (27092): No heartbeat from core client for 30 sec - exiting 08:49:53 (30504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:49:54 (30504): No heartbeat from core client for 30 sec - exiting 08:49:55 (30504): No heartbeat from core client for 30 sec - exiting 08:49:57 (30504): No heartbeat from core client for 30 sec - exiting 08:49:58 (30504): No heartbeat from core client for 30 sec - exiting 08:56:02 (29176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:23:21 (30500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:50:44 (30036): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:53:56 (30480): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:29:03 (29924): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:43:58 (30220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:42:22 (26104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:42:24 (26104): No heartbeat from core client for 30 sec - exiting 15:42:25 (26104): No heartbeat from core client for 30 sec - exiting 15:42:26 (26104): No heartbeat from core client for 30 sec - exiting 15:42:27 (26104): No heartbeat from core client for 30 sec - exiting 15:42:28 (26104): No heartbeat from core client for 30 sec - exiting 15:54:34 (30240): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:57:46 (28428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:00:47 (3724): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:03:50 (30428): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:25:10 (29908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:28:12 (29232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:28:13 (29232): No heartbeat from core client for 30 sec - exiting 17:04:24 (29512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:16:28 (32060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:35:30 (28368): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:41:32 (31668): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:41:34 (19692): Can't acquire lockfile (32) - waiting 35s 17:53:46 (19692): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:21:16 (26232): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:49:23 (32540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:58:27 (32172): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:28:37 (30440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:28:38 (30440): No heartbeat from core client for 30 sec - exiting 06:21:52 (20596): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:21:53 (20596): No heartbeat from core client for 30 sec - exiting 06:21:54 (20596): No heartbeat from core client for 30 sec - exiting 06:21:55 (20596): No heartbeat from core client for 30 sec - exiting 06:21:56 (20596): No heartbeat from core client for 30 sec - exiting 07:13:30 (30780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:37:47 (31648): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:37:48 (31648): No heartbeat from core client for 30 sec - exiting 07:37:49 (31648): No heartbeat from core client for 30 sec - exiting 07:37:50 (31648): No heartbeat from core client for 30 sec - exiting 07:37:52 (31648): No heartbeat from core client for 30 sec - exiting 07:43:56 (31940): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:20:18 (25576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:26:32 (33132): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:35:36 (33684): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:44:41 (31768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:02:59 (30568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:06:59 (30868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:07:00 (30868): No heartbeat from core client for 30 sec - exiting 13:09:54 (29980): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:25:00 (32616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:25:01 (32616): No heartbeat from core client for 30 sec - exiting 13:58:31 (33316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:10:35 (33196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:47:43 (32032): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:50:47 (34008): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:07:12 (30532): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:28:20 (33880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:28:21 (33880): No heartbeat from core client for 30 sec - exiting 16:28:22 (33880): No heartbeat from core client for 30 sec - exiting 16:28:23 (33880): No heartbeat from core client for 30 sec - exiting 16:28:24 (33880): No heartbeat from core client for 30 sec - exiting 16:28:25 (33880): No heartbeat from core client for 30 sec - exiting 16:28:26 (33880): No heartbeat from core client for 30 sec - exiting 17:04:36 (31880): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:04:37 (31880): No heartbeat from core client for 30 sec - exiting 17:04:38 (31880): No heartbeat from core client for 30 sec - exiting 17:04:39 (31880): No heartbeat from core client for 30 sec - exiting 17:08:21 (34716): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:49:25 (32972): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:58:29 (33048): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:03 (34296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:20:46 (31440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:23:48 (32844): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:41:53 (34820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:25:05 (32112): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:24:22 (34076): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:24:53 (11316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:56:40 (9284): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x774936F9 read attempt to address 0xFFFFFFF8 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_y9i0_1900_40_007345170/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
13 Aug 2011 22:52:07 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 518,400 | 1,210,291 | 2.3347 |
13 Aug 2011 07:32:33 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 492,480 | 1,157,280 | 2.3499 |
12 Aug 2011 15:27:54 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 466,560 | 1,102,585 | 2.3632 |
10 Aug 2011 17:49:42 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 440,640 | 1,045,592 | 2.3729 |
09 Aug 2011 23:03:30 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 414,720 | 983,123 | 2.3706 |
05 Aug 2011 12:01:48 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 388,800 | 923,218 | 2.3745 |
04 Aug 2011 04:31:46 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 362,880 | 864,404 | 2.3821 |
02 Aug 2011 21:15:04 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 336,960 | 807,136 | 2.3953 |
01 Aug 2011 18:55:34 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 311,040 | 747,832 | 2.4043 |
31 Jul 2011 23:17:55 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 285,120 | 685,784 | 2.4052 |
31 Jul 2011 05:38:24 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 259,200 | 624,246 | 2.4084 |
30 Jul 2011 11:56:02 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 233,280 | 562,249 | 2.4102 |
29 Jul 2011 17:53:52 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 207,360 | 499,345 | 2.4081 |
28 Jul 2011 19:59:30 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 181,440 | 436,010 | 2.4031 |
27 Jul 2011 16:09:38 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 155,520 | 375,865 | 2.4168 |
26 Jul 2011 19:46:47 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 129,600 | 312,074 | 2.4080 |
26 Jul 2011 00:29:11 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 103,680 | 247,788 | 2.3899 |
25 Jul 2011 22:15:34 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 77,760 | 185,567 | 2.3864 |
25 Jul 2011 21:13:54 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 51,840 | 124,344 | 2.3986 |
25 Jul 2011 20:28:51 | 1115097 | 13094206 | hadcm3n_y9i0_1900_40_007345170_1 | 25,920 | 61,334 | 2.3663 |
©2024 cpdn.org