Name | hadcm3n_877s_1980_40_008515855_4 |
Workunit | 8663367 |
Created | 2 Jun 2014, 6:02:15 UTC |
Sent | 2 Jun 2014, 6:02:18 UTC |
Report deadline | 1 Sep 2014, 13:29:29 UTC |
Received | 27 Jul 2014, 10:08:25 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1314952 |
Run time | 8 days 21 hours 55 min 0 sec |
CPU time | 4 days 22 hours 43 min 47 sec |
Validate state | Invalid |
Credit | 6,220.80 |
Device peak FLOPS | 3.31 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> (unknown error) - exit code 193 (0xc1) </message> <stderr_txt> 04:36:48 (144824): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:36:51 (144824): No heartbeat from core client for 30 sec - exiting 09:04:21 (148720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:05:27 (148720): No heartbeat from core client for 30 sec - exiting 09:05:28 (148720): No heartbeat from core client for 30 sec - exiting 09:05:29 (148720): No heartbeat from core client for 30 sec - exiting 09:05:30 (148720): No heartbeat from core client for 30 sec - exiting 09:05:31 (148720): No heartbeat from core client for 30 sec - exiting 09:05:32 (148720): No heartbeat from core client for 30 sec - exiting 09:05:33 (148720): No heartbeat from core client for 30 sec - exiting 09:05:34 (148720): No heartbeat from core client for 30 sec - exiting 09:05:35 (148720): No heartbeat from core client for 30 sec - exiting 09:05:36 (148720): No heartbeat from core client for 30 sec - exiting 09:06:15 (149160): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:36:44 (148500): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:18:44 (149144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:38:57 (154212): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 forrtl: There is not enough space on the disk. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=156540, iMonCtr=1 Model crash detected, will try to restart... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 13:18:02 (156540): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:18:03 (156540): No heartbeat from core client for 30 sec - exiting 13:18:04 (156540): No heartbeat from core client for 30 sec - exiting 13:18:05 (156540): No heartbeat from core client for 30 sec - exiting 13:18:06 (156540): No heartbeat from core client for 30 sec - exiting 13:18:07 (156540): No heartbeat from core client for 30 sec - exiting 13:18:08 (156540): No heartbeat from core client for 30 sec - exiting 13:18:09 (156540): No heartbeat from core client for 30 sec - exiting 13:18:10 (156540): No heartbeat from core client for 30 sec - exiting 13:18:11 (156540): No heartbeat from core client for 30 sec - exiting 13:18:12 (156540): No heartbeat from core client for 30 sec - exiting 13:25:35 (201840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:25:36 (201840): No heartbeat from core client for 30 sec - exiting 13:25:37 (201840): No heartbeat from core client for 30 sec - exiting 13:25:38 (201840): No heartbeat from core client for 30 sec - exiting 13:25:39 (201840): No heartbeat from core client for 30 sec - exiting 13:25:40 (201840): No heartbeat from core client for 30 sec - exiting 13:25:41 (201840): No heartbeat from core client for 30 sec - exiting 13:25:42 (201840): No heartbeat from core client for 30 sec - exiting 13:25:45 (201840): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:56:00 (14728): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:56:12 (14728): No heartbeat from core client for 30 sec - exiting 02:56:13 (14728): No heartbeat from core client for 30 sec - exiting 02:56:14 (14728): No heartbeat from core client for 30 sec - exiting 02:56:15 (14728): No heartbeat from core client for 30 sec - exiting 02:56:16 (14728): No heartbeat from core client for 30 sec - exiting 02:56:17 (14728): No heartbeat from core client for 30 sec - exiting 02:56:18 (14728): No heartbeat from core client for 30 sec - exiting 02:56:19 (14728): No heartbeat from core client for 30 sec - exiting 02:56:20 (14728): No heartbeat from core client for 30 sec - exiting 02:56:21 (14728): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 08:41:36 (19424): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:41:37 (19424): No heartbeat from core client for 30 sec - exiting 08:41:38 (19424): No heartbeat from core client for 30 sec - exiting 08:41:39 (19424): No heartbeat from core client for 30 sec - exiting 08:41:40 (19424): No heartbeat from core client for 30 sec - exiting 08:41:41 (19424): No heartbeat from core client for 30 sec - exiting 08:41:42 (19424): No heartbeat from core client for 30 sec - exiting 08:41:43 (19424): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 09:03:51 (24548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:03:52 (24548): No heartbeat from core client for 30 sec - exiting 09:03:53 (24548): No heartbeat from core client for 30 sec - exiting 09:03:54 (24548): No heartbeat from core client for 30 sec - exiting 09:03:55 (24548): No heartbeat from core client for 30 sec - exiting 09:03:56 (24548): No heartbeat from core client for 30 sec - exiting 09:03:57 (24548): No heartbeat from core client for 30 sec - exiting 09:03:58 (24548): No heartbeat from core client for 30 sec - exiting Atmos Hold Restart file rename failed on atmos_restart.hold 14:29:29 (25108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:29:30 (25108): No heartbeat from core client for 30 sec - exiting 14:29:31 (25108): No heartbeat from core client for 30 sec - exiting 14:29:32 (25108): No heartbeat from core client for 30 sec - exiting 14:29:33 (25108): No heartbeat from core client for 30 sec - exiting 14:29:34 (25108): No heartbeat from core client for 30 sec - exiting 14:29:35 (25108): No heartbeat from core client for 30 sec - exiting 14:29:36 (25108): No heartbeat from core client for 30 sec - exiting 14:29:37 (25108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 15:48:17 (27420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:04 (36148): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:05 (36148): No heartbeat from core client for 30 sec - exiting 18:04:06 (36148): No heartbeat from core client for 30 sec - exiting 18:04:07 (36148): No heartbeat from core client for 30 sec - exiting 18:04:08 (36148): No heartbeat from core client for 30 sec - exiting 18:04:09 (36148): No heartbeat from core client for 30 sec - exiting 18:04:10 (36148): No heartbeat from core client for 30 sec - exiting 18:04:11 (36148): No heartbeat from core client for 30 sec - exiting 18:04:12 (36148): No heartbeat from core client for 30 sec - exiting 18:04:13 (36148): No heartbeat from core client for 30 sec - exiting 18:04:14 (36148): No heartbeat from core client for 30 sec - exiting 19:04:25 (38300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:04:32 (38300): No heartbeat from core client for 30 sec - exiting 19:04:33 (38300): No heartbeat from core client for 30 sec - exiting 19:04:34 (38300): No heartbeat from core client for 30 sec - exiting 19:04:35 (38300): No heartbeat from core client for 30 sec - exiting 19:04:36 (38300): No heartbeat from core client for 30 sec - exiting 19:04:37 (38300): No heartbeat from core client for 30 sec - exiting 21:07:48 (39984): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:32 (39768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:51:38 (39768): No heartbeat from core client for 30 sec - exiting 07:51:39 (39768): No heartbeat from core client for 30 sec - exiting 07:51:40 (39768): No heartbeat from core client for 30 sec - exiting 07:51:41 (39768): No heartbeat from core client for 30 sec - exiting 07:51:42 (39768): No heartbeat from core client for 30 sec - exiting 07:51:43 (39768): No heartbeat from core client for 30 sec - exiting 07:51:44 (39768): No heartbeat from core client for 30 sec - exiting 07:51:45 (39768): No heartbeat from core client for 30 sec - exiting 07:51:46 (39768): No heartbeat from core client for 30 sec - exiting 07:51:47 (39768): No heartbeat from core client for 30 sec - exiting 17:17:38 (41520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:17:39 (41520): No heartbeat from core client for 30 sec - exiting 17:17:40 (41520): No heartbeat from core client for 30 sec - exiting 17:17:41 (41520): No heartbeat from core client for 30 sec - exiting 17:17:42 (41520): No heartbeat from core client for 30 sec - exiting 17:17:43 (41520): No heartbeat from core client for 30 sec - exiting 17:17:44 (41520): No heartbeat from core client for 30 sec - exiting 17:17:45 (41520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:26:48 (48680): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:05:41 (52196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76EC3AC3 read attempt to address 0x40E167B4 Engaging BOINC Windows Runtime Debugger... Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x76EC3AC3 read attempt to address 0x40E167B4 Engaging BOINC Windows Runtime Debugger... Cannot serialize file C:\ProgramData\BOINC/projects/climateprediction.net/hadcm3n_877s_1980_40_008515855/dataout/shmem_restart.day Signal 11 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Jul 2014 10:10:41 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 518,400 | 412,429 | 0.7956 |
27 Jul 2014 10:10:41 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 492,480 | 381,862 | 0.7754 |
27 Jul 2014 10:10:41 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 466,560 | 351,567 | 0.7535 |
27 Jul 2014 10:10:41 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 440,640 | 321,146 | 0.7288 |
21 Jul 2014 10:09:05 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 414,720 | 291,052 | 0.7018 |
21 Jul 2014 00:05:32 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 388,800 | 261,113 | 0.6716 |
20 Jul 2014 04:23:10 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 362,880 | 230,956 | 0.6365 |
20 Jul 2014 04:23:10 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 336,960 | 201,676 | 0.5985 |
25 Jun 2014 02:41:38 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 311,040 | 178,886 | 0.5751 |
24 Jun 2014 21:19:29 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 285,120 | 160,141 | 0.5617 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 259,200 | 141,263 | 0.5450 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 233,280 | 122,176 | 0.5237 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 207,360 | 102,141 | 0.4926 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 181,440 | 83,237 | 0.4588 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 155,520 | 64,299 | 0.4134 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 129,600 | 45,387 | 0.3502 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 103,680 | 120,246 | 1.1598 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 77,760 | 89,718 | 1.1538 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 51,840 | 59,103 | 1.1401 |
24 Jun 2014 16:58:43 | 1314952 | 16659190 | hadcm3n_877s_1980_40_008515855_4 | 25,920 | 29,285 | 1.1298 |
©2024 cpdn.org