Name | hadcm3n_85ew_1980_40_008464972_0 |
Workunit | 8615811 |
Created | 19 Sep 2013, 14:50:02 UTC |
Sent | 20 Sep 2013, 6:15:41 UTC |
Report deadline | 20 Dec 2013, 13:42:52 UTC |
Received | 29 Oct 2013, 12:25:52 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1285155 |
Run time | 31 days 4 hours 8 min 11 sec |
CPU time | 25 days 10 hours 29 min 9 sec |
Validate state | Invalid |
Credit | 10,886.40 |
Device peak FLOPS | 2.03 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.23</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:36:36 (3964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:18:23 (3864): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4736, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:53:31 (4912): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3788, iMonCtr=1 Model crash detected, will try to restart... CSuspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:23:43 (2248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:23:44 (2248): No heartbeat from core client for 30 sec - exiting 03:23:45 (2248): No heartbeat from core client for 30 sec - exiting 03:23:46 (2248): No heartbeat from core client for 30 sec - exiting 03:23:53 (5324): Can't acquire lockfile (32) - waiting 35s Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5324, iMonCtr=1 Model crash detected, will try to restart... 
03:22:42 (4996): No heartbeat from core client for 30 sec - exiting 03:22:43 (4996): No heartbeat from core client for 30 sec - exiting 03:22:44 (4996): No heartbeat from core client for 30 sec - exiting 03:22:45 (4996): No heartbeat from core client for 30 sec - exiting 03:22:46 (4996): No heartbeat from core client for 30 sec - exiting 03:22:47 (4996): No heartbeat from core client for 30 sec - exiting 03:22:48 (4996): No heartbeat from core client for 30 sec - exiting 03:22:49 (4996): No heartbeat from core client for 30 sec - exiting 03:22:50 (4996): No heartbeat from core client for 30 sec - exiting 03:22:51 (4996): No heartbeat from core client for 30 sec - exiting 03:22:52 (4996): No heartbeat from core client for 30 sec - exiting 03:22:53 (4996): No heartbeat from core client for 30 sec - exiting 03:22:54 (4996): No heartbeat from core client for 30 sec - exiting 03:22:56 (4996): No heartbeat from core client for 30 sec - exiting 03:22:57 (4996): No heartbeat from core client for 30 sec - exiting 03:22:58 (4996): No heartbeat from core client for 30 sec - exiting 03:22:59 (4996): No heartbeat from core client for 30 sec - exiting 03:23:00 (4996): No heartbeat from core client for 30 sec - exiting 03:23:01 (4996): No heartbeat from core client for 30 sec - exiting 03:23:02 (4996): No heartbeat from core client for 30 sec - exiting 03:23:03 (4996): No heartbeat from core client for 30 sec - exiting 03:23:04 (4996): No heartbeat from core client for 30 sec - exiting 03:23:05 (4996): No heartbeat from core client for 30 sec - exiting 03:23:06 (4996): No heartbeat from core client for 30 sec - exiting 03:23:07 (4996): No heartbeat from core client for 30 sec - exiting 03:23:08 (4996): No heartbeat from core client for 30 sec - exiting 03:23:09 (4996): No heartbeat from core client for 30 sec - exiting 03:23:10 (4996): No heartbeat from core client for 30 sec - exiting 03:23:11 (4996): No heartbeat from core client for 30 sec - exiting 03:23:12 (4996): No 
heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 03:24:52 (5052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 21:19:39 (4220): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 16:26:10 (2776): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:22:46 (3628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 18:29:26 (5060): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:31:40 (5080): No heartbeat from core client for 30 sec - exiting 18:31:41 (5080): No heartbeat from core client for 30 sec - exiting 18:31:42 (5080): No heartbeat from core client for 30 sec - exiting 18:31:43 (5080): No heartbeat from core client for 30 sec - exiting 18:31:44 (5080): No heartbeat from core client for 30 sec - exiting 18:31:45 (5080): No heartbeat from core client for 30 sec - exiting 18:31:46 (5080): No heartbeat from core client for 30 sec - exiting 18:31:47 (5080): No heartbeat from core client for 30 sec - exiting 18:31:48 (5080): No heartbeat from core client for 30 sec - exiting 18:31:49 (5080): No heartbeat from core client for 30 sec - exiting 18:31:50 (5080): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:37:14 (4792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:37:15 (4792): No heartbeat from core client for 30 sec - exiting 18:37:16 (4792): No heartbeat from core client for 30 sec - exiting 18:37:17 (4792): No heartbeat from core client for 30 sec - exiting 18:37:18 (4792): No heartbeat from core client for 30 sec - exiting 18:37:19 (4792): No heartbeat from core client for 30 sec - exiting 18:37:20 (4792): No heartbeat from core client for 30 sec - exiting 18:37:21 (4792): No heartbeat from core client for 30 sec - exiting 18:37:22 (4792): No heartbeat from core client for 30 sec - exiting 18:37:23 (4792): No heartbeat from core client for 30 sec - exiting 18:37:24 (4792): No heartbeat from core client for 30 sec - exiting 18:39:11 (5964): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:39:12 (5964): No heartbeat from core client for 30 sec - exiting 18:39:13 (5964): No heartbeat from core client for 30 sec - exiting 18:39:14 (5964): No heartbeat from core client for 30 sec - exiting 18:39:15 (5964): No heartbeat from core client for 30 sec - exiting 18:39:16 (5964): No heartbeat from core client for 30 sec - exiting 18:39:17 (5964): No heartbeat from core client for 30 sec - exiting 18:39:18 (5964): No heartbeat from core client for 30 sec - exiting 18:42:11 (1996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:42:12 (1996): No heartbeat from core client for 30 sec - exiting 18:42:13 (1996): No heartbeat from core client for 30 sec - exiting 18:42:14 (1996): No heartbeat from core client for 30 sec - exiting 18:42:15 (1996): No heartbeat from core client for 30 sec - exiting 18:42:16 (1996): No heartbeat from core client for 30 sec - exiting 18:42:17 (1996): No heartbeat from core client for 30 sec - exiting 18:42:18 (1996): No heartbeat from core client for 30 sec - exiting 18:42:19 (1996): No heartbeat from core client for 30 sec - exiting 18:42:20 (1996): No heartbeat from core client for 30 sec - exiting 18:49:06 (5852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:49:07 (5852): No heartbeat from core client for 30 sec - exiting 18:49:08 (5852): No heartbeat from core client for 30 sec - exiting 18:49:09 (5852): No heartbeat from core client for 30 sec - exiting 18:49:10 (5852): No heartbeat from core client for 30 sec - exiting 18:49:11 (5852): No heartbeat from core client for 30 sec - exiting 18:49:12 (5852): No heartbeat from core client for 30 sec - exiting 18:49:13 (5852): No heartbeat from core client for 30 sec - exiting 18:49:14 (5852): No heartbeat from core client for 30 sec - exiting 18:49:15 (5852): No heartbeat from core client for 30 sec - exiting 18:49:16 (5852): No heartbeat from core client for 30 sec - exiting BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 forrtl: There is not enough space on the disk. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3120, iMonCtr=1 Model crash detected, will try to restart... forrtl: There is not enough space on the disk. 
Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3120, iMonCtr=1 Model crash detected, will try to restart... forrtl: There is not enough space on the disk. Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3120, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
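The stderr output above is long and repetitive, so a quick tally of message types can make the failure sequence easier to read. The following is a minimal sketch, not part of BOINC or the CPDN application: the function name `tally_events`, the category labels, and the regular expressions are all hypothetical, chosen only to match phrases that appear in this log.

```python
import re
from collections import Counter

# Hypothetical categories for this sketch; each pattern matches a phrase
# that appears verbatim in the stderr text of this result.
PATTERNS = {
    "heartbeat_lost": re.compile(r"No heartbeat from core client"),
    "suspend_request": re.compile(r"Suspend request from BOINC"),
    "model_restart": re.compile(r"Model crash detected, will try to restart"),
    "buffout_io_error": re.compile(r"BUFFOUT: C I/O Error"),
}


def tally_events(stderr_text: str) -> Counter:
    """Count how often each known message type occurs in the stderr text."""
    counts = Counter()
    for name, pattern in PATTERNS.items():
        counts[name] = len(pattern.findall(stderr_text))
    return counts


# A short excerpt assembled from messages in the log above.
sample = (
    "Suspended CPDN Monitor - Suspend request from BOINC... "
    "18:36:36 (3964): No heartbeat from core client for 30 sec - exiting "
    "CPDN Monitor - No 'heartbeat' from BOINC... "
    "18:49:16 (5852): No heartbeat from core client for 30 sec - exiting "
    "BUFFOUT: C I/O Error - Return code = 32 "
    "Model crash detected, will try to restart... "
)

if __name__ == "__main__":
    for name, count in tally_events(sample).items():
        print(f"{name}: {count}")
```

Run against the full stderr text, a tally like this would show the heartbeat losses dominating the log, with the BUFFOUT errors and disk-space messages appearing only at the very end, just before the task gave up.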
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
29 Oct 2013 12:29:59 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 907,200 | 2,281,718 | 2.5151 |
29 Oct 2013 12:29:59 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 881,280 | 2,215,546 | 2.5140 |
24 Oct 2013 01:35:26 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 855,360 | 2,150,531 | 2.5142 |
23 Oct 2013 05:40:40 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 829,440 | 2,085,961 | 2.5149 |
22 Oct 2013 09:58:25 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 803,520 | 2,020,791 | 2.5149 |
21 Oct 2013 11:44:22 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 777,600 | 1,955,077 | 2.5142 |
20 Oct 2013 15:04:02 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 751,680 | 1,888,967 | 2.5130 |
19 Oct 2013 17:17:22 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 725,760 | 1,822,941 | 2.5118 |
18 Oct 2013 21:56:10 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 699,840 | 1,757,863 | 2.5118 |
16 Oct 2013 13:27:19 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 673,920 | 1,692,595 | 2.5116 |
15 Oct 2013 17:23:36 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 648,000 | 1,627,160 | 2.5110 |
14 Oct 2013 20:53:32 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 622,080 | 1,561,375 | 2.5099 |
13 Oct 2013 18:54:45 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 596,160 | 1,495,914 | 2.5092 |
12 Oct 2013 21:39:31 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 570,240 | 1,431,604 | 2.5105 |
12 Oct 2013 01:41:32 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 544,320 | 1,366,412 | 2.5103 |
11 Oct 2013 05:58:22 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 518,400 | 1,301,117 | 2.5099 |
10 Oct 2013 08:51:13 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 492,480 | 1,235,070 | 2.5079 |
09 Oct 2013 10:58:53 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 466,560 | 1,169,887 | 2.5075 |
08 Oct 2013 13:09:54 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 440,640 | 1,104,157 | 2.5058 |
07 Oct 2013 15:10:45 | 1285155 | 16026319 | hadcm3n_85ew_1980_40_008464972_0 | 414,720 | 1,039,223 | 2.5058 |
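The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is inferred from the values in the table rather than stated on the page, so the sketch below treats it as an assumption; the function name `average_sec_per_timestep` is hypothetical.

```python
def average_sec_per_timestep(cpu_time_sec: int, timestep: int) -> float:
    """Return CPU seconds per model timestep, rounded to 4 decimal places.

    Assumption: the table's "Average (sec/TS)" is simply
    cumulative CPU time / cumulative timestep count.
    """
    if timestep <= 0:
        raise ValueError("timestep must be positive")
    return round(cpu_time_sec / timestep, 4)


# (timestep, CPU time in seconds, average reported in the table)
rows = [
    (907_200, 2_281_718, 2.5151),
    (881_280, 2_215_546, 2.5140),
    (414_720, 1_039_223, 2.5058),
]

if __name__ == "__main__":
    for timestep, cpu_time, reported in rows:
        computed = average_sec_per_timestep(cpu_time, timestep)
        print(f"timestep {timestep}: computed {computed}, reported {reported}")
```

For the three rows checked here, the computed value agrees with the reported one to four decimal places, which supports the assumed formula for this table.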
©2024 cpdn.org