Name | hadcm3n_n2n0_1920_40_008322140_4 |
Workunit | 8473275 |
Created | 2 Oct 2013, 13:10:09 UTC |
Sent | 2 Oct 2013, 13:30:11 UTC |
Report deadline | 1 Jan 2014, 20:57:22 UTC |
Received | 18 Oct 2013, 6:11:16 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1175450 |
Run time | 8 days 19 hours 52 min 55 sec |
CPU time | 7 days 22 hours 41 min 52 sec |
Validate state | Invalid |
Credit | 3,732.48 |
Device peak FLOPS | 3.05 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> Das Gerät erkennt den Befehl nicht. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 02:01:43 (4488): No heartbeat from core client for 30 sec - exiting 02:01:44 (4488): No heartbeat from core client for 30 sec - exiting 02:01:45 (4488): No heartbeat from core client for 30 sec - exiting 02:01:46 (4488): No heartbeat from core client for 30 sec - exiting 02:01:47 (4488): No heartbeat from core client for 30 sec - exiting 02:01:48 (4488): No heartbeat from core client for 30 sec - exiting 02:01:49 (4488): No heartbeat from core client for 30 sec - exiting 02:01:50 (4488): No heartbeat from core client for 30 sec - exiting 02:01:51 (4488): No heartbeat from core client for 30 sec - exiting 02:01:52 (4488): No heartbeat from core client for 30 sec - exiting 02:01:53 (4488): No heartbeat from core client for 30 sec - exiting 02:01:54 (4488): No heartbeat from core client for 30 sec - exiting 02:01:55 (4488): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:16:46 (3272): No heartbeat from core client for 30 sec - exiting 23:16:47 (3272): No heartbeat from core client for 30 sec - exiting 23:16:48 (3272): No heartbeat from core client for 30 sec - exiting 23:16:49 (3272): No heartbeat from core client for 30 sec - exiting 23:16:50 (3272): No heartbeat from core client for 30 sec - exiting 23:16:51 (3272): No heartbeat from core client for 30 sec - exiting 23:16:52 (3272): No heartbeat from core client for 30 sec - exiting 23:16:53 (3272): No heartbeat from core client for 30 sec - exiting 23:16:54 (3272): No heartbeat from core client for 30 sec - exiting 23:16:55 (3272): No heartbeat from core client for 30 sec - exiting 23:16:56 (3272): No heartbeat from core client for 30 sec - exiting 23:16:57 (3272): No heartbeat from core client for 30 sec - exiting 23:16:58 (3272): No heartbeat from core client for 30 sec - exiting 23:16:59 (3272): No heartbeat from core client for 30 sec - exiting 23:17:00 (3272): No heartbeat from core client for 30 sec - exiting 23:17:01 (3272): No heartbeat from core client for 30 sec - exiting 23:17:02 (3272): No heartbeat from core client for 30 sec - exiting 23:17:03 (3272): No heartbeat from core client for 30 sec - exiting 23:17:04 (3272): No heartbeat from core client for 30 sec - exiting 23:17:05 (3272): No heartbeat from core client for 30 sec - exiting 23:17:06 (3272): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 23:11:35 (4708): No heartbeat from core client for 30 sec - exiting 23:11:36 (4708): No heartbeat from core client for 30 sec - exiting 23:11:37 (4708): No heartbeat from core client for 30 sec - exiting 23:11:38 (4708): No heartbeat from core client for 30 sec - exiting 23:11:39 (4708): No heartbeat from core client for 30 sec - exiting 23:11:40 (4708): No heartbeat from core client for 30 sec - exiting 23:11:41 (4708): No heartbeat from core client for 30 sec - exiting 23:11:42 (4708): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 22:53:06 (5732): No heartbeat from core client for 30 sec - exiting 22:53:07 (5732): No heartbeat from core client for 30 sec - exiting 22:53:08 (5732): No heartbeat from core client for 30 sec - exiting 22:53:10 (5732): No heartbeat from core client for 30 sec - exiting 22:53:11 (5732): No heartbeat from core client for 30 sec - exiting 22:53:12 (5732): No heartbeat from core client for 30 sec - exiting 22:53:13 (5732): No heartbeat from core client for 30 sec - exiting 22:53:14 (5732): No heartbeat from core client for 30 sec - exiting 22:53:15 (5732): No heartbeat from core client for 30 sec - exiting 22:53:16 (5732): No heartbeat from core client for 30 sec - exiting 22:53:17 (5732): No heartbeat from core client for 30 sec - exiting 22:53:18 (5732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5032, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Oct 2013 02:45:29 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 311,040 | 659,826 | 2.1214 |
16 Oct 2013 05:03:32 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 285,120 | 605,649 | 2.1242 |
15 Oct 2013 07:29:51 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 259,200 | 552,182 | 2.1303 |
14 Oct 2013 10:02:29 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 233,280 | 496,720 | 2.1293 |
13 Oct 2013 05:43:11 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 207,360 | 440,383 | 2.1238 |
11 Oct 2013 03:02:11 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 181,440 | 383,549 | 2.1139 |
10 Oct 2013 09:46:42 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 155,520 | 324,904 | 2.0891 |
09 Oct 2013 13:10:45 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 129,600 | 269,370 | 2.0785 |
08 Oct 2013 15:58:14 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 103,680 | 212,967 | 2.0541 |
07 Oct 2013 23:48:28 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 77,760 | 157,235 | 2.0221 |
06 Oct 2013 02:41:19 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 51,840 | 104,329 | 2.0125 |
04 Oct 2013 01:42:42 | 1175450 | 16053733 | hadcm3n_n2n0_1920_40_008322140_4 | 25,920 | 50,353 | 1.9426 |
©2024 cpdn.org