Name | hadcm3n_zjnm_1920_40_008271226_3 |
Workunit | 8426350 |
Created | 2 Jul 2013, 5:00:24 UTC |
Sent | 2 Jul 2013, 8:26:37 UTC |
Report deadline | 1 Oct 2013, 15:53:48 UTC |
Received | 18 Aug 2013, 2:09:46 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1178372 |
Run time | 22 days 5 hours 9 min 48 sec |
CPU time | 20 days 13 hours 53 min 19 sec |
Validate state | Invalid |
Credit | 6,531.84 |
Device peak FLOPS | 1.00 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> Das Gerät erkennt den Befehl nicht. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 17:30:07 (4244): No heartbeat from core client for 30 sec - exiting 17:30:08 (4244): No heartbeat from core client for 30 sec - exiting 17:30:09 (4244): No heartbeat from core client for 30 sec - exiting 17:30:10 (4244): No heartbeat from core client for 30 sec - exiting 17:30:11 (4244): No heartbeat from core client for 30 sec - exiting 17:30:12 (4244): No heartbeat from core client for 30 sec - exiting 17:30:13 (4244): No heartbeat from core client for 30 sec - exiting 17:30:14 (4244): No heartbeat from core client for 30 sec - exiting 17:30:15 (4244): No heartbeat from core client for 30 sec - exiting 17:30:16 (4244): No heartbeat from core client for 30 sec - exiting 17:30:18 (4244): No heartbeat from core client for 30 sec - exiting 17:30:19 (4244): No heartbeat from core client for 30 sec - exiting 17:30:20 (4244): No heartbeat from core client for 30 sec - exiting 17:30:21 (4244): No heartbeat from core client for 30 sec - exiting 17:30:22 (4244): No heartbeat from core client for 30 sec - exiting 17:30:23 (4244): No heartbeat from core client for 30 sec - exiting 17:30:24 (4244): No heartbeat from core client for 30 sec - exiting 17:30:25 (4244): No heartbeat from core client for 30 sec - exiting 17:30:26 (4244): No heartbeat from core client for 30 sec - exiting 17:30:27 (4244): No heartbeat from core client for 30 sec - exiting 17:30:28 (4244): No heartbeat from core client for 30 sec - exiting 17:30:30 (4244): No heartbeat from core client for 30 sec - exiting 17:30:31 (4244): No heartbeat from core client for 30 sec - exiting 17:30:32 (4244): No heartbeat from core client for 30 sec - exiting 17:30:33 (4244): No heartbeat from core client for 30 sec - exiting 17:30:34 (4244): No heartbeat from core client for 30 sec - exiting 17:30:36 (4244): No heartbeat from core client for 30 sec - exiting 17:30:37 (4244): No heartbeat from core client for 30 sec - exiting 17:30:38 (4244): No heartbeat from core client for 30 sec - exiting 17:30:39 (4244): No heartbeat from core client for 30 sec - exiting 17:30:40 (4244): No heartbeat from core client for 30 sec - exiting 17:30:41 (4244): No heartbeat from core client for 30 sec - exiting 17:30:42 (4244): No heartbeat from core client for 30 sec - exiting 17:30:43 (4244): No heartbeat from core client for 30 sec - exiting 17:30:44 (4244): No heartbeat from core client for 30 sec - exiting 17:30:45 (4244): No heartbeat from core client for 30 sec - exiting 17:30:47 (4244): No heartbeat from core client for 30 sec - exiting 17:30:48 (4244): No heartbeat from core client for 30 sec - exiting 17:30:49 (4244): No heartbeat from core client for 30 sec - exiting 17:30:50 (4244): No heartbeat from core client for 30 sec - exiting 17:30:51 (4244): No heartbeat from core client for 30 sec - exiting 17:30:52 (4244): No heartbeat from core client for 30 sec - exiting 17:30:53 (4244): No heartbeat from core client for 30 sec - exiting 17:30:54 (4244): No heartbeat from core client for 30 sec - exiting 17:30:55 (4244): No heartbeat from core client for 30 sec - exiting 17:30:56 (4244): No heartbeat from core client for 30 sec - exiting 17:30:57 (4244): No heartbeat from core client for 30 sec - exiting 17:30:59 (4244): No heartbeat from core client for 30 sec - exiting 17:31:00 (4244): No heartbeat from core client for 30 sec - exiting 17:31:01 (4244): No heartbeat from core client for 30 sec - exiting 17:31:02 (4244): No heartbeat from core client for 30 sec - exiting 17:31:03 (4244): No heartbeat from core client for 30 sec - exiting 17:31:04 (4244): No heartbeat from core client for 30 sec - exiting 17:31:06 (4244): No heartbeat from core client for 30 sec - exiting 17:31:07 (4244): No heartbeat from core client for 30 sec - exiting 17:31:08 (4244): No heartbeat from core client for 30 sec - exiting 17:31:09 (4244): No heartbeat from core client for 30 sec - exiting 17:31:10 (4244): No heartbeat from core client for 30 sec - exiting 17:31:11 (4244): No heartbeat from core client for 30 sec - exiting 17:31:12 (4244): No heartbeat from core client for 30 sec - exiting 17:31:13 (4244): No heartbeat from core client for 30 sec - exiting 17:31:14 (4244): No heartbeat from core client for 30 sec - exiting 17:31:15 (4244): No heartbeat from core client for 30 sec - exiting 17:31:17 (4244): No heartbeat from core client for 30 sec - exiting 17:31:18 (4244): No heartbeat from core client for 30 sec - exiting 17:31:19 (4244): No heartbeat from core client for 30 sec - exiting 17:31:20 (4244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 07:55:19 (7144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:48:00 (6196): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:55:03 (4600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 20:21:58 (7176): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:42:10 (3016): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:50:39 (7124): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:33:11 (4344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:12:21 (5384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4464, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4600, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2884, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
18 Aug 2013 02:13:18 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 544,320 | 1,774,606 | 3.2602 |
18 Aug 2013 02:13:18 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 518,400 | 1,688,664 | 3.2575 |
18 Aug 2013 02:13:18 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 492,480 | 1,657,112 | 3.3648 |
18 Aug 2013 02:13:18 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 466,560 | 1,625,612 | 3.4843 |
18 Aug 2013 02:13:18 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 440,640 | 1,594,026 | 3.6175 |
18 Aug 2013 02:13:17 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 414,720 | 1,558,534 | 3.7580 |
30 Jul 2013 10:04:19 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 388,800 | 1,472,314 | 3.7868 |
29 Jul 2013 13:40:39 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 362,880 | 1,316,183 | 3.6270 |
26 Jul 2013 09:51:53 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 336,960 | 1,229,928 | 3.6501 |
25 Jul 2013 09:25:12 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 311,040 | 1,145,963 | 3.6843 |
25 Jul 2013 00:00:46 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 285,120 | 1,112,304 | 3.9012 |
24 Jul 2013 13:32:45 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 259,200 | 1,075,018 | 4.1474 |
23 Jul 2013 22:16:48 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 233,280 | 1,013,897 | 4.3463 |
23 Jul 2013 20:22:04 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 207,360 | 830,102 | 4.0032 |
23 Jul 2013 19:51:53 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 181,440 | 768,462 | 4.2354 |
23 Jul 2013 14:36:37 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 155,520 | 653,867 | 4.2044 |
23 Jul 2013 14:36:37 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 129,600 | 584,851 | 4.5127 |
23 Jul 2013 14:36:37 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 103,680 | 518,219 | 4.9983 |
23 Jul 2013 14:36:36 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 77,760 | 411,063 | 5.2863 |
23 Jul 2013 14:36:36 | 1178372 | 15876060 | hadcm3n_zjnm_1920_40_008271226_3 | 51,840 | 304,026 | 5.8647 |
©2024 cpdn.org