Name | hadcm3n_sbud_1940_40_009112058_2 |
Workunit | 9242394 |
Created | 27 Oct 2014, 10:14:32 UTC |
Sent | 27 Oct 2014, 10:24:33 UTC |
Report deadline | 26 Jan 2015, 17:51:44 UTC |
Received | 29 Oct 2014, 11:04:20 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1289062 |
Run time | 23 hours 49 min 45 sec |
CPU time | 23 hours 18 min 43 sec |
Validate state | Invalid |
Credit | 311.04 |
Device peak FLOPS | 2.64 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.2.42</core_client_version> <![CDATA[ <message> Das Gerät erkennt den Befehl nicht. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 13:46:25 (4344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:47:44 (7568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 15:02:40 (10296): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:03 (9520): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:42 (9300): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:51 (9300): No heartbeat from core client for 30 sec - exiting 15:04:52 (9300): No heartbeat from core client for 30 sec - exiting 15:04:53 (9300): No heartbeat from core client for 30 sec - exiting 15:04:54 (9300): No heartbeat from core client for 30 sec - exiting 15:04:55 (9300): No heartbeat from core client for 30 sec - exiting 15:04:56 (9300): No heartbeat from core client for 30 sec - exiting 15:04:57 (9300): No heartbeat from core client for 30 sec - exiting 15:04:58 (9300): No heartbeat from core client for 30 sec - exiting 15:04:59 (9300): No heartbeat from core client for 30 sec - exiting 15:05:00 (9300): No heartbeat from core client for 30 sec - exiting Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7408, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=7408, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish CPDN Monitor - Quit request from BOINC... 08:35:13 (3828): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:37:03 (780): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:38:00 (4616): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:42:57 (4604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3444, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3568, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish 09:04:35 (3568): No heartbeat from core client for 30 sec - exiting 09:04:36 (3568): No heartbeat from core client for 30 sec - exiting 09:52:37 (4504): No heartbeat from core client for 30 sec - exiting 09:53:23 (4504): No heartbeat from core client for 30 sec - exiting 09:53:24 (4504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 09:55:40 (720): No heartbeat from core client for 30 sec - exiting 09:55:41 (720): No heartbeat from core client for 30 sec - exiting 09:55:42 (720): No heartbeat from core client for 30 sec - exiting 09:55:43 (720): No heartbeat from core client for 30 sec - exiting 09:55:44 (720): No heartbeat from core client for 30 sec - exiting 09:55:45 (720): No heartbeat from core client for 30 sec - exiting 09:55:46 (720): No heartbeat from core client for 30 sec - exiting 09:55:47 (720): No heartbeat from core client for 30 sec - exiting 09:55:48 (720): No heartbeat from core client for 30 sec - exiting 09:55:49 (720): No heartbeat from core client for 30 sec - exiting 09:55:50 (720): No heartbeat from core client for 30 sec - exiting 09:55:51 (720): No heartbeat from core client for 30 sec - exiting 09:55:52 (720): No heartbeat from core client for 30 sec - exiting 09:55:53 (720): No heartbeat from core client for 30 sec - exiting 09:55:54 (720): No heartbeat from core client for 30 sec - exiting 09:55:55 (720): No heartbeat from core client for 30 sec - exiting 09:55:56 (720): No heartbeat from core client for 30 sec - exiting 09:55:57 (720): No heartbeat from core client for 30 sec - exiting 09:55:58 (720): No heartbeat from core client for 30 sec - exiting 09:55:59 (720): No heartbeat from core client for 30 sec - exiting 09:56:00 (720): No heartbeat from core client for 30 sec - exiting 09:56:01 (720): No heartbeat from core client for 30 sec - exiting 09:56:02 (720): No heartbeat from core client for 30 sec - exiting 09:56:03 (720): No heartbeat from core client for 30 sec - exiting 09:56:04 (720): No heartbeat from core client for 30 sec - exiting 09:56:05 (720): No heartbeat from core client for 30 sec - exiting 09:56:06 (720): No heartbeat from core client for 30 sec - exiting 09:56:07 (720): No heartbeat from core client for 30 sec - exiting 09:56:08 (720): No heartbeat from core client for 30 sec - exiting 09:56:09 (720): No heartbeat from core client for 30 sec - exiting 09:56:10 (720): No heartbeat from core client for 30 sec - exiting 09:56:11 (720): No heartbeat from core client for 30 sec - exiting 09:56:12 (720): No heartbeat from core client for 30 sec - exiting 09:56:13 (720): No heartbeat from core client for 30 sec - exiting 09:56:14 (720): No heartbeat from core client for 30 sec - exiting 09:56:15 (720): No heartbeat from core client for 30 sec - exiting 09:56:16 (720): No heartbeat from core client for 30 sec - exiting 09:56:17 (720): No heartbeat from core client for 30 sec - exiting 09:56:18 (720): No heartbeat from core client for 30 sec - exiting 09:56:19 (720): No heartbeat from core client for 30 sec - exiting 09:56:20 (720): No heartbeat from core client for 30 sec - exiting 09:56:21 (720): No heartbeat from core client for 30 sec - exiting 09:56:22 (720): No heartbeat from core client for 30 sec - exiting 09:56:23 (720): No heartbeat from core client for 30 sec - exiting 09:56:24 (720): No heartbeat from core client for 30 sec - exiting 09:56:25 (720): No heartbeat from core client for 30 sec - exiting 09:56:26 (720): No heartbeat from core client for 30 sec - exiting 09:56:27 (720): No heartbeat from core client for 30 sec - exiting 09:56:28 (720): No heartbeat from core client for 30 sec - exiting 09:56:29 (720): No heartbeat from core client for 30 sec - exiting 09:56:30 (720): No heartbeat from core client for 30 sec - exiting 09:56:31 (720): No heartbeat from core client for 30 sec - exiting 09:56:32 (720): No heartbeat from core client for 30 sec - exiting 09:56:33 (720): No heartbeat from core client for 30 sec - exiting 09:56:34 (720): No heartbeat from core client for 30 sec - exiting 09:56:35 (720): No heartbeat from core client for 30 sec - exiting 09:56:36 (720): No heartbeat from core client for 30 sec - exiting 09:56:37 (720): No heartbeat from core client for 30 sec - exiting 09:56:38 (720): No heartbeat from core client for 30 sec - exiting 09:56:39 (720): No heartbeat from core client for 30 sec - exiting 09:56:40 (720): No heartbeat from core client for 30 sec - exiting 09:56:41 (720): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 09:57:37 (4236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:57:38 (4236): No heartbeat from core client for 30 sec - exiting 09:57:39 (4236): No heartbeat from core client for 30 sec - exiting 09:57:40 (4236): No heartbeat from core client for 30 sec - exiting 09:57:41 (4236): No heartbeat from core client for 30 sec - exiting 09:57:42 (4236): No heartbeat from core client for 30 sec - exiting 09:57:43 (4236): No heartbeat from core client for 30 sec - exiting 09:57:44 (4236): No heartbeat from core client for 30 sec - exiting 09:57:45 (4236): No heartbeat from core client for 30 sec - exiting 09:57:46 (4236): No heartbeat from core client for 30 sec - exiting 09:57:47 (4236): No heartbeat from core client for 30 sec - exiting BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 116 - Return code = 16 Model crashed: REPLANCA :I/O ERROR tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
27 Oct 2014 23:02:43 | 1289062 | 17310328 | hadcm3n_sbud_1940_40_009112058_2 | 25,920 | 44,701 | 1.7246 |
©2024 cpdn.org