Name | hadcm3n_zaeg_1920_40_008281662_1 |
Workunit | 8432797 |
Created | 15 Apr 2013, 7:27:22 UTC |
Sent | 15 Apr 2013, 7:27:30 UTC |
Report deadline | 15 Jul 2013, 14:54:41 UTC |
Received | 21 Apr 2013, 9:26:12 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 980519 |
Run time | 5 days 16 hours 14 min 24 sec |
CPU time | 5 days 8 hours 47 min 49 sec |
Validate state | Invalid |
Credit | 2,799.36 |
Device peak FLOPS | 2.33 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 15:54:47 (5184): No heartbeat from core client for 30 sec - exiting 15:54:48 (5184): No heartbeat from core client for 30 sec - exiting 15:54:49 (5184): No heartbeat from core client for 30 sec - exiting 15:54:50 (5184): No heartbeat from core client for 30 sec - exiting 15:54:51 (5184): No heartbeat from core client for 30 sec - exiting 15:54:52 (5184): No heartbeat from core client for 30 sec - exiting 15:54:53 (5184): No heartbeat from core client for 30 sec - exiting 15:54:54 (5184): No heartbeat from core client for 30 sec - exiting 15:54:55 (5184): No heartbeat from core client for 30 sec - exiting 15:54:56 (5184): No heartbeat from core client for 30 sec - exiting 15:54:57 (5184): No heartbeat from core client for 30 sec - exiting 15:54:58 (5184): No heartbeat from core client for 30 sec - exiting 15:54:59 (5184): No heartbeat from core client for 30 sec - exiting 15:55:00 (5184): No heartbeat from core client for 30 sec - exiting 15:55:01 (5184): No heartbeat from core client for 30 sec - exiting 15:55:02 (5184): No heartbeat from core client for 30 sec - exiting 15:55:03 (5184): No heartbeat from core client for 30 sec - exiting 15:55:04 (5184): No heartbeat from core client for 30 sec - exiting 15:55:05 (5184): No heartbeat from core client for 30 sec - exiting 15:55:06 (5184): No heartbeat from core client for 30 sec - exiting 15:55:07 (5184): No heartbeat from core client for 30 sec - exiting 15:55:08 (5184): No heartbeat from core client for 30 sec - exiting 15:55:09 (5184): No heartbeat from core client for 30 sec - exiting 15:55:10 (5184): No heartbeat from core client for 30 sec - exiting 15:55:11 (5184): No heartbeat from core client for 30 sec - exiting 15:55:12 (5184): No heartbeat from core client for 30 sec - exiting 
15:55:13 (5184): No heartbeat from core client for 30 sec - exiting 15:55:14 (5184): No heartbeat from core client for 30 sec - exiting 15:55:15 (5184): No heartbeat from core client for 30 sec - exiting 15:55:16 (5184): No heartbeat from core client for 30 sec - exiting 15:55:17 (5184): No heartbeat from core client for 30 sec - exiting 15:55:18 (5184): No heartbeat from core client for 30 sec - exiting 15:55:20 (5184): No heartbeat from core client for 30 sec - exiting 15:55:21 (5184): No heartbeat from core client for 30 sec - exiting 15:55:22 (5184): No heartbeat from core client for 30 sec - exiting 15:55:23 (5184): No heartbeat from core client for 30 sec - exiting 15:55:24 (5184): No heartbeat from core client for 30 sec - exiting 15:55:25 (5184): No heartbeat from core client for 30 sec - exiting 15:55:26 (5184): No heartbeat from core client for 30 sec - exiting 15:55:27 (5184): No heartbeat from core client for 30 sec - exiting 15:55:28 (5184): No heartbeat from core client for 30 sec - exiting 15:55:29 (5184): No heartbeat from core client for 30 sec - exiting 15:55:30 (5184): No heartbeat from core client for 30 sec - exiting 15:55:31 (5184): No heartbeat from core client for 30 sec - exiting 15:55:32 (5184): No heartbeat from core client for 30 sec - exiting 15:55:33 (5184): No heartbeat from core client for 30 sec - exiting 15:55:34 (5184): No heartbeat from core client for 30 sec - exiting 15:55:35 (5184): No heartbeat from core client for 30 sec - exiting 15:55:36 (5184): No heartbeat from core client for 30 sec - exiting 15:55:37 (5184): No heartbeat from core client for 30 sec - exiting 15:55:38 (5184): No heartbeat from core client for 30 sec - exiting 15:55:39 (5184): No heartbeat from core client for 30 sec - exiting 15:55:40 (5184): No heartbeat from core client for 30 sec - exiting 15:55:41 (5184): No heartbeat from core client for 30 sec - exiting 15:55:42 (5184): No heartbeat from core client for 30 sec - exiting 15:55:43 (5184): No 
heartbeat from core client for 30 sec - exiting 15:55:44 (5184): No heartbeat from core client for 30 sec - exiting 15:55:45 (5184): No heartbeat from core client for 30 sec - exiting 15:55:46 (5184): No heartbeat from core client for 30 sec - exiting 15:55:47 (5184): No heartbeat from core client for 30 sec - exiting 15:55:48 (5184): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:20:12 (3276): No heartbeat from core client for 30 sec - exiting 10:20:13 (3276): No heartbeat from core client for 30 sec - exiting 10:20:14 (3276): No heartbeat from core client for 30 sec - exiting 10:20:15 (3276): No heartbeat from core client for 30 sec - exiting 10:20:16 (3276): No heartbeat from core client for 30 sec - exiting 10:20:17 (3276): No heartbeat from core client for 30 sec - exiting 10:20:18 (3276): No heartbeat from core client for 30 sec - exiting 10:20:19 (3276): No heartbeat from core client for 30 sec - exiting 10:20:20 (3276): No heartbeat from core client for 30 sec - exiting 10:20:22 (3276): No heartbeat from core client for 30 sec - exiting 10:20:23 (3276): No heartbeat from core client for 30 sec - exiting 10:20:24 (3276): No heartbeat from core client for 30 sec - exiting 10:20:25 (3276): No heartbeat from core client for 30 sec - exiting 10:20:26 (3276): No heartbeat from core client for 30 sec - exiting 10:20:27 (3276): No heartbeat from core client for 30 sec - exiting 10:20:28 (3276): No heartbeat from core client for 30 sec - exiting 10:20:29 (3276): No heartbeat from core client for 30 sec - exiting 10:20:30 (3276): No heartbeat from core client for 30 sec - exiting 10:20:31 (3276): No heartbeat from core client for 30 sec - exiting 10:20:32 (3276): No heartbeat from core client for 30 sec - exiting 10:20:34 (3276): No heartbeat from core client for 30 sec - exiting 10:20:35 (3276): No heartbeat from core client for 30 sec - exiting 10:20:36 (3276): 
No heartbeat from core client for 30 sec - exiting 10:20:37 (3276): No heartbeat from core client for 30 sec - exiting 10:20:38 (3276): No heartbeat from core client for 30 sec - exiting 10:20:39 (3276): No heartbeat from core client for 30 sec - exiting 10:20:40 (3276): No heartbeat from core client for 30 sec - exiting 10:20:41 (3276): No heartbeat from core client for 30 sec - exiting 10:20:42 (3276): No heartbeat from core client for 30 sec - exiting 10:20:43 (3276): No heartbeat from core client for 30 sec - exiting 10:20:44 (3276): No heartbeat from core client for 30 sec - exiting 10:20:46 (3276): No heartbeat from core client for 30 sec - exiting 10:20:47 (3276): No heartbeat from core client for 30 sec - exiting 10:20:48 (3276): No heartbeat from core client for 30 sec - exiting 10:20:49 (3276): No heartbeat from core client for 30 sec - exiting 10:20:50 (3276): No heartbeat from core client for 30 sec - exiting 10:20:51 (3276): No heartbeat from core client for 30 sec - exiting 10:20:52 (3276): No heartbeat from core client for 30 sec - exiting 10:20:53 (3276): No heartbeat from core client for 30 sec - exiting 10:20:54 (3276): No heartbeat from core client for 30 sec - exiting 10:20:55 (3276): No heartbeat from core client for 30 sec - exiting 10:20:56 (3276): No heartbeat from core client for 30 sec - exiting 10:20:58 (3276): No heartbeat from core client for 30 sec - exiting 10:20:59 (3276): No heartbeat from core client for 30 sec - exiting 10:21:00 (3276): No heartbeat from core client for 30 sec - exiting 10:21:01 (3276): No heartbeat from core client for 30 sec - exiting 10:21:02 (3276): No heartbeat from core client for 30 sec - exiting 10:21:03 (3276): No heartbeat from core client for 30 sec - exiting 10:21:04 (3276): No heartbeat from core client for 30 sec - exiting 10:21:05 (3276): No heartbeat from core client for 30 sec - exiting 10:21:06 (3276): No heartbeat from core client for 30 sec - exiting 10:21:07 (3276): No heartbeat from core 
client for 30 sec - exiting 10:21:08 (3276): No heartbeat from core client for 30 sec - exiting 10:21:10 (3276): No heartbeat from core client for 30 sec - exiting 10:21:11 (3276): No heartbeat from core client for 30 sec - exiting 10:21:12 (3276): No heartbeat from core client for 30 sec - exiting 10:21:13 (3276): No heartbeat from core client for 30 sec - exiting 10:21:14 (3276): No heartbeat from core client for 30 sec - exiting 10:21:15 (3276): No heartbeat from core client for 30 sec - exiting 10:21:16 (3276): No heartbeat from core client for 30 sec - exiting 10:21:17 (3276): No heartbeat from core client for 30 sec - exiting 10:21:18 (3276): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4596, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=5256, iMonCtr=1 Model crash detected, will try to restart... 
Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
20 Apr 2013 16:57:40 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 233,280 | 439,529 | 1.8841 |
20 Apr 2013 02:46:47 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 207,360 | 390,869 | 1.8850 |
19 Apr 2013 11:49:42 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 181,440 | 341,291 | 1.8810 |
18 Apr 2013 21:09:23 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 155,520 | 291,845 | 1.8766 |
18 Apr 2013 06:20:19 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 129,600 | 242,611 | 1.8720 |
17 Apr 2013 15:55:48 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 103,680 | 193,680 | 1.8681 |
17 Apr 2013 01:24:24 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 77,760 | 144,758 | 1.8616 |
16 Apr 2013 11:50:45 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 51,840 | 97,963 | 1.8897 |
15 Apr 2013 22:42:32 | 980519 | 15724958 | hadcm3n_zaeg_1920_40_008281662_1 | 25,920 | 48,955 | 1.8887 |
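
The "Average (sec/TS)" column in the trickle table appears to be the cumulative CPU time divided by the cumulative timestep count. That relationship is an inference from the numbers, not something the page states. A minimal sketch that checks it against the nine rows above; the names `TRICKLES`, `sec_per_timestep`, and `matches_reported` are hypothetical helpers introduced only for this illustration:

```python
# Rows copied from the trickle table: (timestep, cpu_time_sec, reported_avg).
TRICKLES = [
    (233_280, 439_529, 1.8841),
    (207_360, 390_869, 1.8850),
    (181_440, 341_291, 1.8810),
    (155_520, 291_845, 1.8766),
    (129_600, 242_611, 1.8720),
    (103_680, 193_680, 1.8681),
    (77_760, 144_758, 1.8616),
    (51_840, 97_963, 1.8897),
    (25_920, 48_955, 1.8887),
]


def sec_per_timestep(cpu_time_sec, timestep):
    """Cumulative CPU seconds spent per model timestep (assumed definition)."""
    return cpu_time_sec / timestep


def matches_reported(rows, tol=5e-5):
    """True if every recomputed average agrees with the reported value.

    The tolerance of 5e-5 allows for the table rounding to four decimals.
    """
    return all(
        abs(sec_per_timestep(cpu, ts) - avg) <= tol for ts, cpu, avg in rows
    )


if __name__ == "__main__":
    for ts, cpu, avg in TRICKLES:
        print(f"{ts:>7,}  {sec_per_timestep(cpu, ts):.4f}  (reported {avg:.4f})")
    print("all rows consistent:", matches_reported(TRICKLES))
```

Each trickle in the table also advances the timestep counter by 25,920, so the same rows could be used to estimate per-interval throughput instead of the cumulative average.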
©2024 cpdn.org