Name | hadcm3n_yav6_1900_40_007520562_1 |
Workunit | 7718037 |
Created | 28 Oct 2011, 13:08:34 UTC |
Sent | 3 Nov 2011, 16:25:24 UTC |
Report deadline | 2 Feb 2012, 23:52:35 UTC |
Received | 15 Nov 2011, 16:48:49 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1178052 |
Run time | 4 days 12 hours 59 min 32 sec |
CPU time | 4 days 9 hours 47 min 41 sec |
Validate state | Invalid |
Credit | 1,244.16 |
Device peak FLOPS | 1.62 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 14:26:13 (3412): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:26:14 (3412): No heartbeat from core client for 30 sec - exiting 14:26:15 (3412): No heartbeat from core client for 30 sec - exiting 14:26:16 (3412): No heartbeat from core client for 30 sec - exiting 14:26:17 (3412): No heartbeat from core client for 30 sec - exiting 14:26:18 (3412): No heartbeat from core client for 30 sec - exiting 14:26:19 (3412): No heartbeat from core client for 30 sec - exiting 16:44:35 (3392): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 21:34:15 (1324): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 05:11:42 (384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:11:44 (384): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 12:03:18 (2108): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 02:08:35 (2248): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:08:40 (2248): No heartbeat from core client for 30 sec - exiting 02:08:41 (2248): No heartbeat from core client for 30 sec - exiting 02:08:42 (2248): No heartbeat from core client for 30 sec - exiting 02:09:33 (3804): No heartbeat from core client for 30 sec - exiting 02:09:34 (3804): No heartbeat from core client for 30 sec - exiting 02:09:35 (3804): No heartbeat from core client for 30 sec - exiting 02:09:36 (3804): No heartbeat from core client for 30 sec - exiting 02:09:37 (3804): No heartbeat from core client for 30 sec - exiting 02:09:38 (3804): No heartbeat from core client for 30 sec - exiting 02:09:39 (3804): No heartbeat from core client for 30 sec - exiting 02:09:41 (3804): No heartbeat from core client for 30 sec - exiting 02:09:42 (3804): No heartbeat from core client for 30 sec - exiting 02:09:43 (3804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:09:44 (3804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 05:05:15 (3512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 05:05:17 (3512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 08:07:16 (3792): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:07:17 (3792): No heartbeat from core client for 30 sec - exiting 11:30:36 (3348): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 22:06:32 (3700): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:01:37 (2836): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:26:50 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:26:54 (3560): No heartbeat from core client for 30 sec - exiting 12:26:55 (3560): No heartbeat from core client for 30 sec - exiting 12:26:56 (3560): No heartbeat from core client for 30 sec - exiting 12:26:58 (3560): No heartbeat from core client for 30 sec - exiting 12:26:59 (3560): No heartbeat from core client for 30 sec - exiting 12:27:00 (3560): No heartbeat from core client for 30 sec - exiting 12:27:01 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 15:51:01 (3560): No heartbeat from core client for 30 sec - exiting 15:51:02 (3560): No heartbeat from core client for 30 sec - exiting 15:51:03 (3560): No heartbeat from core client for 30 sec - exiting 15:51:04 (3560): No heartbeat from core client for 30 sec - exiting 15:51:06 (3560): No heartbeat from core client for 30 sec - exiting 15:51:07 (3560): No heartbeat from core client for 30 sec - exiting 15:51:08 (3560): No heartbeat from core client for 30 sec - exiting 15:51:09 (3560): No heartbeat from core client for 30 sec - exiting 15:51:10 (3560): No heartbeat from core client for 30 sec - exiting 15:51:11 (3560): No heartbeat from core client for 30 sec - exiting 15:51:12 (3560): No heartbeat from core client for 30 sec - exiting 15:51:14 (3560): No heartbeat from core client for 30 sec - exiting 15:51:15 (3560): No heartbeat from core client for 30 sec - exiting 15:51:16 (3560): No heartbeat from core client for 30 sec - exiting 15:51:17 (3560): No heartbeat from core client for 30 sec - exiting 15:51:18 (3560): No heartbeat from core client for 30 sec - exiting 15:51:19 (3560): No heartbeat from core client for 30 sec - exiting 15:51:20 (3560): No heartbeat from core client for 30 sec - exiting 15:51:21 (3560): No heartbeat from core client for 30 sec - exiting 15:51:23 (3560): No heartbeat from core client for 30 sec - exiting 15:51:24 (3560): No heartbeat from core client for 30 sec - exiting 15:51:25 (3560): No heartbeat from core client for 30 sec - exiting 15:51:26 (3560): No heartbeat from core client for 30 sec - exiting 15:51:27 (3560): No heartbeat from core client for 30 sec - exiting 15:51:28 (3560): No heartbeat from core client for 30 sec - exiting 15:51:29 (3560): No heartbeat from core client for 30 sec - exiting 15:51:31 (3560): No heartbeat from core client for 30 sec - exiting 15:51:32 (3560): No heartbeat from core client for 30 sec - exiting 15:51:33 (3560): No heartbeat from core client for 30 sec - exiting 15:51:34 (3560): No heartbeat from core client for 30 sec - exiting 15:51:35 (3560): No heartbeat from core client for 30 sec - exiting 15:51:36 (3560): No heartbeat from core client for 30 sec - exiting 15:51:37 (3560): No heartbeat from core client for 30 sec - exiting 15:51:38 (3560): No heartbeat from core client for 30 sec - exiting 15:51:39 (3560): No heartbeat from core client for 30 sec - exiting 15:51:41 (3560): No heartbeat from core client for 30 sec - exiting 15:51:42 (3560): No heartbeat from core client for 30 sec - exiting 15:51:43 (3560): No heartbeat from core client for 30 sec - exiting 15:51:44 (3560): No heartbeat from core client for 30 sec - exiting 15:51:45 (3560): No heartbeat from core client for 30 sec - exiting 15:51:46 (3560): No heartbeat from core client for 30 sec - exiting 15:51:47 (3560): No heartbeat from core client for 30 sec - exiting 15:51:49 (3560): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:53:25 (3020): Can't acquire lockfile (32) - waiting 35s CPDN Monitor - Quit request from BOINC... 16:42:28 (908): No heartbeat from core client for 30 sec - exiting 16:42:29 (908): No heartbeat from core client for 30 sec - exiting 16:42:30 (908): No heartbeat from core client for 30 sec - exiting 16:42:31 (908): No heartbeat from core client for 30 sec - exiting 16:42:32 (908): No heartbeat from core client for 30 sec - exiting 16:42:33 (908): No heartbeat from core client for 30 sec - exiting 16:42:34 (908): No heartbeat from core client for 30 sec - exiting 16:42:35 (908): No heartbeat from core client for 30 sec - exiting 16:42:36 (908): No heartbeat from core client for 30 sec - exiting 16:42:38 (908): No heartbeat from core client for 30 sec - exiting 16:42:39 (908): No heartbeat from core client for 30 sec - exiting 16:42:40 (908): No heartbeat from core client for 30 sec - exiting 16:42:41 (908): No heartbeat from core client for 30 sec - exiting 16:42:42 (908): No heartbeat from core client for 30 sec - exiting 16:42:43 (908): No heartbeat from core client for 30 sec - exiting 16:42:44 (908): No heartbeat from core client for 30 sec - exiting 16:42:45 (908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3984, iMonCtr=1 Model crash detected, will try to restart... 11:42:09 (1460): No heartbeat from core client for 30 sec - exiting 11:42:11 (1460): No heartbeat from core client for 30 sec - exiting 11:42:12 (1460): No heartbeat from core client for 30 sec - exiting 11:42:13 (1460): No heartbeat from core client for 30 sec - exiting 11:42:14 (1460): No heartbeat from core client for 30 sec - exiting 11:42:15 (1460): No heartbeat from core client for 30 sec - exiting 11:42:16 (1460): No heartbeat from core client for 30 sec - exiting 11:42:17 (1460): No heartbeat from core client for 30 sec - exiting 11:42:19 (1460): No heartbeat from core client for 30 sec - exiting 11:42:20 (1460): No heartbeat from core client for 30 sec - exiting 11:42:21 (1460): No heartbeat from core client for 30 sec - exiting 11:42:22 (1460): No heartbeat from core client for 30 sec - exiting 11:42:23 (1460): No heartbeat from core client for 30 sec - exiting 11:42:25 (1460): No heartbeat from core client for 30 sec - exiting 11:42:26 (1460): No heartbeat from core client for 30 sec - exiting 11:42:27 (1460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:42:28 (1460): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 12:49:35 (1732): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=3384, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2864, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2864, iMonCtr=1 Model crash detected, will try to restart... Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Nov 2011 17:09:59 | 1178052 | 13546812 | hadcm3n_yav6_1900_40_007520562_1 | 103,680 | 345,178 | 3.3293 |
07 Nov 2011 22:35:11 | 1178052 | 13546812 | hadcm3n_yav6_1900_40_007520562_1 | 77,760 | 270,199 | 3.4748 |
06 Nov 2011 18:50:09 | 1178052 | 13546812 | hadcm3n_yav6_1900_40_007520562_1 | 51,840 | 175,411 | 3.3837 |
05 Nov 2011 08:30:58 | 1178052 | 13546812 | hadcm3n_yav6_1900_40_007520562_1 | 25,920 | 85,055 | 3.2814 |
©2024 cpdn.org