Name | hadcm3n_zje8_1880_40_008245691_1 |
Workunit | 8400815 |
Created | 20 Nov 2012, 20:28:56 UTC |
Sent | 20 Nov 2012, 20:28:59 UTC |
Report deadline | 20 Feb 2013, 3:56:10 UTC |
Received | 26 Nov 2012, 18:44:04 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1244719 |
Run time | 5 days 5 hours 20 min 23 sec |
CPU time | 4 days 18 hours 49 min 1 sec |
Validate state | Invalid |
Credit | 3,732.48 |
Device peak FLOPS | 3.18 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... 18:30:32 (6100): No heartbeat from core client for 30 sec - exiting 18:30:33 (6100): No heartbeat from core client for 30 sec - exiting 18:30:34 (6100): No heartbeat from core client for 30 sec - exiting 18:30:35 (6100): No heartbeat from core client for 30 sec - exiting 18:30:36 (6100): No heartbeat from core client for 30 sec - exiting 18:30:37 (6100): No heartbeat from core client for 30 sec - exiting 18:30:39 (6100): No heartbeat from core client for 30 sec - exiting 18:30:40 (6100): No heartbeat from core client for 30 sec - exiting 18:30:41 (6100): No heartbeat from core client for 30 sec - exiting 18:30:42 (6100): No heartbeat from core client for 30 sec - exiting 18:30:43 (6100): No heartbeat from core client for 30 sec - exiting 18:30:44 (6100): No heartbeat from core client for 30 sec - exiting 18:30:45 (6100): No heartbeat from core client for 30 sec - exiting 18:30:46 (6100): No heartbeat from core client for 30 sec - exiting 18:30:47 (6100): No heartbeat from core client for 30 sec - exiting 18:30:48 (6100): No heartbeat from core client for 30 sec - exiting 18:30:49 (6100): No heartbeat from core client for 30 sec - exiting 18:30:51 (6100): No heartbeat from core client for 30 sec - exiting 18:30:52 (6100): No heartbeat from core client for 30 sec - exiting 18:30:53 (6100): No heartbeat from core client for 30 sec - exiting 18:30:54 (6100): No heartbeat from core client for 30 sec - exiting 18:30:55 (6100): No heartbeat from core client for 30 sec - exiting 18:30:56 (6100): No heartbeat from core client for 30 sec - exiting 18:30:57 (6100): No heartbeat from core client for 30 sec - exiting 18:30:58 (6100): No heartbeat from core client for 30 sec - exiting 18:30:59 (6100): No heartbeat from core client for 30 sec - exiting 18:31:00 (6100): No heartbeat from core client for 30 sec - exiting 18:31:01 (6100): No heartbeat from core client for 30 sec - exiting 18:31:02 (6100): No heartbeat from core client for 30 sec - exiting 18:31:03 (6100): No heartbeat from core client for 30 sec - exiting 18:31:04 (6100): No heartbeat from core client for 30 sec - exiting 18:31:05 (6100): No heartbeat from core client for 30 sec - exiting 18:31:06 (6100): No heartbeat from core client for 30 sec - exiting 18:31:07 (6100): No heartbeat from core client for 30 sec - exiting 18:31:08 (6100): No heartbeat from core client for 30 sec - exiting 18:31:09 (6100): No heartbeat from core client for 30 sec - exiting 18:31:10 (6100): No heartbeat from core client for 30 sec - exiting 18:31:11 (6100): No heartbeat from core client for 30 sec - exiting 18:31:12 (6100): No heartbeat from core client for 30 sec - exiting 18:31:13 (6100): No heartbeat from core client for 30 sec - exiting 18:31:14 (6100): No heartbeat from core client for 30 sec - exiting 18:31:16 (6100): No heartbeat from core client for 30 sec - exiting 18:31:17 (6100): No heartbeat from core client for 30 sec - exiting 18:31:18 (6100): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 19:18:45 (3344): No heartbeat from core client for 30 sec - exiting 19:18:46 (3344): No heartbeat from core client for 30 sec - exiting 19:18:47 (3344): No heartbeat from core client for 30 sec - exiting 19:18:48 (3344): No heartbeat from core client for 30 sec - exiting 19:18:49 (3344): No heartbeat from core client for 30 sec - exiting 19:18:50 (3344): No heartbeat from core client for 30 sec - exiting 19:18:52 (3344): No heartbeat from core client for 30 sec - exiting 19:18:53 (3344): No heartbeat from core client for 30 sec - exiting 19:18:54 (3344): No heartbeat from core client for 30 sec - exiting 19:18:55 (3344): No heartbeat from core client for 30 sec - exiting 19:18:56 (3344): No heartbeat from core client for 30 sec - exiting 19:18:57 (3344): No heartbeat from core client for 30 sec - exiting 19:18:58 (3344): No heartbeat from core client for 30 sec - exiting 19:18:59 (3344): No heartbeat from core client for 30 sec - exiting 19:19:00 (3344): No heartbeat from core client for 30 sec - exiting 19:19:01 (3344): No heartbeat from core client for 30 sec - exiting 19:19:02 (3344): No heartbeat from core client for 30 sec - exiting 19:19:04 (3344): No heartbeat from core client for 30 sec - exiting 19:19:05 (3344): No heartbeat from core client for 30 sec - exiting 19:19:06 (3344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4832, iMonCtr=1 Model crash detected, will try to restart... 17:09:52 (4772): No heartbeat from core client for 30 sec - exiting 17:09:53 (4772): No heartbeat from core client for 30 sec - exiting 17:09:54 (4772): No heartbeat from core client for 30 sec - exiting 17:09:55 (4772): No heartbeat from core client for 30 sec - exiting 17:09:56 (4772): No heartbeat from core client for 30 sec - exiting 17:09:57 (4772): No heartbeat from core client for 30 sec - exiting 17:09:59 (4772): No heartbeat from core client for 30 sec - exiting 17:10:00 (4772): No heartbeat from core client for 30 sec - exiting 17:10:01 (4772): No heartbeat from core client for 30 sec - exiting 17:10:02 (4772): No heartbeat from core client for 30 sec - exiting 17:10:03 (4772): No heartbeat from core client for 30 sec - exiting 17:10:04 (4772): No heartbeat from core client for 30 sec - exiting 17:10:05 (4772): No heartbeat from core client for 30 sec - exiting 17:10:06 (4772): No heartbeat from core client for 30 sec - exiting 17:10:07 (4772): No heartbeat from core client for 30 sec - exiting 17:10:08 (4772): No heartbeat from core client for 30 sec - exiting 17:10:10 (4772): No heartbeat from core client for 30 sec - exiting 17:10:11 (4772): No heartbeat from core client for 30 sec - exiting 17:10:12 (4772): No heartbeat from core client for 30 sec - exiting 17:10:13 (4772): No heartbeat from core client for 30 sec - exiting 17:10:14 (4772): No heartbeat from core client for 30 sec - exiting 17:10:15 (4772): No heartbeat from core client for 30 sec - exiting 17:10:16 (4772): No heartbeat from core client for 30 sec - exiting 17:10:17 (4772): No heartbeat from core client for 30 sec - exiting 17:10:18 (4772): No heartbeat from core client for 30 sec - exiting 17:10:19 (4772): No heartbeat from core client for 30 sec - exiting 17:10:20 (4772): No heartbeat from core client for 30 sec - exiting 17:10:22 (4772): No heartbeat from core client for 30 sec - exiting 17:10:23 (4772): No heartbeat from core client for 30 sec - exiting 17:10:24 (4772): No heartbeat from core client for 30 sec - exiting 17:10:25 (4772): No heartbeat from core client for 30 sec - exiting 17:10:26 (4772): No heartbeat from core client for 30 sec - exiting 17:10:27 (4772): No heartbeat from core client for 30 sec - exiting 17:10:28 (4772): No heartbeat from core client for 30 sec - exiting 17:10:29 (4772): No heartbeat from core client for 30 sec - exiting 17:10:30 (4772): No heartbeat from core client for 30 sec - exiting 17:10:31 (4772): No heartbeat from core client for 30 sec - exiting 17:10:32 (4772): No heartbeat from core client for 30 sec - exiting 17:10:34 (4772): No heartbeat from core client for 30 sec - exiting 17:10:35 (4772): No heartbeat from core client for 30 sec - exiting 17:10:36 (4772): No heartbeat from core client for 30 sec - exiting 17:10:37 (4772): No heartbeat from core client for 30 sec - exiting 17:10:38 (4772): No heartbeat from core client for 30 sec - exiting 17:10:39 (4772): No heartbeat from core client for 30 sec - exiting 17:10:40 (4772): No heartbeat from core client for 30 sec - exiting 17:10:41 (4772): No heartbeat from core client for 30 sec - exiting 17:10:42 (4772): No heartbeat from core client for 30 sec - exiting 17:10:43 (4772): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 17:10:44 (4772): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=6096, iMonCtr=1 Model crash detected, will try to restart... 02:03:19 (1820): No heartbeat from core client for 30 sec - exiting 02:03:20 (1820): No heartbeat from core client for 30 sec - exiting 02:03:21 (1820): No heartbeat from core client for 30 sec - exiting 02:03:22 (1820): No heartbeat from core client for 30 sec - exiting 02:03:23 (1820): No heartbeat from core client for 30 sec - exiting 02:03:24 (1820): No heartbeat from core client for 30 sec - exiting 02:03:25 (1820): No heartbeat from core client for 30 sec - exiting 02:03:26 (1820): No heartbeat from core client for 30 sec - exiting 02:03:28 (1820): No heartbeat from core client for 30 sec - exiting 02:03:29 (1820): No heartbeat from core client for 30 sec - exiting 02:03:30 (1820): No heartbeat from core client for 30 sec - exiting 02:03:31 (1820): No heartbeat from core client for 30 sec - exiting 02:03:32 (1820): No heartbeat from core client for 30 sec - exiting 02:03:33 (1820): No heartbeat from core client for 30 sec - exiting 02:03:34 (1820): No heartbeat from core client for 30 sec - exiting 02:03:35 (1820): No heartbeat from core client for 30 sec - exiting 02:03:36 (1820): No heartbeat from core client for 30 sec - exiting 02:03:37 (1820): No heartbeat from core client for 30 sec - exiting 02:03:38 (1820): No heartbeat from core client for 30 sec - exiting 02:03:40 (1820): No heartbeat from core client for 30 sec - exiting 02:03:41 (1820): No heartbeat from core client for 30 sec - exiting 02:03:42 (1820): No heartbeat from core client for 30 sec - exiting 02:03:43 (1820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:03:44 (1820): No heartbeat from core client for 30 sec - exiting 02:03:45 (1820): No heartbeat from core client for 30 sec - exiting 02:03:46 (1820): No heartbeat from core client for 30 sec - exiting 02:03:47 (1820): No heartbeat from core client for 30 sec - exiting 02:03:48 (1820): No heartbeat from core client for 30 sec - exiting 02:03:49 (1820): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
25 Nov 2012 20:41:03 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 311,040 | 397,666 | 1.2785 |
25 Nov 2012 11:15:52 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 285,120 | 364,792 | 1.2794 |
25 Nov 2012 00:28:31 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 259,200 | 331,508 | 1.2790 |
24 Nov 2012 15:04:44 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 233,280 | 298,552 | 1.2798 |
24 Nov 2012 04:57:36 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 207,360 | 265,584 | 1.2808 |
23 Nov 2012 19:06:56 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 181,440 | 232,658 | 1.2823 |
23 Nov 2012 09:35:43 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 155,520 | 199,768 | 1.2845 |
22 Nov 2012 23:48:34 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 129,600 | 166,457 | 1.2844 |
22 Nov 2012 13:53:32 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 103,680 | 133,017 | 1.2830 |
22 Nov 2012 03:21:30 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 77,760 | 99,660 | 1.2816 |
21 Nov 2012 17:04:40 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 51,840 | 66,052 | 1.2742 |
21 Nov 2012 07:08:22 | 1244719 | 15440679 | hadcm3n_zje8_1880_40_008245691_1 | 25,920 | 32,380 | 1.2492 |
©2024 cpdn.org