Name | hadcm3n_4j7a_2020_40_008399617_2 |
Workunit | 8550473 |
Created | 24 Aug 2013, 4:55:10 UTC |
Sent | 24 Aug 2013, 5:56:01 UTC |
Report deadline | 23 Nov 2013, 13:23:12 UTC |
Received | 2 Oct 2013, 15:13:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1069778 |
Run time | 23 days 18 hours 26 min 41 sec |
CPU time | 22 days 22 hours 6 min 42 sec |
Validate state | Invalid |
Credit | 11,819.52 |
Device peak FLOPS | 2.68 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:00:09 (3120): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 01:00:16 (4568): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 08:21:06 (4304): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=4360, iMonCtr=1 Model crash detected, will try to restart... Signal 22 received, exiting... Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
01 Oct 2013 19:02:32 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 984,960 | 1,959,342 | 1.9893 |
30 Sep 2013 15:44:41 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 959,040 | 1,912,856 | 1.9946 |
27 Sep 2013 16:57:42 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 933,120 | 1,866,624 | 2.0004 |
26 Sep 2013 17:05:58 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 907,200 | 1,818,508 | 2.0045 |
25 Sep 2013 09:35:41 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 881,280 | 1,770,484 | 2.0090 |
21 Sep 2013 01:58:15 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 855,360 | 1,720,851 | 2.0118 |
20 Sep 2013 03:22:17 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 829,440 | 1,668,339 | 2.0114 |
19 Sep 2013 09:12:27 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 803,520 | 1,615,728 | 2.0108 |
18 Sep 2013 15:13:48 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 777,600 | 1,563,327 | 2.0105 |
17 Sep 2013 20:23:59 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 751,680 | 1,510,622 | 2.0097 |
17 Sep 2013 05:17:02 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 725,760 | 1,458,486 | 2.0096 |
16 Sep 2013 11:07:41 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 699,840 | 1,406,261 | 2.0094 |
15 Sep 2013 16:51:26 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 673,920 | 1,354,023 | 2.0092 |
15 Sep 2013 01:47:11 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 648,000 | 1,301,848 | 2.0090 |
14 Sep 2013 07:42:37 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 622,080 | 1,249,682 | 2.0089 |
13 Sep 2013 13:37:10 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 596,160 | 1,197,452 | 2.0086 |
12 Sep 2013 19:22:55 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 570,240 | 1,144,364 | 2.0068 |
12 Sep 2013 04:12:38 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 544,320 | 1,091,409 | 2.0051 |
11 Sep 2013 08:46:07 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 518,400 | 1,039,137 | 2.0045 |
10 Sep 2013 14:36:05 | 1069778 | 15938018 | hadcm3n_4j7a_2020_40_008399617_2 | 492,480 | 986,171 | 2.0025 |
©2024 cpdn.org