Name | hadcm3n_zj65_1880_40_008026367_0 |
Workunit | 8181481 |
Created | 29 Jun 2012, 16:30:41 UTC |
Sent | 29 Jun 2012, 16:31:02 UTC |
Report deadline | 28 Sep 2012, 23:58:13 UTC |
Received | 28 Jul 2012, 18:22:50 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1158176 |
Run time | 28 days 16 hours 16 min 38 sec |
CPU time | 24 days 13 hours 53 min 7 sec |
Validate state | Invalid |
Credit | 12,130.56 |
Device peak FLOPS | 2.94 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.25</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 20:26:23 (1664): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=11840, iMonCtr=1 Model crash detected, will try to restart... 07:40:34 (4548): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:08:07 (4504): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:15:01 (996): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:16:13 (5436): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:14:00 (4656): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 04:25:31 (3948): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:22:32 (5208): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:32:36 (7056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:47:08 (8628): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:52:07 (6244): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
28 Jul 2012 17:03:12 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 1,010,880 | 2,155,169 | 2.1320 |
28 Jul 2012 00:07:24 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 984,960 | 2,099,585 | 2.1316 |
27 Jul 2012 07:49:57 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 959,040 | 2,044,054 | 2.1314 |
26 Jul 2012 14:56:42 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 933,120 | 1,984,830 | 2.1271 |
25 Jul 2012 20:17:29 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 907,200 | 1,924,046 | 2.1209 |
25 Jul 2012 02:15:55 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 881,280 | 1,861,120 | 2.1118 |
24 Jul 2012 07:13:40 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 855,360 | 1,803,860 | 2.1089 |
23 Jul 2012 12:58:42 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 829,440 | 1,750,465 | 2.1104 |
22 Jul 2012 17:11:41 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 803,520 | 1,693,225 | 2.1073 |
21 Jul 2012 20:49:14 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 777,600 | 1,634,202 | 2.1016 |
21 Jul 2012 01:14:48 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 751,680 | 1,576,055 | 2.0967 |
20 Jul 2012 06:51:44 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 725,760 | 1,519,506 | 2.0937 |
19 Jul 2012 11:11:44 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 699,840 | 1,462,196 | 2.0893 |
18 Jul 2012 17:10:16 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 673,920 | 1,405,289 | 2.0852 |
17 Jul 2012 22:46:45 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 648,000 | 1,345,405 | 2.0762 |
16 Jul 2012 23:48:12 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 622,080 | 1,280,508 | 2.0584 |
16 Jul 2012 04:10:33 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 596,160 | 1,217,368 | 2.0420 |
15 Jul 2012 11:52:53 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 570,240 | 1,159,134 | 2.0327 |
14 Jul 2012 16:47:18 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 544,320 | 1,095,798 | 2.0132 |
13 Jul 2012 22:22:54 | 1158176 | 14847978 | hadcm3n_zj65_1880_40_008026367_0 | 518,400 | 1,034,480 | 1.9955 |
©2024 cpdn.org