Name | hadcm3n_zn04_1880_40_008026533_0 |
Workunit | 8181647 |
Created | 29 Jun 2012, 17:43:42 UTC |
Sent | 29 Jun 2012, 17:44:22 UTC |
Report deadline | 29 Sep 2012, 1:11:33 UTC |
Received | 20 Aug 2012, 5:08:53 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1114237 |
Run time | 37 days 18 hours 3 min 34 sec |
CPU time | 33 days 0 hours 46 min 40 sec |
Validate state | Invalid |
Credit | 12,130.56 |
Device peak FLOPS | 2.21 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> process exited with code 22 (0x16, -234) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 07:32:19 (17164): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:32:24 (17164): No heartbeat from core client for 30 sec - exiting 07:32:26 (17164): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 10:10:58 (12499): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:11:18 (12499): No heartbeat from core client for 30 sec - exiting Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... 13:32:36 (1344): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:32:50 (1344): No heartbeat from core client for 30 sec - exiting 13:32:51 (1344): No heartbeat from core client for 30 sec - exiting 13:32:52 (1344): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 08:46:30 (4531): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:46:32 (4531): No heartbeat from core client for 30 sec - exiting 08:46:33 (4531): No heartbeat from core client for 30 sec - exiting 08:47:57 (17056): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:56:57 (17056): No heartbeat from core client for 30 sec - exiting 09:37:34 (18671): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:37:37 (18671): No heartbeat from core client for 30 sec - exiting 09:40:12 (32508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:40:14 (32508): No heartbeat from core client for 30 sec - exiting 10:28:34 (629): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:28:45 (629): No heartbeat from core client for 30 sec - exiting 15:11:42 (17024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:11:45 (17024): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 10:41:58 (5887): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:48:27 (5887): No heartbeat from core client for 30 sec - exiting 10:49:23 (5887): No heartbeat from core client for 30 sec - exiting 10:49:25 (5887): No heartbeat from core client for 30 sec - exiting 10:49:26 (5887): No heartbeat from core client for 30 sec - exiting 10:54:04 (21194): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 10:54:10 (21194): No heartbeat from core client for 30 sec - exiting 08:59:18 (24310): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish 07:34:07 (1310): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:34:08 (1310): No heartbeat from core client for 30 sec - exiting 07:34:09 (1310): No heartbeat from core client for 30 sec - exiting 07:34:10 (1310): No heartbeat from core client for 30 sec - exiting 07:34:11 (1310): No heartbeat from core client for 30 sec - exiting 07:34:12 (1310): No heartbeat from core client for 30 sec - exiting 07:34:13 (1310): No heartbeat from core client for 30 sec - exiting 07:34:14 (1310): No heartbeat from core client for 30 sec - exiting 09:47:12 (22555): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:47:15 (22555): No heartbeat from core client for 30 sec - exiting 09:47:17 (22555): No heartbeat from core client for 30 sec - exiting 09:47:18 (22555): No heartbeat from core client for 30 sec - exiting 09:47:19 (22555): No heartbeat from core client for 30 sec - exiting 09:47:21 (22555): No heartbeat from core client for 30 sec - exiting 09:47:22 (22555): No heartbeat from core client for 30 sec - exiting 09:47:23 (22555): No heartbeat from core client for 30 sec - exiting 09:47:24 (22555): No heartbeat from core client for 30 sec - exiting 09:47:25 (22555): No heartbeat from core client for 30 sec - exiting 09:47:26 (22555): No heartbeat from core client for 30 sec - exiting 09:47:27 (22555): No heartbeat from core client for 30 sec - exiting 09:47:28 (22555): No heartbeat from core client for 30 sec - exiting 09:47:29 (22555): No heartbeat from core client for 30 sec - exiting 09:47:30 (22555): No heartbeat from core client for 30 sec - exiting 09:47:31 (22555): No heartbeat from core client for 30 sec - exiting 09:47:32 (22555): No heartbeat from core client for 30 sec - exiting 09:47:33 (22555): No heartbeat from core client for 30 sec - exiting 09:47:34 (22555): No heartbeat from core client for 30 sec - exiting 09:47:35 (22555): No heartbeat from core client for 30 sec - exiting 09:47:36 (22555): No heartbeat from core client for 30 sec - exiting 09:47:37 (22555): No heartbeat from core client for 30 sec - exiting 09:47:38 (22555): No heartbeat from core client for 30 sec - exiting 09:47:39 (22555): No heartbeat from core client for 30 sec - exiting 09:47:40 (22555): No heartbeat from core client for 30 sec - exiting 09:47:41 (22555): No heartbeat from core client for 30 sec - exiting 09:47:42 (22555): No heartbeat from core client for 30 sec - exiting 09:47:43 (22555): No heartbeat from core client for 30 sec - exiting 09:47:44 (22555): No heartbeat from core client for 30 sec - exiting 09:47:45 (22555): No heartbeat from core client for 30 sec - exiting 09:47:46 (22555): No heartbeat from core client for 30 sec - exiting 09:47:47 (22555): No heartbeat from core client for 30 sec - exiting 09:47:48 (22555): No heartbeat from core client for 30 sec - exiting 09:47:49 (22555): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:58:11 (4387): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:00:55 (4387): No heartbeat from core client for 30 sec - exiting 07:08:40 (31448): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 09:19:11 (1179): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:19:19 (1179): No heartbeat from core client for 30 sec - exiting 09:19:21 (1179): No heartbeat from core client for 30 sec - exiting 09:19:23 (1179): No heartbeat from core client for 30 sec - exiting 09:19:24 (1179): No heartbeat from core client for 30 sec - exiting 09:19:26 (1179): No heartbeat from core client for 30 sec - exiting 09:19:28 (1179): No heartbeat from core client for 30 sec - exiting 09:19:31 (1179): No heartbeat from core client for 30 sec - exiting 09:19:32 (1179): No heartbeat from core client for 30 sec - exiting 09:19:33 (1179): No heartbeat from core client for 30 sec - exiting 09:19:34 (1179): No heartbeat from core client for 30 sec - exiting 09:19:35 (1179): No heartbeat from core client for 30 sec - exiting 09:52:17 (28788): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:52:49 (28788): No heartbeat from core client for 30 sec - exiting Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Model crashed: REPLANCA: PP HEADERS ON ANCILLARY FILE DO NOT MATCH tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 1,010,880 | 3,029,979 | 2.9974 |
20 Aug 2012 05:13:44 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 984,960 | 2,953,652 | 2.9988 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 959,040 | 2,877,114 | 3.0000 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 933,120 | 2,803,290 | 3.0042 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 907,200 | 2,731,570 | 3.0110 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 881,280 | 2,659,740 | 3.0180 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 855,360 | 2,587,901 | 3.0255 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 829,440 | 2,512,282 | 3.0289 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 803,520 | 2,436,990 | 3.0329 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 777,600 | 2,361,220 | 3.0365 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 751,680 | 2,283,533 | 3.0379 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 725,760 | 2,204,801 | 3.0379 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 699,840 | 2,125,968 | 3.0378 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 673,920 | 2,047,088 | 3.0376 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 648,000 | 1,967,963 | 3.0370 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 622,080 | 1,888,889 | 3.0364 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 596,160 | 1,809,935 | 3.0360 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 570,240 | 1,731,551 | 3.0365 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 544,320 | 1,652,746 | 3.0363 |
20 Aug 2012 05:13:45 | 1114237 | 14848397 | hadcm3n_zn04_1880_40_008026533_0 | 518,400 | 1,573,892 | 3.0361 |
©2024 cpdn.org