Name | hadcm3n_n44j_1920_40_008321845_0 |
Workunit | 8472980 |
Created | 24 Feb 2013, 21:24:51 UTC |
Sent | 24 Feb 2013, 21:30:25 UTC |
Report deadline | 27 May 2013, 4:57:36 UTC |
Received | 12 Apr 2013, 14:15:34 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 996482 |
Run time | 27 days 23 hours 18 min 35 sec |
CPU time | 23 days 11 hours 15 min |
Validate state | Invalid |
Credit | 11,508.48 |
Device peak FLOPS | 2.35 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.59</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=2728, iMonCtr=1 Model crash detected, will try to restart... 15:22:50 (3084): No heartbeat from core client for 30 sec - exiting 15:22:51 (3084): No heartbeat from core client for 30 sec - exiting 15:22:52 (3084): No heartbeat from core client for 30 sec - exiting 15:22:54 (3084): No heartbeat from core client for 30 sec - exiting 15:22:55 (3084): No heartbeat from core client for 30 sec - exiting 15:22:56 (3084): No heartbeat from core client for 30 sec - exiting 15:22:57 (3084): No heartbeat from core client for 30 sec - exiting 15:22:58 (3084): No heartbeat from core client for 30 sec - exiting 15:22:59 (3084): No heartbeat from core client for 30 sec - exiting 15:23:00 (3084): No heartbeat from core client for 30 sec - exiting 15:23:01 (3084): No heartbeat from core client for 30 sec - exiting 15:23:02 (3084): No heartbeat from core client for 30 sec - exiting 15:23:03 (3084): No heartbeat from core client for 30 sec - exiting 15:23:05 (3084): No heartbeat from core client for 30 sec - exiting 15:23:06 (3084): No heartbeat from core client for 30 sec - exiting 15:23:07 (3084): No heartbeat from core client for 30 sec - exiting 15:23:08 (3084): No heartbeat from core client for 30 sec - exiting 15:23:09 (3084): No heartbeat from core client for 30 sec - exiting 15:23:14 (3084): No heartbeat from core client for 30 sec - exiting 15:23:15 (3084): No heartbeat from core client for 30 sec - exiting 15:23:16 (3084): No heartbeat from core client for 30 sec - exiting 15:23:17 (3084): No heartbeat from 
core client for 30 sec - exiting 15:23:18 (3084): No heartbeat from core client for 30 sec - exiting 15:23:19 (3084): No heartbeat from core client for 30 sec - exiting 15:23:20 (3084): No heartbeat from core client for 30 sec - exiting 15:23:21 (3084): No heartbeat from core client for 30 sec - exiting 15:23:22 (3084): No heartbeat from core client for 30 sec - exiting 15:23:23 (3084): No heartbeat from core client for 30 sec - exiting 15:23:24 (3084): No heartbeat from core client for 30 sec - exiting 15:23:25 (3084): No heartbeat from core client for 30 sec - exiting 15:23:26 (3084): No heartbeat from core client for 30 sec - exiting 15:23:27 (3084): No heartbeat from core client for 30 sec - exiting 15:23:28 (3084): No heartbeat from core client for 30 sec - exiting 15:23:29 (3084): No heartbeat from core client for 30 sec - exiting 15:23:30 (3084): No heartbeat from core client for 30 sec - exiting 15:23:31 (3084): No heartbeat from core client for 30 sec - exiting 15:23:32 (3084): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:24:33 (3576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 18:23:35 (6908): No heartbeat from core client for 30 sec - exiting 18:23:36 (6908): No heartbeat from core client for 30 sec - exiting 18:23:37 (6908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
23:46:58 (3508): No heartbeat from core client for 30 sec - exiting 23:46:59 (3508): No heartbeat from core client for 30 sec - exiting 23:47:00 (3508): No heartbeat from core client for 30 sec - exiting 23:47:01 (3508): No heartbeat from core client for 30 sec - exiting 23:47:03 (3508): No heartbeat from core client for 30 sec - exiting 23:47:04 (3508): No heartbeat from core client for 30 sec - exiting 23:47:05 (3508): No heartbeat from core client for 30 sec - exiting 23:47:06 (3508): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:48:24 (1600): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:18:51 (3760): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
20:48:16 (3940): No heartbeat from core client for 30 sec - exiting 20:48:17 (3940): No heartbeat from core client for 30 sec - exiting 20:48:18 (3940): No heartbeat from core client for 30 sec - exiting 20:48:19 (3940): No heartbeat from core client for 30 sec - exiting 20:48:20 (3940): No heartbeat from core client for 30 sec - exiting 20:48:21 (3940): No heartbeat from core client for 30 sec - exiting 20:48:22 (3940): No heartbeat from core client for 30 sec - exiting 20:48:23 (3940): No heartbeat from core client for 30 sec - exiting 20:48:25 (3940): No heartbeat from core client for 30 sec - exiting 20:48:26 (3940): No heartbeat from core client for 30 sec - exiting 20:48:27 (3940): No heartbeat from core client for 30 sec - exiting 20:48:28 (3940): No heartbeat from core client for 30 sec - exiting 20:48:29 (3940): No heartbeat from core client for 30 sec - exiting 20:48:30 (3940): No heartbeat from core client for 30 sec - exiting 20:48:31 (3940): No heartbeat from core client for 30 sec - exiting 20:48:32 (3940): No heartbeat from core client for 30 sec - exiting 20:48:33 (3940): No heartbeat from core client for 30 sec - exiting 20:48:34 (3940): No heartbeat from core client for 30 sec - exiting 20:48:35 (3940): No heartbeat from core client for 30 sec - exiting 20:48:37 (3940): No heartbeat from core client for 30 sec - exiting 20:48:38 (3940): No heartbeat from core client for 30 sec - exiting 20:48:39 (3940): No heartbeat from core client for 30 sec - exiting 20:48:40 (3940): No heartbeat from core client for 30 sec - exiting 20:48:41 (3940): No heartbeat from core client for 30 sec - exiting 20:48:42 (3940): No heartbeat from core client for 30 sec - exiting 20:48:43 (3940): No heartbeat from core client for 30 sec - exiting 20:48:44 (3940): No heartbeat from core client for 30 sec - exiting 20:48:45 (3940): No heartbeat from core client for 30 sec - exiting 20:48:46 (3940): No heartbeat from core client for 30 sec - exiting 20:48:47 (3940): No 
heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:49:30 (3816): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 41 - Return code = 16 Model crashed: READHEAD: I/O error tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
12 Apr 2013 07:43:57 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 959,040 | 2,016,599 | 2.1027 |
11 Apr 2013 17:59:12 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 933,120 | 1,965,873 | 2.1068 |
10 Apr 2013 23:36:03 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 907,200 | 1,911,260 | 2.1068 |
10 Apr 2013 04:31:40 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 881,280 | 1,855,110 | 2.1050 |
09 Apr 2013 08:09:46 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 855,360 | 1,798,263 | 2.1023 |
08 Apr 2013 10:32:46 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 829,440 | 1,742,666 | 2.1010 |
07 Apr 2013 15:40:38 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 803,520 | 1,686,103 | 2.0984 |
06 Apr 2013 21:28:54 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 777,600 | 1,631,586 | 2.0982 |
06 Apr 2013 03:51:41 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 751,680 | 1,576,786 | 2.0977 |
05 Apr 2013 08:09:13 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 725,760 | 1,524,089 | 2.1000 |
04 Apr 2013 18:35:08 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 699,840 | 1,478,912 | 2.1132 |
04 Apr 2013 02:53:26 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 673,920 | 1,429,982 | 2.1219 |
03 Apr 2013 09:25:49 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 648,000 | 1,379,380 | 2.1287 |
02 Apr 2013 11:09:33 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 622,080 | 1,328,481 | 2.1355 |
01 Apr 2013 19:06:07 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 596,160 | 1,278,094 | 2.1439 |
01 Apr 2013 04:02:46 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 570,240 | 1,229,646 | 2.1564 |
31 Mar 2013 13:55:20 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 544,320 | 1,185,463 | 2.1779 |
13 Mar 2013 01:48:13 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 518,400 | 1,137,040 | 2.1934 |
12 Mar 2013 10:01:55 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 492,480 | 1,088,339 | 2.2099 |
11 Mar 2013 20:05:57 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 466,560 | 1,043,247 | 2.2360 |
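
The "Average (sec/TS)" column appears to be the cumulative CPU time divided by the timestep count reported in the same trickle. The sketch below checks that reading against two rows copied from the table above. The helper names `parse_int` and `seconds_per_timestep` are hypothetical and exist only for this illustration; they are not part of BOINC or the climateprediction.net site.

```python
def parse_int(field: str) -> int:
    """Convert a table cell such as '2,016,599' to an integer."""
    return int(field.replace(",", ""))


def seconds_per_timestep(row: str) -> float:
    """Return CPU seconds per timestep for one pipe-delimited trickle row.

    Assumes the column order used in the table above:
    time sent, host ID, result ID, result name, timestep, CPU time, average.
    """
    cells = [cell.strip() for cell in row.split("|")]
    timestep = parse_int(cells[4])
    cpu_seconds = parse_int(cells[5])
    return cpu_seconds / timestep


# Two rows copied verbatim from the trickle table (newest and oldest shown).
rows = [
    "12 Apr 2013 07:43:57 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 959,040 | 2,016,599 | 2.1027 |",
    "11 Mar 2013 20:05:57 | 996482 | 15637200 | hadcm3n_n44j_1920_40_008321845_0 | 466,560 | 1,043,247 | 2.2360 |",
]

for row in rows:
    reported = float(row.split("|")[6])
    computed = seconds_per_timestep(row)
    print(f"reported={reported:.4f} computed={computed:.4f}")
```

Both computed values agree with the reported column to four decimal places, which supports the interpretation that the average is cumulative rather than per-trickle.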