Name | hadcm3n_o3cn_2140_40_008269493_2 |
Workunit | 8424617 |
Created | 28 Apr 2013, 9:47:49 UTC |
Sent | 28 Apr 2013, 9:47:58 UTC |
Report deadline | 28 Jul 2013, 17:15:09 UTC |
Received | 26 May 2013, 13:25:21 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1263454 |
Run time | 5 days 22 hours 41 min 38 sec |
CPU time | 5 days 18 hours 9 min 25 sec |
Validate state | Invalid |
Credit | 9,331.20 |
Device peak FLOPS | 3.73 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.64</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 00:17:56 (2804): No heartbeat from core client for 30 sec - exiting 00:17:58 (2804): No heartbeat from core client for 30 sec - exiting 00:17:59 (2804): No heartbeat from core client for 30 sec - exiting 00:18:00 (2804): No heartbeat from core client for 30 sec - exiting 00:18:01 (2804): No heartbeat from core client for 30 sec - exiting 00:18:02 (2804): No heartbeat from core client for 30 sec - exiting 00:18:03 (2804): No heartbeat from core client for 30 sec - exiting 00:18:04 (2804): No heartbeat from core client for 30 sec - exiting 00:18:05 (2804): No heartbeat from core client for 30 sec - exiting 00:18:06 (2804): No heartbeat from core client for 30 sec - exiting 00:18:07 (2804): No heartbeat from core client for 30 sec - exiting 00:18:09 (2804): No heartbeat from core client for 30 sec - exiting 00:18:10 (2804): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 07:06:46 (1144): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 11:16:59 AM No files match the supplied pattern. MainError: 11:16:59 AM No files match the supplied pattern. MainError: 03:56:55 PM No files match the supplied pattern. MainError: 03:56:55 PM No files match the supplied pattern. MainError: 08:37:34 PM No files match the supplied pattern. MainError: 08:37:34 PM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 01:20:48 AM No files match the supplied pattern. MainError: 01:20:48 AM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... MainError: 06:12:45 AM No files match the supplied pattern. MainError: 06:12:45 AM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 10:51:00 AM No files match the supplied pattern. MainError: 10:51:00 AM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... MainError: 03:27:30 PM No files match the supplied pattern. MainError: 03:27:30 PM No files match the supplied pattern. CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... MainError: 08:15:18 PM No files match the supplied pattern. MainError: 08:15:18 PM No files match the supplied pattern. MainError: 01:00:50 AM No files match the supplied pattern. MainError: 01:00:50 AM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... MainError: 07:53:41 AM No files match the supplied pattern. MainError: 07:53:41 AM No files match the supplied pattern. Suspended CPDN Monitor - Suspend request from BOINC... Error converting file to netcdf: dataout/o3cnka.ph11c10 Error converting file to netcdf: dataout/o3cnka.pg11c10 Error converting file to netcdf: dataout/o3cnka.pe11c10 MainError: 12:40:45 AM No files match the supplied pattern. MainError: 12:40:45 AM No files match the supplied pattern. BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 Model crashed: STWORK : I/O error - PP fixed length header tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
20 May 2013 12:42:13 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 777,600 | 498,541 | 0.6411 |
20 May 2013 07:55:35 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 751,680 | 481,891 | 0.6411 |
20 May 2013 01:03:12 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 725,760 | 465,157 | 0.6409 |
19 May 2013 20:19:47 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 699,840 | 448,446 | 0.6408 |
19 May 2013 15:29:27 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 673,920 | 431,859 | 0.6408 |
19 May 2013 10:52:53 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 648,000 | 415,474 | 0.6412 |
19 May 2013 06:16:20 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 622,080 | 399,156 | 0.6416 |
19 May 2013 01:24:31 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 596,160 | 382,675 | 0.6419 |
18 May 2013 20:42:08 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 570,240 | 366,263 | 0.6423 |
18 May 2013 16:01:40 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 544,320 | 349,746 | 0.6425 |
18 May 2013 11:19:37 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 518,400 | 333,197 | 0.6427 |
18 May 2013 06:32:48 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 492,480 | 316,753 | 0.6432 |
18 May 2013 00:30:30 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 466,560 | 300,380 | 0.6438 |
17 May 2013 19:51:17 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 440,640 | 283,785 | 0.6440 |
17 May 2013 16:08:21 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 414,720 | 267,241 | 0.6444 |
17 May 2013 10:36:06 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 388,800 | 250,675 | 0.6447 |
17 May 2013 05:59:18 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 362,880 | 234,278 | 0.6456 |
14 May 2013 11:59:53 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 336,960 | 217,734 | 0.6462 |
12 May 2013 10:26:36 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 311,040 | 200,914 | 0.6459 |
12 May 2013 02:08:37 | 1263454 | 15753999 | hadcm3n_o3cn_2140_40_008269493_2 | 285,120 | 184,362 | 0.6466 |
©2024 cpdn.org