Name | hadcm3n_y8tx_1900_40_007344303_1 |
Workunit | 7541733 |
Created | 6 Jul 2011, 13:23:43 UTC |
Sent | 22 Jul 2011, 16:14:13 UTC |
Report deadline | 21 Oct 2011, 23:41:24 UTC |
Received | 16 Sep 2011, 0:45:11 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 22 (0x00000016) Unknown error code |
Computer ID | 1024632 |
Run time | 26 days 11 hours 33 min |
CPU time | 22 days 13 hours 11 min 29 sec |
Validate state | Invalid |
Credit | 11,197.44 |
Device peak FLOPS | 2.24 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> The device does not recognize the command. (0x16) - exit code 22 (0x16) </message> <stderr_txt> 16:03:54 (5236): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 08:55:15 (1420): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:01 (4104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:51:03 (4104): No heartbeat from core client for 30 sec - exiting 14:51:04 (4104): No heartbeat from core client for 30 sec - exiting 14:51:05 (4104): No heartbeat from core client for 30 sec - exiting 14:51:06 (4104): No heartbeat from core client for 30 sec - exiting 14:51:07 (4104): No heartbeat from core client for 30 sec - exiting 14:51:08 (4104): No heartbeat from core client for 30 sec - exiting 14:51:09 (4104): No heartbeat from core client for 30 sec - exiting 14:51:10 (4104): No heartbeat from core client for 30 sec - exiting 14:51:11 (4104): No heartbeat from core client for 30 sec - exiting 14:51:12 (4104): No heartbeat from core client for 30 sec - exiting 14:51:13 (4104): No heartbeat from core client for 30 sec - exiting 14:51:14 (4104): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 09:49:50 (5052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 09:49:52 (5052): No heartbeat from core client for 30 sec - exiting 09:49:53 (5052): No heartbeat from core client for 30 sec - exiting 09:49:54 (5052): No heartbeat from core client for 30 sec - exiting 09:49:55 (5052): No heartbeat from core client for 30 sec - exiting 09:49:56 (5052): No heartbeat from core client for 30 sec - exiting 09:49:57 (5052): No heartbeat from core client for 30 sec - exiting 09:49:58 (5052): No heartbeat from core client for 30 sec - exiting 09:49:59 (5052): No heartbeat from core client for 30 sec - exiting 09:50:00 (5052): No heartbeat from core client for 30 sec - exiting 09:50:01 (5052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 14:29:09 (440): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... BUFFOUT: C I/O Error - Return code = 32 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 BUFFOUT: C I/O Error - Return code = 32 Model crashed: STWORK : Error in PP_FILE tmp/pipe_dummy 2048 Sorry, too many model crashes! :-( Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
15 Sep 2011 05:46:07 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 933,120 | 1,939,320 | 2.0783 |
14 Sep 2011 12:08:54 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 907,200 | 1,877,089 | 2.0691 |
13 Sep 2011 19:15:19 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 881,280 | 1,816,098 | 2.0608 |
12 Sep 2011 22:17:00 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 855,360 | 1,753,688 | 2.0502 |
12 Sep 2011 03:55:01 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 829,440 | 1,689,967 | 2.0375 |
11 Sep 2011 09:32:12 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 803,520 | 1,625,390 | 2.0228 |
10 Sep 2011 16:02:50 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 777,600 | 1,564,138 | 2.0115 |
09 Sep 2011 22:50:45 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 751,680 | 1,503,788 | 2.0006 |
09 Sep 2011 05:21:38 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 725,760 | 1,443,337 | 1.9887 |
08 Sep 2011 12:51:28 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 699,840 | 1,384,689 | 1.9786 |
07 Sep 2011 19:32:55 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 673,920 | 1,325,135 | 1.9663 |
07 Sep 2011 02:48:45 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 648,000 | 1,264,560 | 1.9515 |
06 Sep 2011 08:40:12 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 622,080 | 1,203,701 | 1.9350 |
05 Sep 2011 15:10:22 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 596,160 | 1,141,504 | 1.9148 |
05 Sep 2011 06:48:09 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 570,240 | 1,083,575 | 1.9002 |
05 Sep 2011 06:48:09 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 544,320 | 1,026,208 | 1.8853 |
05 Sep 2011 06:48:09 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 518,400 | 968,662 | 1.8686 |
05 Sep 2011 06:48:09 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 492,480 | 910,517 | 1.8488 |
26 Aug 2011 05:19:01 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 466,560 | 851,879 | 1.8259 |
25 Aug 2011 12:18:08 | 1024632 | 13092472 | hadcm3n_y8tx_1900_40_007344303_1 | 440,640 | 793,879 | 1.8016 |
©2024 cpdn.org