Name | hadcm3n_n33x_1960_40_008351029_0 |
Workunit | 8501890 |
Created | 17 Apr 2013, 21:08:17 UTC |
Sent | 17 Apr 2013, 21:08:37 UTC |
Report deadline | 18 Jul 2013, 4:35:48 UTC |
Received | 11 May 2013, 3:15:35 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 25 (0x00000019) Unknown error code |
Computer ID | 1217831 |
Run time | 7 days 12 hours 33 min 16 sec |
CPU time | 5 days 3 hours 33 min 36 sec |
Validate state | Invalid |
Credit | 4,354.56 |
Device peak FLOPS | 3.40 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 windows_intelx86 |
Stderr | <core_client_version>7.0.28</core_client_version> <![CDATA[ <message> The drive cannot locate a specific area or track on the disk. (0x19) - exit code 25 (0x19) </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 23:34:23 (6604): No heartbeat from core client for 30 sec - exiting 23:34:24 (6604): No heartbeat from core client for 30 sec - exiting 23:34:25 (6604): No heartbeat from core client for 30 sec - exiting 23:34:26 (6604): No heartbeat from core client for 30 sec - exiting 23:34:27 (6604): No heartbeat from core client for 30 sec - exiting 23:34:28 (6604): No heartbeat from core client for 30 sec - exiting 23:34:29 (6604): No heartbeat from core client for 30 sec - exiting 23:34:30 (6604): No heartbeat from core client for 30 sec - exiting 23:34:31 (6604): No heartbeat from core client for 30 sec - exiting 23:34:32 (6604): No heartbeat from core client for 30 sec - exiting 23:34:33 (6604): No heartbeat from core client for 30 sec - exiting 23:34:34 (6604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 23:34:35 (6604): No heartbeat from core client for 30 sec - exiting 23:34:36 (6604): No heartbeat from core client for 30 sec - exiting 23:38:32 (7976): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=33492, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 12:59:33 (89820): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 
17:41:28 (7088): No heartbeat from core client for 30 sec - exiting 17:41:29 (7088): No heartbeat from core client for 30 sec - exiting 17:41:30 (7088): No heartbeat from core client for 30 sec - exiting 17:41:31 (7088): No heartbeat from core client for 30 sec - exiting 17:41:32 (7088): No heartbeat from core client for 30 sec - exiting 17:41:33 (7088): No heartbeat from core client for 30 sec - exiting 17:41:34 (7088): No heartbeat from core client for 30 sec - exiting 17:41:35 (7088): No heartbeat from core client for 30 sec - exiting 17:41:36 (7088): No heartbeat from core client for 30 sec - exiting 17:41:37 (7088): No heartbeat from core client for 30 sec - exiting 17:41:38 (7088): No heartbeat from core client for 30 sec - exiting 17:41:39 (7088): No heartbeat from core client for 30 sec - exiting 17:41:40 (7088): No heartbeat from core client for 30 sec - exiting 17:41:41 (7088): No heartbeat from core client for 30 sec - exiting 17:41:42 (7088): No heartbeat from core client for 30 sec - exiting 17:41:43 (7088): No heartbeat from core client for 30 sec - exiting 17:41:44 (7088): No heartbeat from core client for 30 sec - exiting 17:41:45 (7088): No heartbeat from core client for 30 sec - exiting 17:41:46 (7088): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... CPDN Monitor - Quit request from BOINC... 
CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 19:07:43 (388052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 20:08:56 (389812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=114232, iMonCtr=1 Model crash detected, will try to restart... CPDN Monitor - Quit request from BOINC... 
BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 BUFFIN: C I/O Error feof - Unit 64 - Return code = 16 BUFFIN: C I/O Error feof - Unit 65 - Return code = 16 BUFFIN: C I/O Error feof - Unit 66 - Return code = 16 BUFFIN: C I/O Error feof - Unit 67 - Return code = 16 BUFFIN: C I/O Error feof - Unit 68 - Return code = 16 BUFFIN: C I/O Error feof - Unit 69 - Return code = 16 Error converting file to netcdf: dataout/n33xko.pjh4c10 Error converting file to netcdf: dataout/n33xko.pih4c10 Error converting file to netcdf: dataout/n33xko.pfh4c10 Error converting file to netcdf: dataout/n33xka.phh4c10 Error converting file to netcdf: dataout/n33xka.pgh4c10 Error converting file to netcdf: dataout/n33xka.peh4c10 Error converting file to netcdf: dataout/n33xka.pdh4c10 </stderr_txt> ]]> |
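The stderr text above is easier to interpret once its repeated messages are counted by type. The following is a minimal sketch of such a tally, run on a short excerpt copied from the log rather than on the full text; the function name `tally` and the category labels are illustrative assumptions, not part of BOINC or the CPDN software.

```python
import re
from collections import Counter

# Short excerpt of the stderr text above (not the full log).
sample = (
    "CPDN Monitor - Quit request from BOINC... "
    "23:34:23 (6604): No heartbeat from core client for 30 sec - exiting "
    "23:34:24 (6604): No heartbeat from core client for 30 sec - exiting "
    "23:38:32 (7976): No heartbeat from core client for 30 sec - exiting "
    "Model crash detected, will try to restart... "
    "BUFFIN: C I/O Error feof - Unit 63 - Return code = 16 "
)

# Matches entries such as "23:34:23 (6604): No heartbeat from core client".
# The number in parentheses is treated here as a process ID (an assumption).
HEARTBEAT = re.compile(
    r"(\d{2}:\d{2}:\d{2}) \((\d+)\): No heartbeat from core client"
)

def tally(text):
    """Count message types and group heartbeat timeouts by the bracketed ID."""
    counts = Counter()
    counts["quit_request"] = text.count("CPDN Monitor - Quit request from BOINC")
    counts["model_crash"] = text.count("Model crash detected")
    counts["buffin_error"] = text.count("BUFFIN: C I/O Error")
    heartbeat_by_pid = Counter(pid for _, pid in HEARTBEAT.findall(text))
    return counts, heartbeat_by_pid

counts, heartbeat_by_pid = tally(sample)
print(dict(counts))
print(dict(heartbeat_by_pid))
```

Passing the complete stderr text instead of `sample` would give the totals for the whole run.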
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 May 2013 03:15:39 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 362,880 | 444,485 | 1.2249 |
10 May 2013 12:53:02 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 336,960 | 411,906 | 1.2224 |
09 May 2013 19:50:24 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 311,040 | 380,454 | 1.2232 |
09 May 2013 08:51:05 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 285,120 | 350,459 | 1.2292 |
08 May 2013 17:16:24 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 259,200 | 320,514 | 1.2366 |
07 May 2013 18:47:01 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 233,280 | 290,382 | 1.2448 |
07 May 2013 06:00:57 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 207,360 | 260,134 | 1.2545 |
06 May 2013 16:06:13 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 181,440 | 228,691 | 1.2604 |
06 May 2013 00:46:07 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 155,520 | 195,746 | 1.2587 |
03 May 2013 03:21:59 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 129,600 | 160,496 | 1.2384 |
02 May 2013 13:30:01 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 103,680 | 129,700 | 1.2510 |
01 May 2013 23:28:10 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 77,760 | 99,226 | 1.2761 |
28 Apr 2013 02:09:03 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 51,840 | 67,000 | 1.2924 |
26 Apr 2013 20:22:55 | 1217831 | 15729790 | hadcm3n_n33x_1960_40_008351029_0 | 25,920 | 33,645 | 1.2980 |
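The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimals. The sketch below recomputes it from the Timestep and CPU Time values in the table; the variable and function names are illustrative, and the formula is inferred from the numbers rather than taken from CPDN documentation.

```python
# (timestep, cpu_seconds, reported_average) copied from the trickle table,
# listed oldest first.
trickles = [
    (25_920, 33_645, 1.2980),
    (51_840, 67_000, 1.2924),
    (77_760, 99_226, 1.2761),
    (103_680, 129_700, 1.2510),
    (129_600, 160_496, 1.2384),
    (155_520, 195_746, 1.2587),
    (181_440, 228_691, 1.2604),
    (207_360, 260_134, 1.2545),
    (233_280, 290_382, 1.2448),
    (259_200, 320_514, 1.2366),
    (285_120, 350_459, 1.2292),
    (311_040, 380_454, 1.2232),
    (336_960, 411_906, 1.2224),
    (362_880, 444_485, 1.2249),
]

def sec_per_timestep(timestep, cpu_seconds):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)

# Recomputed averages, in the same order as `trickles`.
recomputed = [sec_per_timestep(ts, cpu) for ts, cpu, _ in trickles]

# Timesteps completed between consecutive trickles.
intervals = [b[0] - a[0] for a, b in zip(trickles, trickles[1:])]

for (ts, cpu, reported), value in zip(trickles, recomputed):
    print(f"{ts:>7} timesteps: {value:.4f} sec/TS (reported {reported:.4f})")
```

In this table every trickle is 25,920 timesteps after the previous one, so `intervals` holds a single repeated value.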
©2024 cpdn.org