Name | hadam3p_eu_k0ii_2013_1_008534196_0 |
Workunit | 8681708 |
Created | 3 Mar 2014, 15:31:29 UTC |
Sent | 4 Mar 2014, 8:07:53 UTC |
Report deadline | 14 Feb 2015, 13:27:53 UTC |
Received | 6 Mar 2014, 16:08:47 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | -187 (0xFFFFFF45) ERR_RESULT_UPLOAD |
Computer ID | 1282508 |
Run time | 2 days 3 hours 10 min 40 sec |
CPU time | 1 days 23 hours 24 min 58 sec |
Validate state | Invalid |
Credit | 1,392.75 |
Device peak FLOPS | 1.92 GFLOPS |
Application version | UK Met Office HadAM3P-HadRM3P Europe v6.09 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.65</core_client_version> <![CDATA[ <message> upload failure </message> <stderr_txt> CPDN Monitor - Quit request from BOINC... 11:34:51 (4874): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 11:46:48 (13798): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 11:46:49 (13798): No heartbeat from core client for 30 sec - exiting 11:46:50 (13798): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... 12:03:08 (14714): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:16:02 (15345): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:20:15 (15997): No heartbeat from core client for 30 sec - exiting 12:21:54 (15997): No heartbeat from core client for 30 sec - exiting 12:21:55 (15997): No heartbeat from core client for 30 sec - exiting 12:21:56 (15997): No heartbeat from core client for 30 sec - exiting 12:21:57 (15997): No heartbeat from core client for 30 sec - exiting 12:21:58 (15997): No heartbeat from core client for 30 sec - exiting 12:21:59 (15997): No heartbeat from core client for 30 sec - exiting 12:22:00 (15997): No heartbeat from core client for 30 sec - exiting 12:22:01 (15997): No heartbeat from core client for 30 sec - exiting 12:22:39 (15997): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:25:46 (16479): No heartbeat from core client for 30 sec - exiting 12:25:50 (16479): No heartbeat from core client for 30 sec - exiting 12:25:51 (16479): No heartbeat from core client for 30 sec - exiting 12:25:52 (16479): No heartbeat from core client for 30 sec - exiting 12:25:53 (16479): No heartbeat from core client for 30 sec - exiting 12:25:54 (16479): No heartbeat from core client for 30 sec - exiting 12:25:55 (16479): No heartbeat from core client for 30 sec - exiting 12:28:26 (16479): No heartbeat from core client for 30 sec - exiting 12:28:27 (16479): No heartbeat from core client for 30 sec - exiting 12:28:28 (16479): No heartbeat from core client for 30 sec - exiting 12:28:29 (16479): No heartbeat from core client for 30 sec - exiting 12:28:30 (16479): No heartbeat from core client for 30 sec - exiting 12:28:31 (16479): No heartbeat from core client for 30 sec - exiting 12:28:32 (16479): No heartbeat from core client for 30 sec - exiting 12:28:33 (16479): No heartbeat from core client for 30 sec - exiting 12:28:34 (16479): No heartbeat from core client for 30 sec - exiting 12:29:37 (16479): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:34:21 (16908): No heartbeat from core client for 30 sec - exiting 12:36:54 (16908): No heartbeat from core client for 30 sec - exiting 12:36:55 (16908): No heartbeat from core client for 30 sec - exiting 12:36:56 (16908): No heartbeat from core client for 30 sec - exiting 12:37:04 (16908): No heartbeat from core client for 30 sec - exiting 12:37:05 (16908): No heartbeat from core client for 30 sec - exiting 12:37:06 (16908): No heartbeat from core client for 30 sec - exiting 12:37:43 (16908): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 12:57:04 (17256): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 12:57:05 (17256): No heartbeat from core client for 30 sec - exiting 12:57:06 (17256): No heartbeat from core client for 30 sec - exiting 12:57:07 (17256): No heartbeat from core client for 30 sec - exiting 12:57:08 (17256): No heartbeat from core client for 30 sec - exiting 12:57:09 (17256): No heartbeat from core client for 30 sec - exiting 12:57:10 (17256): No heartbeat from core client for 30 sec - exiting 12:57:11 (17256): No heartbeat from core client for 30 sec - exiting 12:57:12 (17256): No heartbeat from core client for 30 sec - exiting 12:57:13 (17256): No heartbeat from core client for 30 sec - exiting 12:57:14 (17256): No heartbeat from core client for 30 sec - exiting 12:57:15 (17256): No heartbeat from core client for 30 sec - exiting 12:58:18 (18210): No heartbeat from core client for 30 sec - exiting 12:58:53 (18210): No heartbeat from core client for 30 sec - exiting 12:58:54 (18210): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:01:00 (18378): No heartbeat from core client for 30 sec - exiting 13:01:01 (18378): No heartbeat from core client for 30 sec - exiting 13:02:05 (18378): No heartbeat from core client for 30 sec - exiting 13:02:16 (18378): No heartbeat from core client for 30 sec - exiting 13:02:17 (18378): No heartbeat from core client for 30 sec - exiting 13:02:18 (18378): No heartbeat from core client for 30 sec - exiting 13:02:19 (18378): No heartbeat from core client for 30 sec - exiting 13:02:20 (18378): No heartbeat from core client for 30 sec - exiting 13:02:21 (18378): No heartbeat from core client for 30 sec - exiting 13:02:22 (18378): No heartbeat from core client for 30 sec - exiting 13:02:23 (18378): No heartbeat from core client for 30 sec - exiting 13:02:24 (18378): No heartbeat from core client for 30 sec - exiting 13:02:25 (18378): No heartbeat from core client for 30 sec - exiting 13:02:26 (18378): No heartbeat from core client for 30 sec - exiting 13:02:27 (18378): No heartbeat from core client for 30 sec - exiting 13:02:28 (18378): No heartbeat from core client for 30 sec - exiting 13:02:29 (18378): No heartbeat from core client for 30 sec - exiting 13:04:22 (18378): No heartbeat from core client for 30 sec - exiting 13:04:23 (18378): No heartbeat from core client for 30 sec - exiting 13:04:24 (18378): No heartbeat from core client for 30 sec - exiting 13:04:25 (18378): No heartbeat from core client for 30 sec - exiting 13:04:26 (18378): No heartbeat from core client for 30 sec - exiting 13:04:27 (18378): No heartbeat from core client for 30 sec - exiting 13:04:28 (18378): No heartbeat from core client for 30 sec - exiting 13:04:29 (18378): No heartbeat from core client for 30 sec - exiting 13:04:30 (18378): No heartbeat from core client for 30 sec - exiting 13:04:31 (18378): No heartbeat from core client for 30 sec - exiting 13:04:32 (18378): No heartbeat from core client for 30 sec - exiting 13:04:33 (18378): No heartbeat from core client for 30 sec - exiting 13:04:34 (18378): No heartbeat from core client for 30 sec - exiting 13:04:35 (18378): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 forrtl: No space left on device forrtl: severe (38): error during write, unit 0, file /var/lib/boinc/projects/climateprediction.net/hadam3p_eu_k0ii_2013_1_008534196/dataout/xaakg.err Image PC Routine Line Source hadrm3p_eu_um_6.0 083C744D Unknown Unknown Unknown hadrm3p_eu_um_6.0 083C6245 Unknown Unknown Unknown hadrm3p_eu_um_6.0 08396C9F Unknown Unknown Unknown hadrm3p_eu_um_6.0 08352E0D Unknown Unknown Unknown hadrm3p_eu_um_6.0 08352757 Unknown Unknown Unknown hadrm3p_eu_um_6.0 0838CD8F Unknown Unknown Unknown hadrm3p_eu_um_6.0 083897C9 Unknown Unknown Unknown hadrm3p_eu_um_6.0 08069968 Unknown Unknown Unknown hadrm3p_eu_um_6.0 082CCDA2 Unknown Unknown Unknown libc.so.6 F0D10A35 Unknown Unknown Unknown Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=18693, iMonCtr=2 Model crash detected, will try to restart... Leaving CPDN_Main::Monitor... zip I/O error: No space left on device zip error: Output file write failure (write error on zip file) Called boinc_finish </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
06 Mar 2014 06:26:48 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 80,736 | 152,826 | 1.8929 |
06 Mar 2014 00:04:45 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 69,216 | 131,036 | 1.8931 |
05 Mar 2014 17:21:04 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 57,696 | 109,043 | 1.8900 |
05 Mar 2014 10:43:44 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 46,176 | 87,194 | 1.8883 |
05 Mar 2014 03:51:59 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 34,656 | 65,358 | 1.8859 |
04 Mar 2014 21:45:05 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 23,136 | 43,458 | 1.8784 |
04 Mar 2014 18:38:52 | 1282508 | 16316202 | hadam3p_eu_k0ii_2013_1_008534196_0 | 11,616 | 21,654 | 1.8642 |
©2024 cpdn.org