Name | hadcm3n_u31x_1980_40_007460410_1 |
Workunit | 7657913 |
Created | 22 Sep 2011, 23:40:44 UTC |
Sent | 24 Sep 2011, 2:27:32 UTC |
Report deadline | 24 Dec 2011, 9:54:43 UTC |
Received | 18 Nov 2011, 13:13:15 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1151942 |
Run time | 11 days 20 hours 50 min 16 sec |
CPU time | 7 days 4 hours 1 min 28 sec |
Validate state | Invalid |
Credit | 3,110.40 |
Device peak FLOPS | 2.23 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 03:11:16 (16028): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:11:21 (16028): No heartbeat from core client for 30 sec - exiting 03:16:22 (5829): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:16:37 (5829): No heartbeat from core client for 30 sec - exiting 03:16:38 (5829): No heartbeat from core client for 30 sec - exiting 03:16:39 (5829): No heartbeat from core client for 30 sec - exiting 03:16:40 (5829): No heartbeat from core client for 30 sec - exiting 03:16:41 (5829): No heartbeat from core client for 30 sec - exiting 03:21:15 (5960): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:27:32 (6053): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:58:36 (830): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 00:34:59 (30011): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 00:35:21 (30011): No heartbeat from core client for 30 sec - exiting 01:59:55 (13484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:16:19 (23974): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:17:17 (25346): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 02:47:19 (25429): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
02:47:20 (25429): No heartbeat from core client for 30 sec - exiting 02:47:21 (25429): No heartbeat from core client for 30 sec - exiting 02:47:22 (25429): No heartbeat from core client for 30 sec - exiting 02:47:23 (25429): No heartbeat from core client for 30 sec - exiting 02:47:24 (25429): No heartbeat from core client for 30 sec - exiting 02:47:25 (25429): No heartbeat from core client for 30 sec - exiting 03:02:33 (29105): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:45:32 (31130): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:33 (3998): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:46:34 (3998): No heartbeat from core client for 30 sec - exiting 06:05:10 (4618): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITHEAD: I/O error tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 08:51:07 (21853): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 03:21:11 (20141): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 01:38:37 (30342): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:12:36 (20900): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:13:41 (8237): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:30:37 (8445): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
13:07:18 (11852): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:11:09 (7676): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:14:52 (8576): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:18:40 (9609): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:37:15 (9812): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:41:24 (14049): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:42:48 (14939): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:54:49 (15180): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 13:57:56 (18239): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:04:50 (19512): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:18:01 (20411): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 14:58:35 (22876): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:02 (32069): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:04:48 (32275): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 15:42:10 (32703): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
15:42:11 (32703): No heartbeat from core client for 30 sec - exiting 15:42:12 (32703): No heartbeat from core client for 30 sec - exiting 15:42:13 (32703): No heartbeat from core client for 30 sec - exiting 15:42:14 (32703): No heartbeat from core client for 30 sec - exiting 15:42:15 (32703): No heartbeat from core client for 30 sec - exiting 15:42:16 (32703): No heartbeat from core client for 30 sec - exiting 15:42:17 (32703): No heartbeat from core client for 30 sec - exiting 15:42:18 (32703): No heartbeat from core client for 30 sec - exiting 15:42:19 (32703): No heartbeat from core client for 30 sec - exiting 15:42:20 (32703): No heartbeat from core client for 30 sec - exiting 15:42:21 (32703): No heartbeat from core client for 30 sec - exiting 15:44:27 (7604): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 16:22:39 (8599): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:04:35 (16065): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... Atmos Hold Restart file rename failed on atmos_restart.hold 19:21:20 (4938): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 19:21:22 (4938): No heartbeat from core client for 30 sec - exiting 19:22:26 (20427): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
19:22:27 (20427): No heartbeat from core client for 30 sec - exiting 19:22:28 (20427): No heartbeat from core client for 30 sec - exiting 19:22:29 (20427): No heartbeat from core client for 30 sec - exiting 19:22:30 (20427): No heartbeat from core client for 30 sec - exiting 19:22:31 (20427): No heartbeat from core client for 30 sec - exiting 19:22:32 (20427): No heartbeat from core client for 30 sec - exiting 19:22:33 (20427): No heartbeat from core client for 30 sec - exiting 19:22:34 (20427): No heartbeat from core client for 30 sec - exiting 19:22:35 (20427): No heartbeat from core client for 30 sec - exiting 19:22:36 (20427): No heartbeat from core client for 30 sec - exiting 19:22:37 (20427): No heartbeat from core client for 30 sec - exiting 19:22:38 (20427): No heartbeat from core client for 30 sec - exiting 19:22:39 (20427): No heartbeat from core client for 30 sec - exiting 19:22:40 (20427): No heartbeat from core client for 30 sec - exiting 19:22:41 (20427): No heartbeat from core client for 30 sec - exiting 19:22:42 (20427): No heartbeat from core client for 30 sec - exiting 19:22:43 (20427): No heartbeat from core client for 30 sec - exiting 19:31:30 (22410): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:10:44 (22611): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:06:42 (30890): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:06:57 (30890): No heartbeat from core client for 30 sec - exiting 03:04:05 (28235): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:06 (14129): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:20:08 (14129): No heartbeat from core client for 30 sec - exiting 03:20:09 (14129): No heartbeat from core client for 30 sec - exiting CPDN Monitor - Quit request from BOINC... 
BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 Error: Input file: dataout/u31xko.pji8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xko.pji8c10 Error: Input file: dataout/u31xko.pii8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xko.pii8c10 Error: Input file: dataout/u31xko.pfi8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xko.pfi8c10 Error: Input file: dataout/u31xka.phi8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xka.phi8c10 Error: Input file: dataout/u31xka.pgi8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xka.pgi8c10 Error: Input file: dataout/u31xka.pei8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xka.pei8c10 Error: Input file: dataout/u31xka.pdi8c10 is not a valid UM file. Error converting file to netcdf: dataout/u31xka.pdi8c10 03:35:26 (16868): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 03:35:27 (16868): No heartbeat from core client for 30 sec - exiting 17:55:26 (2337): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 18:24:06 (22913): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
18:24:07 (22913): No heartbeat from core client for 30 sec - exiting 18:27:18 (9927): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... *** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x09122a40 *** ======= Backtrace: ========= /lib32/libc.so.6(+0x6c231)[0xf7533231] /lib32/libc.so.6(+0x6dab8)[0xf7534ab8] /lib32/libc.so.6(cfree+0x6d)[0xf7537b9d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf7721a91] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf7721aed] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe6)[0xf74ddbd6] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 fb:00 44331351 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 fb:00 44331351 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 090cd000-09132000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f704a000-f74c5000 rw-s 00000000 fb:00 44313460 /var/lib/boinc-client/slots/18/133520 f74c5000-f74c7000 rw-p 00000000 00:00 0 f74c7000-f761a000 r-xp 00000000 fb:00 213647365 /lib32/libc-2.11.1.so f761a000-f761b000 ---p 00153000 fb:00 213647365 
/lib32/libc-2.11.1.so f761b000-f761d000 r--p 00153000 fb:00 213647365 /lib32/libc-2.11.1.so f761d000-f761e000 rw-p 00155000 fb:00 213647365 /lib32/libc-2.11.1.so f761e000-f7621000 rw-p 00000000 00:00 0 f7621000-f763e000 r-xp 00000000 fb:00 109451375 /usr/lib32/libgcc_s.so.1 f763e000-f763f000 r--p 0001c000 fb:00 109451375 /usr/lib32/libgcc_s.so.1 f763f000-f7640000 rw-p 0001d000 fb:00 109451375 /usr/lib32/libgcc_s.so.1 f7640000-f7664000 r-xp 00000000 fb:00 213647369 /lib32/libm-2.11.1.so f7664000-f7665000 r--p 00023000 fb:00 213647369 /lib32/libm-2.11.1.so f7665000-f7666000 rw-p 00024000 fb:00 213647369 /lib32/libm-2.11.1.so f7666000-f774f000 r-xp 00000000 fb:00 109451378 /usr/lib32/libstdc++.so.6.0.13 f774f000-f7750000 ---p 000e9000 fb:00 109451378 /usr/lib32/libstdc++.so.6.0.13 f7750000-f7754000 r--p 000e9000 fb:00 109451378 /usr/lib32/libstdc++.so.6.0.13 f7754000-f7755000 rw-p 000ed000 fb:00 109451378 /usr/lib32/libstdc++.so.6.0.13 f7755000-f775c000 rw-p 00000000 00:00 0 f775c000-f775e000 r-xp 00000000 fb:00 213647368 /lib32/libdl-2.11.1.so f775e000-f775f000 r--p 00001000 fb:00 213647368 /lib32/libdl-2.11.1.so f775f000-f7760000 rw-p 00002000 fb:00 213647368 /lib32/libdl-2.11.1.so f7760000-f7761000 rw-p 00000000 00:00 0 f7761000-f7776000 r-xp 00000000 fb:00 213647379 /lib32/libpthread-2.11.1.so f7776000-f7777000 r--p 00014000 fb:00 213647379 /lib32/libpthread-2.11.1.so f7777000-f7778000 rw-p 00015000 fb:00 213647379 /lib32/libpthread-2.11.1.so f7778000-f777a000 rw-p 00000000 00:00 0 f7786000-f7787000 rw-p 00000000 00:00 0 f7787000-f7788000 ---p 00000000 00:00 0 f7788000-f778b000 rw-p 00000000 00:00 0 f778b000-f778d000 rw-s 00000000 fb:00 44313380 /var/lib/boinc-client/slots/18/boinc_mmap_file f778d000-f778f000 rw-p 00000000 00:00 0 f778f000-f7790000 r-xp 00000000 00:00 0 [vdso] f7790000-f77ac000 r-xp 00000000 fb:00 213647362 /lib32/ld-2.11.1.so f77ac000-f77ad000 r--p 0001b000 fb:00 213647362 /lib32/ld-2.11.1.so f77ad000-f77ae000 rw-p 0001c000 fb:00 213647362 
/lib32/ld-2.11.1.so ff8b5000-ff926000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (19 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf778f400] [0xf778f430] /lib32/libc.so.6(gsignal+0x51)[0xf74f1921] /lib32/libc.so.6(abort+0x182)[0xf74f4d52] /lib32/libc.so.6(+0x6213d)[0xf752913d] /lib32/libc.so.6(+0x6c231)[0xf7533231] /lib32/libc.so.6(+0x6dab8)[0xf7534ab8] /lib32/libc.so.6(cfree+0x6d)[0xf7537b9d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x21)[0xf7721a91] /usr/lib32/libstdc++.so.6(_ZdaPv+0x1d)[0xf7721aed] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8053e8e] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe6)[0xf74ddbd6] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
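The exit status in this result is reported in three forms: 193, 0xC1, and -63. The following is a minimal sketch of how those values relate, assuming the -63 in the stderr message is the low byte of the status reinterpreted as a signed 8-bit integer. The function name `describe_exit_status` is hypothetical and is not part of BOINC or the CPDN application.

```python
from typing import Tuple


def describe_exit_status(status: int) -> Tuple[str, int]:
    """Return the hex form and the signed 8-bit reading of an exit status.

    Assumption: only the low byte of the status is meaningful here.
    """
    low_byte = status & 0xFF
    # Values 128..255 wrap to negative numbers in two's complement.
    signed = low_byte - 256 if low_byte >= 128 else low_byte
    return f"0x{low_byte:02x}", signed


hex_form, signed = describe_exit_status(193)
print(hex_form, signed)  # 0xc1 -63
```

Under that assumption, the three values shown in the Exit status row and in the stderr message describe the same byte.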
Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
18 Nov 2011 13:15:23 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 259,200 | 619,283 | 2.3892 |
15 Nov 2011 23:30:36 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 233,280 | 559,972 | 2.4004 |
15 Nov 2011 18:08:02 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 207,360 | 499,618 | 2.4094 |
15 Nov 2011 18:08:02 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 181,440 | 439,247 | 2.4209 |
15 Nov 2011 18:07:48 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 155,520 | 378,251 | 2.4322 |
10 Nov 2011 08:43:44 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 129,600 | 314,002 | 2.4229 |
09 Nov 2011 13:18:57 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 103,680 | 249,733 | 2.4087 |
26 Sep 2011 13:22:52 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 77,760 | 195,595 | 2.5154 |
25 Sep 2011 17:58:37 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 51,840 | 130,637 | 2.5200 |
24 Sep 2011 23:08:03 | 1151942 | 13411873 | hadcm3n_u31x_1980_40_007460410_1 | 25,920 | 65,291 | 2.5189 |
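The Average (sec/TS) column appears to be the cumulative CPU time divided by the cumulative timestep count, rounded to four decimal places. The sketch below checks that reading against three rows of the table. The function name `seconds_per_timestep` is an illustrative assumption, not something taken from the CPDN site.

```python
def seconds_per_timestep(cpu_seconds: int, timesteps: int) -> float:
    """Average CPU seconds spent per model timestep."""
    return cpu_seconds / timesteps


# (timestep, CPU time in seconds) pairs copied from the trickle table.
trickles = [
    (259_200, 619_283),
    (129_600, 314_002),
    (25_920, 65_291),
]

for timesteps, cpu_seconds in trickles:
    average = seconds_per_timestep(cpu_seconds, timesteps)
    print(f"{timesteps:>7} timesteps: {average:.4f} sec/TS")
```

The formatted values match the table (2.3892, 2.4229, and 2.5189), which is consistent with the column being a running average rather than a per-trickle figure.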
©2024 cpdn.org