Name | hadcm3n_ofw3_1900_40_008475718_3 |
Workunit | 8626557 |
Created | 18 Feb 2014, 12:01:06 UTC |
Sent | 18 Feb 2014, 12:09:40 UTC |
Report deadline | 20 May 2014, 19:36:51 UTC |
Received | 17 Mar 2014, 22:42:42 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1313937 |
Run time | 18 days 0 hours 5 min 24 sec |
CPU time | 17 days 16 hours 56 min 29 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.62 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.27</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> 21:03:17 (30947): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:08:28 (31985): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:23:45 (32007): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:55:01 (32061): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:46:35 (32251): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... 06:23:49 (32753): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:12:14 (317): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:15:07 (482): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:16:18 (718): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:22:47 (2367): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:26:51 (2391): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:55:29 (2415): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:06:10 (10447): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:18:28 (10715): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:40:36 (11049): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... 20:54:05 (10729): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:59:30 (13052): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:16:39 (13226): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:27:57 (13682): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 22:31:21 (13951): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:25:42 (15503): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... CPDN Monitor - Quit request from BOINC... BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 63 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 64 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 65 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 66 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 67 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 68 - Return code = 1 BUFFIN: Read Failed: No such file or directory BUFFIN: C I/O Error feof - Unit 69 - Return code = 1 CPDN Monitor - Quit request from BOINC... 14:01:37 (4379): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:57:51 (26978): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 20:59:04 (4768): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:01:32 (4822): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:09:37 (4917): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:52 (5152): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:53 (5152): No heartbeat from core client for 30 sec - exiting 21:10:54 (5152): No heartbeat from core client for 30 sec - exiting 21:10:55 (5152): No heartbeat from core client for 30 sec - exiting 21:10:56 (5152): No heartbeat from core client for 30 sec - exiting 21:10:57 (5152): No heartbeat from core client for 30 sec - exiting 21:10:58 (5152): No heartbeat from core client for 30 sec - exiting 21:10:59 (5152): No heartbeat from core client for 30 sec - exiting 21:11:00 (5152): No heartbeat from core client for 30 sec - exiting 21:11:01 (5152): No heartbeat from core client for 30 sec - exiting 21:11:02 (5152): No heartbeat from core client for 30 sec - exiting 21:11:03 (5152): No heartbeat from core client for 30 sec - exiting 21:11:04 (5152): No heartbeat from core client for 30 sec - exiting 21:11:05 (5152): No heartbeat from core client for 30 sec - exiting 21:11:06 (5152): No heartbeat from core client for 30 sec - exiting 21:11:07 (5152): No heartbeat from core client for 30 sec - exiting 21:11:08 (5152): No heartbeat from core client for 30 sec - exiting 21:11:09 (5152): No heartbeat from core client for 30 sec - exiting 21:11:10 (5152): No heartbeat from core client for 30 sec - exiting 21:14:08 (5262): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:17:45 (5371): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:26:31 (5459): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:27:17 (5688): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:10:18 (5759): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 21:24:29 (8201): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 BUFFOUT: Write Failed: No space left on device BUFFOUT: C I/O Error - Return code = 1 Model crashed: WRITDUMP: BAD BUFFOUT OF DATA tmp/pipe_dummy 2048 20:59:06 (8625): No heartbeat from core client for 30 sec - exiting 20:59:07 (8625): No heartbeat from core client for 30 sec - exiting 20:59:08 (8625): No heartbeat from core client for 30 sec - exiting 20:59:09 (8625): No heartbeat from core client for 30 sec - exiting 20:59:10 (8625): No heartbeat from core client for 30 sec - exiting 20:59:11 (8625): No heartbeat from core client for 30 sec - exiting 20:59:12 (8625): No heartbeat from core client for 30 sec - exiting 20:59:13 (8625): No heartbeat from core client for 30 sec - exiting 20:59:14 (8625): No heartbeat from core client for 30 sec - exiting 20:59:15 (8625): No heartbeat from core client for 30 sec - exiting 20:59:16 (8625): No heartbeat from core client for 30 sec - exiting 20:59:17 (8625): No heartbeat from core client for 30 sec - exiting 20:59:18 (8625): No heartbeat from core client for 30 sec - exiting 20:59:19 (8625): No heartbeat from core client for 30 sec - exiting 20:59:20 (8625): No heartbeat from core client for 30 sec - exiting 20:59:21 (8625): No heartbeat from core client for 30 sec - exiting 20:59:22 (8625): No heartbeat from core client for 30 sec - exiting 20:59:23 (8625): No heartbeat from core client for 30 sec - exiting 20:59:24 (8625): No heartbeat from core client for 30 sec - exiting 20:59:25 (8625): No heartbeat from core client for 30 sec - exiting 20:59:26 (8625): No heartbeat from core client for 30 sec - exiting 20:59:27 (8625): No heartbeat from core client for 30 sec - exiting 20:59:28 (8625): No heartbeat from core client for 30 sec - exiting 20:59:29 (8625): No heartbeat from core client for 30 sec - exiting 20:59:30 (8625): No heartbeat from core client for 30 sec - exiting 20:59:31 (8625): No heartbeat from core client for 30 sec - exiting 20:59:32 (8625): No heartbeat from core client for 30 sec - exiting 20:59:33 (8625): No heartbeat from core client for 30 sec - exiting 20:59:34 (8625): No heartbeat from core client for 30 sec - exiting 20:59:35 (8625): No heartbeat from core client for 30 sec - exiting 20:59:36 (8625): No heartbeat from core client for 30 sec - exiting 20:59:37 (8625): No heartbeat from core client for 30 sec - exiting 20:59:38 (8625): No heartbeat from core client for 30 sec - exiting 20:59:39 (8625): No heartbeat from core client for 30 sec - exiting 20:59:40 (8625): No heartbeat from core client for 30 sec - exiting 20:59:41 (8625): No heartbeat from core client for 30 sec - exiting 20:59:42 (8625): No heartbeat from core client for 30 sec - exiting 20:59:43 (8625): No heartbeat from core client for 30 sec - exiting 20:59:44 (8625): No heartbeat from core client for 30 sec - exiting 20:59:45 (8625): No heartbeat from core client for 30 sec - exiting 21:00:53 (8625): No heartbeat from core client for 30 sec - exiting 21:01:09 (8625): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:25:07 (16581): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:27:04 (507): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... CPDN Monitor - Quit request from BOINC... *** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x09b601c8 *** ======= Backtrace: ========= /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x70f01)[0xf7589f01] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x72768)[0xf758b768] /lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xf758e8ad] /usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xf770d4bf] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xf752fe46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 ca:02 778248 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 ca:02 778248 /var/lib/boinc-client/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 09b0a000-09b70000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f709b000-f7516000 rw-s 00000000 ca:02 778460 /var/lib/boinc-client/slots/4/137135 f7516000-f7519000 rw-p 00000000 00:00 0 f7519000-f7676000 r-xp 00000000 ca:02 393741 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so f7676000-f7677000 ---p 0015d000 ca:02 393741 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so f7677000-f7679000 r--p 0015d000 ca:02 393741 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so f7679000-f767a000 rw-p 0015f000 ca:02 393741 /lib/i386-linux-gnu/i686/cmov/libc-2.13.so f767a000-f767d000 rw-p 00000000 00:00 0 f767d000-f7699000 r-xp 00000000 ca:02 1041545 /lib/i386-linux-gnu/libgcc_s.so.1 f7699000-f769a000 rw-p 0001b000 ca:02 1041545 /lib/i386-linux-gnu/libgcc_s.so.1 f769a000-f76be000 r-xp 00000000 ca:02 393724 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so f76be000-f76bf000 r--p 00023000 ca:02 393724 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so f76bf000-f76c0000 rw-p 00024000 ca:02 393724 /lib/i386-linux-gnu/i686/cmov/libm-2.13.so f76c0000-f77a0000 r-xp 00000000 ca:02 336568 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17 f77a0000-f77a4000 r--p 000e0000 ca:02 336568 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17 f77a4000-f77a5000 rw-p 000e4000 ca:02 336568 /usr/lib/i386-linux-gnu/libstdc++.so.6.0.17 f77a5000-f77ad000 rw-p 00000000 00:00 0 f77ad000-f77af000 r-xp 00000000 ca:02 393730 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so f77af000-f77b0000 r--p 00001000 ca:02 393730 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so f77b0000-f77b1000 rw-p 00002000 ca:02 393730 /lib/i386-linux-gnu/i686/cmov/libdl-2.13.so f77b1000-f77c6000 r-xp 00000000 ca:02 393740 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so f77c6000-f77c7000 r--p 00014000 ca:02 393740 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so f77c7000-f77c8000 rw-p 00015000 ca:02 393740 /lib/i386-linux-gnu/i686/cmov/libpthread-2.13.so f77c8000-f77ca000 rw-p 00000000 00:00 0 f77cd000-f77ce000 rw-p 00000000 00:00 0 f77ce000-f77cf000 ---p 00000000 00:00 0 f77cf000-f77d2000 rw-p 00000000 00:00 0 f77d2000-f77d4000 rw-s 00000000 ca:02 778457 /var/lib/boinc-client/slots/4/boinc_mmap_file f77d4000-f77d6000 rw-p 00000000 00:00 0 f77d6000-f77d7000 r-xp 00000000 00:00 0 [vdso] f77d7000-f77f3000 r-xp 00000000 ca:02 24589 /lib/i386-linux-gnu/ld-2.13.so f77f3000-f77f4000 r--p 0001b000 ca:02 24589 /lib/i386-linux-gnu/ld-2.13.so f77f4000-f77f5000 rw-p 0001c000 ca:02 24589 /lib/i386-linux-gnu/ld-2.13.so ff985000-ff9f5000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (17 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf77d6400] [0xf77d6430] /lib/i386-linux-gnu/i686/cmov/libc.so.6(gsignal+0x51)[0xf7543941] /lib/i386-linux-gnu/i686/cmov/libc.so.6(abort+0x182)[0xf7546d72] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x66e15)[0xf757fe15] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x70f01)[0xf7589f01] /lib/i386-linux-gnu/i686/cmov/libc.so.6(+0x72768)[0xf758b768] /lib/i386-linux-gnu/i686/cmov/libc.so.6(cfree+0x6d)[0xf758e8ad] /usr/lib/i386-linux-gnu/libstdc++.so.6(_ZdlPv+0x1f)[0xf770d4bf] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib/i386-linux-gnu/i686/cmov/libc.so.6(__libc_start_main+0xe6)[0xf752fe46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
Latest Trickles Received | ||||||
---|---|---|---|---|---|---|
Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
17 Mar 2014 22:42:31 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 1,036,800 | 1,529,788 | 1.4755 |
17 Mar 2014 13:11:03 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 1,010,880 | 1,495,522 | 1.4794 |
17 Mar 2014 03:44:42 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 984,960 | 1,461,515 | 1.4838 |
16 Mar 2014 18:16:24 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 959,040 | 1,427,404 | 1.4884 |
16 Mar 2014 08:47:02 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 933,120 | 1,393,386 | 1.4933 |
15 Mar 2014 23:20:32 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 907,200 | 1,359,425 | 1.4985 |
15 Mar 2014 13:51:57 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 881,280 | 1,325,382 | 1.5039 |
15 Mar 2014 01:24:29 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 855,360 | 1,290,692 | 1.5089 |
14 Mar 2014 15:38:20 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 829,440 | 1,255,480 | 1.5136 |
14 Mar 2014 05:53:26 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 803,520 | 1,220,321 | 1.5187 |
13 Mar 2014 20:01:20 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 777,600 | 1,185,109 | 1.5241 |
13 Mar 2014 10:13:19 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 751,680 | 1,149,842 | 1.5297 |
13 Mar 2014 00:51:19 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 725,760 | 1,113,742 | 1.5346 |
12 Mar 2014 13:57:23 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 699,840 | 1,077,064 | 1.5390 |
12 Mar 2014 03:46:41 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 673,920 | 1,041,941 | 1.5461 |
11 Mar 2014 16:55:36 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 648,000 | 1,003,939 | 1.5493 |
11 Mar 2014 05:54:02 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 622,080 | 974,662 | 1.5668 |
10 Mar 2014 17:02:54 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 596,160 | 937,635 | 1.5728 |
10 Mar 2014 03:59:07 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 570,240 | 890,710 | 1.5620 |
09 Mar 2014 14:06:57 | 1313937 | 16291628 | hadcm3n_ofw3_1900_40_008475718_3 | 544,320 | 844,289 | 1.5511 |
©2024 climateprediction.net