Name | hadcm3n_o4bv_2020_40_007857213_2 |
Workunit | 8012325 |
Created | 4 Apr 2012, 22:52:35 UTC |
Sent | 4 Apr 2012, 22:56:19 UTC |
Report deadline | 5 Jul 2012, 6:23:30 UTC |
Received | 11 May 2012, 1:46:33 UTC |
Server state | Over |
Outcome | Computation error |
Client state | Compute error |
Exit status | 193 (0x000000C1) EXIT_SIGNAL |
Computer ID | 1196538 |
Run time | 22 days 7 hours 13 min 1 sec |
CPU time | 22 days 2 hours 28 min 29 sec |
Validate state | Invalid |
Credit | 12,441.60 |
Device peak FLOPS | 2.59 GFLOPS |
Application version | UK Met Office Coupled Model Full Resolution Ocean v6.07 i686-pc-linux-gnu |
Stderr | <core_client_version>7.0.24</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Signal 15 received, exiting... Called boinc_finish Signal 15 received, exiting... Called boinc_finish Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Controller:: CPDN process is not running, exiting, bRetVal = 1, checkPID=0, selfPID=8873, iMonCtr=1 Model crash detected, will try to restart... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 06:42:44 (3484): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:46:12 (24536): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:46:19 (24536): No heartbeat from core client for 30 sec - exiting 06:50:14 (24585): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:50:17 (24585): No heartbeat from core client for 30 sec - exiting 06:51:20 (24611): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:51:22 (24611): No heartbeat from core client for 30 sec - exiting 06:51:23 (24611): No heartbeat from core client for 30 sec - exiting 06:51:24 (24611): No heartbeat from core client for 30 sec - exiting 06:51:25 (24611): No heartbeat from core client for 30 sec - exiting 06:51:26 (24611): No heartbeat from core client for 30 sec - exiting 06:54:12 (24626): No heartbeat from core client for 30 sec - exiting 06:54:37 (24626): No heartbeat from core client for 30 sec - exiting 06:54:38 (24626): No heartbeat from core client for 30 sec - exiting 06:54:39 (24626): No heartbeat from core client for 30 sec - exiting 06:54:40 (24626): No heartbeat from core client for 30 sec - exiting 06:54:41 (24626): No heartbeat from core client for 30 sec - exiting 06:54:42 (24626): No heartbeat from core client for 30 sec - exiting 06:54:43 (24626): No heartbeat from core client for 30 sec - exiting 06:54:44 (24626): No heartbeat from core client for 30 sec - exiting 06:54:45 (24626): No heartbeat from core client for 30 sec 
- exiting 06:54:53 (24626): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 06:55:02 (24626): No heartbeat from core client for 30 sec - exiting 07:00:19 (24651): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:00:20 (24651): No heartbeat from core client for 30 sec - exiting 07:02:05 (24784): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:02:06 (24784): No heartbeat from core client for 30 sec - exiting 07:03:11 (24841): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:03:13 (24841): No heartbeat from core client for 30 sec - exiting 07:03:14 (24841): No heartbeat from core client for 30 sec - exiting 07:03:15 (24841): No heartbeat from core client for 30 sec - exiting 07:09:24 (24890): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:12:40 (25194): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:12:43 (25194): No heartbeat from core client for 30 sec - exiting 07:14:41 (25316): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:15:58 (25377): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:30 (25426): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:17:36 (25426): No heartbeat from core client for 30 sec - exiting 07:17:37 (25426): No heartbeat from core client for 30 sec - exiting 07:18:53 (25479): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:18:55 (25479): No heartbeat from core client for 30 sec - exiting 07:18:56 (25479): No heartbeat from core client for 30 sec - exiting 07:20:21 (25499): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:20:23 (25499): No heartbeat from core client for 30 sec - exiting 07:21:23 (25531): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:24:30 (25586): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:31:25 (25653): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:31:32 (25653): No heartbeat from core client for 30 sec - exiting 07:31:33 (25653): No heartbeat from core client for 30 sec - exiting 07:31:34 (25653): No heartbeat from core client for 30 sec - exiting 07:31:35 (25653): No heartbeat from core client for 30 sec - exiting 07:31:45 (25653): No heartbeat from core client for 30 sec - exiting 07:34:08 (25761): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 
07:43:35 (26023): No heartbeat from core client for 30 sec - exiting 07:44:23 (26023): No heartbeat from core client for 30 sec - exiting 07:44:24 (26023): No heartbeat from core client for 30 sec - exiting 07:44:25 (26023): No heartbeat from core client for 30 sec - exiting 07:44:26 (26023): No heartbeat from core client for 30 sec - exiting 07:44:34 (26023): No heartbeat from core client for 30 sec - exiting 07:44:35 (26023): No heartbeat from core client for 30 sec - exiting 07:44:52 (26023): No heartbeat from core client for 30 sec - exiting 07:44:53 (26023): No heartbeat from core client for 30 sec - exiting 07:44:54 (26023): No heartbeat from core client for 30 sec - exiting 07:44:55 (26023): No heartbeat from core client for 30 sec - exiting 07:44:56 (26023): No heartbeat from core client for 30 sec - exiting 07:44:59 (26023): No heartbeat from core client for 30 sec - exiting 07:45:00 (26023): No heartbeat from core client for 30 sec - exiting 07:45:01 (26023): No heartbeat from core client for 30 sec - exiting 07:45:02 (26023): No heartbeat from core client for 30 sec - exiting 07:45:03 (26023): No heartbeat from core client for 30 sec - exiting 07:45:23 (26023): No heartbeat from core client for 30 sec - exiting 07:45:24 (26023): No heartbeat from core client for 30 sec - exiting 07:45:25 (26023): No heartbeat from core client for 30 sec - exiting 07:45:26 (26023): No heartbeat from core client for 30 sec - exiting 07:45:27 (26023): No heartbeat from core client for 30 sec - exiting 07:45:28 (26023): No heartbeat from core client for 30 sec - exiting 07:45:30 (26023): No heartbeat from core client for 30 sec - exiting 07:45:31 (26023): No heartbeat from core client for 30 sec - exiting 07:45:32 (26023): No heartbeat from core client for 30 sec - exiting 07:45:33 (26023): No heartbeat from core client for 30 sec - exiting 07:45:34 (26023): No heartbeat from core client for 30 sec - exiting 07:45:51 (26023): No heartbeat from core client for 30 sec - 
exiting CPDN Monitor - No 'heartbeat' from BOINC... 07:48:44 (26047): No heartbeat from core client for 30 sec - exiting 07:49:31 (26047): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:01:49 (26065): No heartbeat from core client for 30 sec - exiting CPDN Monitor - No 'heartbeat' from BOINC... 08:01:50 (26065): No heartbeat from core client for 30 sec - exiting 08:01:51 (26065): No heartbeat from core client for 30 sec - exiting Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... Suspended CPDN Monitor - Suspend request from BOINC... 
*** glibc detected *** ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu: double free or corruption (out): 0x083a7d80 *** ======= Backtrace: ========= /lib32/libc.so.6(+0x6f121)[0xf755f121] /lib32/libc.so.6(+0x709a8)[0xf75609a8] /lib32/libc.so.6(cfree+0x6d)[0xf7563a5d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x1f)[0xf76e526f] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe6)[0xf7506e46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] ======= Memory map: ======== 08048000-080e3000 r-xp 00000000 08:06 655479 /home/micha/BOINC/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e3000-080e4000 rw-p 0009b000 08:06 655479 /home/micha/BOINC/projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu 080e4000-0813b000 rw-p 00000000 00:00 0 08352000-083b8000 rw-p 00000000 00:00 0 [heap] f6f00000-f6f21000 rw-p 00000000 00:00 0 f6f21000-f7000000 ---p 00000000 00:00 0 f7070000-f74eb000 rw-s 00000000 08:06 1059024 /home/micha/BOINC/slots/2/136080 f74f0000-f7647000 r-xp 00000000 08:02 3145758 /lib32/libc-2.13.so f7647000-f7649000 r--p 00156000 08:02 3145758 /lib32/libc-2.13.so f7649000-f764a000 rw-p 00158000 08:02 3145758 /lib32/libc-2.13.so f764a000-f764d000 rw-p 00000000 00:00 0 f7650000-f766c000 r-xp 00000000 08:02 10882488 /usr/lib32/libgcc_s.so.1 f766c000-f766d000 rw-p 0001b000 08:02 10882488 /usr/lib32/libgcc_s.so.1 f7670000-f7694000 r-xp 00000000 08:02 3145772 /lib32/libm-2.13.so f7694000-f7695000 r--p 00023000 08:02 3145772 /lib32/libm-2.13.so f7695000-f7696000 rw-p 00024000 08:02 3145772 /lib32/libm-2.13.so 
f7698000-f7778000 r-xp 00000000 08:02 10879110 /usr/lib32/libstdc++.so.6.0.17 f7778000-f777c000 r--p 000e0000 08:02 10879110 /usr/lib32/libstdc++.so.6.0.17 f777c000-f777d000 rw-p 000e4000 08:02 10879110 /usr/lib32/libstdc++.so.6.0.17 f777d000-f7784000 rw-p 00000000 00:00 0 f7788000-f778a000 r-xp 00000000 08:02 3145789 /lib32/libdl-2.13.so f778a000-f778b000 r--p 00001000 08:02 3145789 /lib32/libdl-2.13.so f778b000-f778c000 rw-p 00002000 08:02 3145789 /lib32/libdl-2.13.so f7790000-f77a5000 r-xp 00000000 08:02 3145794 /lib32/libpthread-2.13.so f77a5000-f77a6000 r--p 00014000 08:02 3145794 /lib32/libpthread-2.13.so f77a6000-f77a7000 rw-p 00015000 08:02 3145794 /lib32/libpthread-2.13.so f77a7000-f77a9000 rw-p 00000000 00:00 0 f77af000-f77b0000 rw-p 00000000 00:00 0 f77c3000-f77c4000 rw-p 00000000 00:00 0 f77c4000-f77c5000 ---p 00000000 00:00 0 f77c5000-f77c8000 rw-p 00000000 00:00 0 f77c8000-f77ca000 rw-s 00000000 08:06 1059021 /home/micha/BOINC/slots/2/boinc_mmap_file f77ce000-f77d0000 rw-p 00000000 00:00 0 f77d0000-f77ec000 r-xp 00000000 08:02 3145799 /lib32/ld-2.13.so f77ec000-f77ed000 r--p 0001b000 08:02 3145799 /lib32/ld-2.13.so f77ed000-f77ee000 rw-p 0001c000 08:02 3145799 /lib32/ld-2.13.so f77ee000-f77f0000 rw-p 00000000 00:00 0 f77f0000-f77f1000 r-xp 00000000 00:00 0 [vdso] ffeb3000-fff22000 rw-p 00000000 00:00 0 [stack] SIGABRT: abort called Stack trace (17 frames): ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(boinc_catch_signal+0x6f)[0x80b80df] [0xf77f0400] [0xf77f0425] /lib32/libc.so.6(gsignal+0x51)[0xf751ac01] /lib32/libc.so.6(abort+0x182)[0xf751e022] /lib32/libc.so.6(+0x6503d)[0xf755503d] /lib32/libc.so.6(+0x6f121)[0xf755f121] /lib32/libc.so.6(+0x709a8)[0xf75609a8] /lib32/libc.so.6(cfree+0x6d)[0xf7563a5d] /usr/lib32/libstdc++.so.6(_ZdlPv+0x1f)[0xf76e526f] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8057bc4] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x804f232] 
../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x8050491] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805112c] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu[0x805137a] /lib32/libc.so.6(__libc_start_main+0xe6)[0xf7506e46] ../../projects/climateprediction.net/hadcm3n_6.07_i686-pc-linux-gnu(__gxx_personality_v0+0x169)[0x804cb51] Exiting... </stderr_txt> ]]> |
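The exit status appears above in three forms: 193, 0xC1 and -63. A minimal sketch of how the three relate, assuming the -63 is simply the same byte read as a signed 8-bit value (an interpretation of the numbers, not something the log states; `exit_code_forms` is a hypothetical helper name):

```python
def exit_code_forms(code):
    """Return (unsigned, hex string, signed 8-bit) views of an exit byte."""
    unsigned = code & 0xFF  # keep only the low byte
    # Assumption: the negative figure is the two's-complement reading of that byte.
    signed = unsigned - 256 if unsigned >= 128 else unsigned
    return unsigned, f"0x{unsigned:02x}", signed


print(exit_code_forms(193))
```
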

Latest Trickles Received

Time Sent (UTC) | Host ID | Result ID | Result Name | Timestep | CPU Time (sec) | Average (sec/TS) |
---|---|---|---|---|---|---|
11 May 2012 01:48:08 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 1,036,800 | 1,909,702 | 1.8419 |
09 May 2012 23:29:39 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 1,010,880 | 1,861,192 | 1.8412 |
07 May 2012 00:17:17 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 984,960 | 1,813,051 | 1.8407 |
06 May 2012 03:37:27 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 959,040 | 1,767,200 | 1.8427 |
04 May 2012 16:43:11 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 933,120 | 1,721,074 | 1.8444 |
03 May 2012 09:51:26 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 907,200 | 1,673,182 | 1.8443 |
02 May 2012 20:25:30 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 881,280 | 1,625,229 | 1.8442 |
02 May 2012 01:11:22 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 855,360 | 1,576,590 | 1.8432 |
01 May 2012 09:27:46 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 829,440 | 1,527,826 | 1.8420 |
30 Apr 2012 19:59:43 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 803,520 | 1,479,727 | 1.8416 |
29 Apr 2012 19:44:13 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 777,600 | 1,430,681 | 1.8399 |
29 Apr 2012 05:17:24 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 751,680 | 1,382,710 | 1.8395 |
28 Apr 2012 14:19:02 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 725,760 | 1,336,687 | 1.8418 |
28 Apr 2012 02:13:48 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 699,840 | 1,293,282 | 1.8480 |
27 Apr 2012 06:29:09 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 673,920 | 1,246,720 | 1.8500 |
26 Apr 2012 10:29:00 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 648,000 | 1,199,145 | 1.8505 |
25 Apr 2012 20:50:26 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 622,080 | 1,150,757 | 1.8499 |
25 Apr 2012 05:33:40 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 596,160 | 1,103,135 | 1.8504 |
24 Apr 2012 16:24:53 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 570,240 | 1,055,998 | 1.8518 |
23 Apr 2012 23:23:48 | 1196538 | 14360877 | hadcm3n_o4bv_2020_40_007857213_2 | 544,320 | 1,007,955 | 1.8518 |
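
The Average (sec/TS) column appears to be the cumulative CPU time divided by the timestep count. The sketch below checks that reading against three rows copied from the table, and converts the reported CPU time of 22 days 2 hours 28 min 29 sec into seconds for comparison with the last trickle. The formula is an assumption inferred from the numbers, and the function names are hypothetical.

```python
def sec_per_timestep(cpu_seconds, timestep):
    """Cumulative CPU seconds per model timestep, rounded to 4 decimals."""
    return round(cpu_seconds / timestep, 4)


def duration_to_seconds(days, hours, minutes, seconds):
    """Convert a days/hours/minutes/seconds duration to total seconds."""
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


# (timestep, CPU time in sec, reported average) taken from the trickle table
rows = [
    (1_036_800, 1_909_702, 1.8419),
    (1_010_880, 1_861_192, 1.8412),
    (544_320, 1_007_955, 1.8518),
]

for timestep, cpu, reported in rows:
    print(timestep, sec_per_timestep(cpu, timestep), reported)

# Reported CPU time for the task: 22 days 2 hours 28 min 29 sec
print(duration_to_seconds(22, 2, 28, 29))
```

Under that assumption the computed averages match the reported column to four decimals, and the task's total CPU time works out to within a few seconds of the final trickle's CPU time.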
©2024 cpdn.org