From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/78488 Path: news.gmane.org!not-for-mail From: Ted Zlatanov Newsgroups: gmane.emacs.gnus.general Subject: Re: Splitting based on character sets Date: Tue, 12 Apr 2011 12:20:08 -0500 Organization: =?utf-8?B?0KLQtdC+0LTQvtGAINCX0LvQsNGC0LDQvdC+0LI=?= @ Cienfuegos Message-ID: <87lizfxu7r.fsf@lifelogs.com> References: <87y63qb0b9.fsf@lifelogs.com> <87vcyjo1pl.fsf@topper.koldfront.dk> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1302629135 17478 80.91.229.12 (12 Apr 2011 17:25:35 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Tue, 12 Apr 2011 17:25:35 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M26791@lists.math.uh.edu Tue Apr 12 19:25:32 2011 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Q9hLH-000791-Lw for ding-account@gmane.org; Tue, 12 Apr 2011 19:25:31 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1Q9hL0-0003vE-OF; Tue, 12 Apr 2011 12:25:14 -0500 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1Q9hKz-0003v4-Mf for ding@lists.math.uh.edu; Tue, 12 Apr 2011 12:25:13 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx1.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.72) (envelope-from ) id 1Q9hKt-0006BV-PG for ding@lists.math.uh.edu; Tue, 12 Apr 2011 12:25:08 -0500 Original-Received: from lo.gmane.org ([80.91.229.12]) by quimby.gnus.org with esmtp (Exim 4.72) (envelope-from ) id 1Q9hKs-0001jZ-91 for ding@gnus.org; Tue, 12 Apr 2011 19:25:06 +0200 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1Q9hKs-0006tL-3w for ding@gnus.org; Tue, 12 Apr 2011 19:25:06 +0200 Original-Received: from 38.98.147.130 ([38.98.147.130]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 Apr 2011 19:25:06 +0200 Original-Received: from tzz by 38.98.147.130 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 12 Apr 2011 19:25:06 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 29 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 38.98.147.130 X-Face: bd.DQ~'29fIs`T_%O%C\g%6jW)yi[zuz6;d4V0`@y-~$#3P_Ng{@m+e4o<4P'#(_GJQ%TT= D}[Ep*b!\e,fBZ'j_+#"Ps?s2!4H2-Y"sx" User-Agent: Gnus/5.110016 (No Gnus v0.16) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:pRNFbuXAqxd8wXh91UzM9Kes/RE= X-Spam-Score: -0.7 (/) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:78488 Archived-At: On Tue, 12 Apr 2011 18:44:49 +0200 David Engster wrote: DE> Lars Magne Ingebrigtsen writes: >> Ted Zlatanov writes: >> >>> Use CRM114 before mail is delivered. It works *really well* for Russian >>> and any other languages (at least Bulgarian and Spanish in my experience). >> >> Hm... any particular reason CRM114 isn't used by SpamAssassin already? >> (I've just skimmed the CRM114 page, and I'm somewhat unclear about what >> it does. :-) DE> There is a crm114 plugin for spamassassin; it's in the "CoolThings" DE> section of the crm114 site. It may be that it's well suited for foreign DE> languages, but I tried it some time ago, and wasn't particularly DE> impressed, especially regarding the elaborated setup. The thing which DE> made me drop it was that I got false positives (yes, I read the docs and DE> trained it correctly). Middle-of-the-road Spamassassin in combination DE> with the Bayes-plugin, Razor and a few blacklists catches practically DE> all spam for me, without any false positives. I've been happy with CRM114 (since the first Spam Conference :) so I can't say why it didn't work for you. I like that it only has one way to classify spam, as opposed to the SA multi-pronged approach. It was definitely better against foreign languages than SA 5 years ago, when I tested it. Ted