From: Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann)
Newsgroups: gmane.emacs.gnus.user
Subject: Re: spam-stat.el -- filtering spam based on statistics as suggested by Paul Graham
Date: Wed, 28 Aug 2002 13:14:49 +0200
Organization: University of Dortmund, Germany
References: <87hehtirad.fsf@emacswiki.org> <87wuqm6egm.fsf@emacswiki.org> <87lm6t5hwl.fsf@emacswiki.org> <87bs7nlr3m.fsf@emacswiki.org>

Alex Schroeder writes:

> The word size seems to be important.  Perhaps a lot of the Korean
> words are longer than 14 characters.

I doubt this.  But Korean does not use spaces between words, AFAIK, so
it is not so easy to find out where one word ends and the next one
starts.

kai
-- 
A large number of young women don't trust men with beards.  (BFBS Radio)