From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id LAA06395; Mon, 21 Oct 2002 11:37:36 +0200 (MET DST) X-Authentication-Warning: pauillac.inria.fr: majordomo set sender to owner-caml-list@pauillac.inria.fr using -f Received: from nez-perce.inria.fr (nez-perce.inria.fr [192.93.2.78]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id LAA05329 for ; Mon, 21 Oct 2002 11:37:35 +0200 (MET DST) Received: from fichte.ai.univie.ac.at (fichte.ai.univie.ac.at [131.130.174.156]) by nez-perce.inria.fr (8.11.1/8.11.1) with ESMTP id g9L9bXD00774 for ; Mon, 21 Oct 2002 11:37:34 +0200 (MET DST) Received: from fichte.ai.univie.ac.at (markus@localhost [127.0.0.1]) by fichte.ai.univie.ac.at (8.12.3/8.12.3/Debian -4) with ESMTP id g9L9bXEI003731; Mon, 21 Oct 2002 11:37:33 +0200 Received: (from markus@localhost) by fichte.ai.univie.ac.at (8.12.3/8.12.3/Debian -4) id g9L9bWIO003730; Mon, 21 Oct 2002 11:37:32 +0200 Date: Mon, 21 Oct 2002 11:37:32 +0200 From: Markus Mottl To: =?iso-8859-1?B?Suly9G1l?= Marant Cc: caml-list@inria.fr Subject: Re: [Caml-list] Announcement: SpamOracle Message-ID: <20021021093732.GB3139@fichte.ai.univie.ac.at> Mail-Followup-To: =?iso-8859-1?B?Suly9G1l?= Marant , caml-list@inria.fr References: <20020826151138.A32572@pauillac.inria.fr> <20021020104354.GA11059@iliana> <20021020204946.GP31578@cs.unibo.it> <87vg3wvn8b.fsf@marant.org> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable In-Reply-To: <87vg3wvn8b.fsf@marant.org> User-Agent: Mutt/1.4i Organization: Austrian Research Institute for Artificial Intelligence Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk On Sun, 20 Oct 2002, J=E9r=F4me Marant wrote: > I've already tried spamoracle: I fed it with about 2000 spams and 3000 > good mails and it too often considered good mail as spam. To add my experience with spamoracle, I use it on a regular basis and am very content with its performance. Of course, this is really a matter of what kind of e-mail you usually get. It has happened only about 4 times so far (since end of August - I get about 30 mails per day) that it misclassified admittedly "strange looking" e-mail (no contents, only attachments). I have trained it using about 1000 spam and 10000 good mails. Though this is probably quite obvious anyway, I'd like to point out that it is really important that all of the good and spam mails are the ones that you have personally received. If you just take any kind of spam or good mails, performance will definitely suffer. If you absolutely don't want to miss good mails, you'll have to regularly look at your spam folder. Even in this case spamoracle is very helpful, because it decreases total entropy, i.e. makes it easier for you to classify things with your own eyes. Regards, Markus Mottl --=20 Markus Mottl markus@oefai.at Austrian Research Institute for Artificial Intelligence http://www.oefai.at/~markus ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners