Gnus development mailing list
 help / color / mirror / Atom feed
From: Ted Zlatanov <tzz@lifelogs.com>
Cc: Hubert Chan <hubert@uhoreg.ca>
Subject: Re: spam.el: generic bayes interface?
Date: Wed, 21 Jan 2004 13:47:00 -0500	[thread overview]
Message-ID: <4n7jzldtqz.fsf@collins.bwh.harvard.edu> (raw)
In-Reply-To: <87d69e54qg.fsf@uhoreg.ca> (Hubert Chan's message of "Tue, 20 Jan 2004 23:02:15 -0500")

On Tue, 20 Jan 2004, hubert@uhoreg.ca wrote:

> Well, there are at least some good reasons that someone might want
> to use multiple Bayesian filters.  For example, one might want to
> just try out the effectiveness of one filter, while retaining their
> original filter as a backup.  Also, if one wishes to switch Bayesian
> filters, and does not have a corpus of spam/ham to train the filter,
> there would have to be a transition time during which the new filter
> is trained, while the old filter is still being used for splitting.
> And, of course, during this time, one would still want to keep
> training the old filter at the same time.

I'm OK with that, we can add a spam-use-generic-bayesian if it's
necessary.  I just think customization, registry tracking, and other
things won't work so well when we generalize the interface too much.

If you or someone else produces that generic-bayesian backend, I
don't see a problem with putting it in.  We can't anticipate the new
bayesian filters people might want to use, after all.

> This got me thinking, though, Ted, that the registration code for
> the spam/ham processors is pretty similar.  They seem to mostly work
> in one of two ways -- either register one at a time, or register
> multiple articles at a time in a mbox-style format.  

Yes, I've noticed that too after the 3rd time I wrote that code :)

> I think they all feed the articles via standard input.  I would
> imagine that we would be able to share a lot of common code.  Maybe
> write a function that feeds the article(s) to the registration
> program, and pass the name of the program and its arguments as
> arguments to that function.  Then the registration functions just
> have to call that function with the appropriate arguments.  Hmm.
> I'll have to look at the code to see if that would actually work...

It could work.  I've been trying to make the functions generic on the
API side, now it's time to make them generic on the backend side as
well.  I'm afraid it will make the code more complex, but adding new
backends should be significantly easier.

I'll work on gnus-encrypt.el first though, so feel free to start on
this if you have the interest :)

Thanks
Ted



  reply	other threads:[~2004-01-21 18:47 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-01-20 21:17 Reiner Steib
2004-01-21  0:08 ` Ted Zlatanov
2004-01-21  4:02   ` Hubert Chan
2004-01-21 18:47     ` Ted Zlatanov [this message]
2004-01-21 20:24       ` Hubert Chan
2004-01-22 18:23         ` Ted Zlatanov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4n7jzldtqz.fsf@collins.bwh.harvard.edu \
    --to=tzz@lifelogs.com \
    --cc=hubert@uhoreg.ca \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).