9fans - fans of the OS Plan 9 from Bell Labs
* [9fans] (no subject)
@ 2003-02-24 13:49 David Presotto
  2003-02-24 15:11 ` Sam
  0 siblings, 1 reply; 5+ messages in thread
From: David Presotto @ 2003-02-24 13:49 UTC (permalink / raw)
  To: 9fans

I just went through a thousand or so 9fans archived messages looking for spam.
About 70% were html or had html attachments.  A few valid messages also
had html attachments.  Just rejecting html would be helpful but
still let a lot through.

We could enter the arms race and throw an automated filter at it,
or go moderated (not really moderated, but people-filtered).
The latter is the only one likely to be 100% effective, but
a Bayesian filter might be good enough.  Even the flames on
the net are stylized enough to be recognizable as real
9fans mail.  Of course, even the Bayesian one needs one or
more moderators to reclassify anything it misclassifies,
so that the filter keeps up with a changing world,
so some people will have to do extra work.
However, it won't insert human delays into normal delivery.
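
As a rough illustration of the Bayesian option and the reclassification step described above, here is a minimal sketch in Python. It is not taken from any existing filter: the class name, the tokenizer, and the smoothing constants are all assumptions chosen for brevity, and the score only considers tokens that are present in the message.

```python
import math
import re
from collections import Counter

# Hypothetical tokenizer: lowercase runs of letters, digits and a few symbols.
TOKEN_RE = re.compile(r"[a-z0-9$'-]+")


def tokenize(text):
    return TOKEN_RE.findall(text.lower())


class BayesFilter:
    """Toy naive-Bayes-style scorer; an illustration, not a production filter."""

    def __init__(self):
        # Per-label count of messages containing each token.
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, text, label):
        self.counts[label].update(set(tokenize(text)))
        self.messages[label] += 1

    def untrain(self, text, label):
        self.counts[label].subtract(set(tokenize(text)))
        self.messages[label] -= 1

    def reclassify(self, text, old_label, new_label):
        # What a moderator would trigger after spotting a misclassified message.
        self.untrain(text, old_label)
        self.train(text, new_label)

    def spam_score(self, text):
        """Log-odds that the message is spam; positive leans spam."""
        n_spam = self.messages["spam"]
        n_ham = self.messages["ham"]
        # Add-one smoothing keeps unseen tokens from producing log(0).
        score = math.log((n_spam + 1) / (n_ham + 1))
        for tok in set(tokenize(text)):
            p_spam = (self.counts["spam"][tok] + 1) / (n_spam + 2)
            p_ham = (self.counts["ham"][tok] + 1) / (n_ham + 2)
            score += math.log(p_spam / p_ham)
        return score
```

Delivery would only consult `spam_score`, so no person sits in the normal path; moderators call `reclassify` after the fact when the filter gets something wrong.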



* Re: [9fans] (no subject)
  2003-02-24 13:49 [9fans] (no subject) David Presotto
@ 2003-02-24 15:11 ` Sam
  2003-02-24 15:14   ` [9fans] spam filtering Russ Cox
  0 siblings, 1 reply; 5+ messages in thread
From: Sam @ 2003-02-24 15:11 UTC (permalink / raw)
  To: 9fans

What list software are we using?  Is it smart
enough to be told to filter only new
messages, i.e., to let all messages pertaining
to an existing (recent) thread through?  If
so, we could be a little more strict on the
filtering.
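
One way to recognize "a message pertaining to an existing thread" is to look at the In-Reply-To and References headers. The sketch below assumes the list keeps a set of recently seen Message-IDs (the `recent_ids` parameter is hypothetical) and uses only Python's standard email module; it says nothing about what the list software itself supports.

```python
from email import message_from_string


def referenced_ids(msg):
    """Collect the Message-IDs this message claims to reply to."""
    ids = []
    for header in ("In-Reply-To", "References"):
        ids.extend(str(msg.get(header, "")).split())
    return ids


def belongs_to_known_thread(raw_message, recent_ids):
    """True if the message references any recently seen Message-ID."""
    msg = message_from_string(raw_message)
    return any(mid in recent_ids for mid in referenced_ids(msg))
```

Messages for which this returns True could skip the stricter checks. Since Message-IDs are visible in a public archive, this is a heuristic rather than a guarantee.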

Cheers,

Sam


* [9fans] spam filtering
  2003-02-24 15:11 ` Sam
@ 2003-02-24 15:14   ` Russ Cox
  2003-02-24 16:04     ` Sam
  0 siblings, 1 reply; 5+ messages in thread
From: Russ Cox @ 2003-02-24 15:14 UTC (permalink / raw)
  To: 9fans

As Scott said yesterday, the list uses Mailman.
http://www.gnu.org/software/mailman/mailman.html
As he also said, he can't do anything that isn't a stock
Mailman option.  If you want to suggest something,
please look in the documentation first.

We've all gotten more metaspam in the last two days
from this list than we have spam.  Please stop.

Russ




* Re: [9fans] spam filtering
  2003-02-24 15:14   ` [9fans] spam filtering Russ Cox
@ 2003-02-24 16:04     ` Sam
  2003-02-24 19:17       ` Sam
  0 siblings, 1 reply; 5+ messages in thread
From: Sam @ 2003-02-24 16:04 UTC (permalink / raw)
  To: 9fans


> As Scott said yesterday, the list uses Mailman.
> http://www.gnu.org/software/mailman/mailman.html
> As he also said, he can't do anything that isn't a stock
> Mailman option.  If you want to suggest something,
> please look in the documentation first.
>

Actually, what he said was anything that wasn't a stock
option "would require actual work."  I took that to
mean we're looking for a least effort solution to the
problem, not that we're handcuffed by Mailman.  I
think it's already been stated that the stock features
aren't sufficient.  Before I go hacking, which version
are we using?

Sam





* Re: [9fans] spam filtering
  2003-02-24 16:04     ` Sam
@ 2003-02-24 19:17       ` Sam
  0 siblings, 0 replies; 5+ messages in thread
From: Sam @ 2003-02-24 19:17 UTC (permalink / raw)
  To: 9fans

Scott,

I downloaded 2.0.13 since 2.0.11 wasn't
available.  The file
	$MAILMAN/Mailman/Handlers/SpamDetect.py
(where MAILMAN is your install directory)
is a hard-coded spam filter.  Adding
an RE for html or any other combination
of incantations seems easy.  The current code
does an OR (if any RE matches, fail).  I would
suggest adding an AND mechanism such that
any message lacking 'RE:' in the subject
header gets sent through a rigorous set
of RE filters (html, nigerian, etc).  I can
come up with something if you don't want to.
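
To make the suggestion concrete, here is a minimal standalone sketch of the AND idea: the strict checks run only when the subject does not look like a reply. This is not the contents of SpamDetect.py and does not use Mailman's handler interface; the function names and the patterns are hypothetical, and only Python's standard re and email modules are used.

```python
import re
from email import message_from_string

# Hypothetical patterns, for illustration only.
REPLY_SUBJECT = re.compile(r"^\s*re:", re.IGNORECASE)
STRICT_BODY_PATTERNS = [
    re.compile(r"<html\b", re.IGNORECASE),
    re.compile(r"\bnigeria", re.IGNORECASE),
]


def is_html(msg):
    """True if the message or any of its parts is declared as text/html."""
    return any(part.get_content_type() == "text/html" for part in msg.walk())


def body_text(msg):
    """Concatenate all text parts, decoding bytes leniently."""
    chunks = []
    for part in msg.walk():
        if part.get_content_maintype() == "text":
            payload = part.get_payload(decode=True) or b""
            chunks.append(payload.decode("latin-1", errors="replace"))
    return "\n".join(chunks)


def should_reject(raw_message):
    msg = message_from_string(raw_message)
    subject = str(msg.get("Subject", ""))
    # The AND part: strict filters apply only when the subject lacks "Re:".
    if REPLY_SUBJECT.search(subject):
        return False
    if is_html(msg):
        return True
    # The existing OR behaviour: any matching pattern fails the message.
    text = body_text(msg)
    return any(p.search(text) for p in STRICT_BODY_PATTERNS)
```

A spammer can trivially put "Re:" in the subject, so this buys strictness for new threads only as long as nobody targets the rule.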

Btw, that's some nasty Python.  I hope it's
not representative of the language.

Cheers,

Sam


