Gnus development mailing list
 help / color / mirror / Atom feed
* spam-stat regeneration notes
@ 2003-06-14  0:43 Bill White
  2003-06-14 11:57 ` François Pinard
  2003-06-15 18:38 ` Ted Zlatanov
  0 siblings, 2 replies; 3+ messages in thread
From: Bill White @ 2003-06-14  0:43 UTC (permalink / raw)


I've been using spam-stat for about 6 months now, and noticed lately
that spam processing was getting mightly slow - thanks (I suspect) to
the hashbusters spammers are putting in their messages.  I even do
(spam-stat-reduce-size) when quitting gnus each day, so the thing was
as small as possible.

So today I did my first rebuild of the spam-stat database.  That's not
bad in my book - 6 months for one constantly-growing database.  Here's
the code, which I should probably put in a function "spam-reset" or
something.

----------------------------------------------------------------------
;; Reset:
(spam-stat-reset)

;; Learn spam:
(spam-stat-process-spam-directory "/billw/Mail-2003/spam")

;; Learn non-spam:
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/01")
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/02")
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/03")
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/04")
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/05")
(spam-stat-process-non-spam-directory "/billw/Mail-2003/mail/misc/2003/06")

;; Reduce table size:
(spam-stat-reduce-size)

;; Save table:
(spam-stat-save)
----------------------------------------------------------------------

The results:

----------------------------------------------------------------------
-rw-rw-r--    1 billw    math      3066100 Jun 13 10:19 .spam-stat.el
-rw-rw-r--    1 billw    math       175109 Jun 13 19:22 .spam-stat.el
----------------------------------------------------------------------

Couple of questions:

- Has anyone else needed to regenerate a Bayesian hash system?

- Is there an easy way to run a function over an entire directory
  tree, while specifying which dirs to include or avoid?

Cheers -

bw
-- 
Bill White . billw@wolfram.com . http://members.wri.com/billw
"No ma'am, we're musicians."




^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2003-06-15 18:38 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-06-14  0:43 spam-stat regeneration notes Bill White
2003-06-14 11:57 ` François Pinard
2003-06-15 18:38 ` Ted Zlatanov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).