Gnus development mailing list
 help / color / mirror / Atom feed
From: Ted Zlatanov <tzz@lifelogs.com>
Cc: John Owens <jowens@ece.ucdavis.edu>
Subject: Re: spam-stat and base64 encoded messages
Date: Mon, 09 Jun 2003 16:06:42 -0400	[thread overview]
Message-ID: <4n1xy32ey5.fsf@lockgroove.bwh.harvard.edu> (raw)
In-Reply-To: <m3brx8ujt9.fsf@defun.localdomain> (Jesper Harder's message of "Mon, 09 Jun 2003 03:21:54 +0200")

On Mon, 09 Jun 2003, harder@myrealbox.com wrote:
> I think that to make this work correctly, you'll need to parse the
> MIME structure of the message, and then apply the proper decoding to
> the approriate parts.

Hmm, are you sure we need to do full MIME parsing?  That would slow
down the incoming mail splitting a lot, I would think.  But I don't
know all the Gnus MIME parsing functionality, or how fast it is.  See
below for more questions.

John Owens (cc-ed on this) was asking about forwarded spam messages,
which are inside an envelope from SpamAssassin.  That's another case
where spam-split or spam-stat-split has to do a lot of parsing.  Maybe
there's a better way?  

We can invoke spam-split or spam-stat-split on each part of the
messages, then if they return t we know it's ham; if they return a
string it's spam, and nil means the part was neither.  In other words,
we don't care about the deep structure, for instance one attachment
inside another.  We just want to find MIME boundaries, take the text
up to the next MIME boundary (even if it includes other MIME
boundaries), decode if needed (no decoding should be done on plain
text!), and analyze the part.  Is that possible already?

Referring to the decode-if-needed part, is the
gnus-article-decode-hook going to try decoding content even if it's
plain text or is there some detection done?  If not, spam.el and
spam-stat.el or gnus-art.el should do some heuristics.

Thanks
Ted



  reply	other threads:[~2003-06-09 20:06 UTC|newest]

Thread overview: 13+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-06-05  7:01 Oystein Viggen
2003-06-05 20:05 ` Ted Zlatanov
2003-06-06  2:02   ` Jesper Harder
2003-06-06  3:22     ` Ted Zlatanov
2003-06-06 15:30       ` Jesper Harder
2003-06-06 23:21         ` Oystein Viggen
2003-06-09  1:21           ` Jesper Harder
2003-06-09 20:06             ` Ted Zlatanov [this message]
2003-06-11 19:42               ` Jesper Harder
2003-08-02 21:17                 ` Alex Schroeder
2003-08-04  7:36                   ` Adam Sjøgren
2003-08-08  0:02                     ` Alex Schroeder
2003-06-06  1:59 ` Jesper Harder

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4n1xy32ey5.fsf@lockgroove.bwh.harvard.edu \
    --to=tzz@lifelogs.com \
    --cc=jowens@ece.ucdavis.edu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).