From: "François Pinard" <pinard@iro.umontreal.ca>
Subject: Scoring problem for short messages
Date: 22 Feb 1999 21:35:00 -0500 [thread overview]
Message-ID: <oqhfsdluej.fsf@titan.progiciels-bpi.ca> (raw)
[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset=us-ascii, Size: 1540 bytes --]
Hi, people. Maybe someone will be kind enough to help me on this.
I receive many short messages which are created automatically on many
different systems, and filter them through a score file roughly looking like:
(
((and ("lines" 2 =)
(or ;; [...]
("body" "rapquot" s)
;; [...]
))
-1000)
)
The problem is that, depending on the distribution of cosmic rays along
Earth orbit, and probably other factors as well, I get a variable amount
of spurious white lines at end of messages, and even, sometimes, before
the first line.
The ideal would be that I go to each machine, recreate the exact conditions
of the invoice, try to reproduce the problem, understand it, and repair
all occurrences. But I really do not have the time to do that now, and
I would like some other solution in the meantime.
I could use some bigger number of lines and use an inequality, but then,
the score might be decreased if a message happens to contain `rapquot'
together with something else which then interests me. I would like the
score to be decreased /only/ if the message contains `rapquot' and nothing
else than white lines, say. Do you have an idea how I could manage this?
I also do not know when and how lines are counted. Is there a way to
arrange so that count excludes prior and subsequent white lines? I guess
this approach might also solve my little problem.
--
François Pinard mailto:pinard@iro.umontreal.ca
Join the free Translation Project! http://www.iro.umontreal.ca/~pinard
next reply other threads:[~1999-02-23 2:35 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
1999-02-23 2:35 François Pinard [this message]
1999-02-26 8:17 ` Lars Magne Ingebrigtsen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=oqhfsdluej.fsf@titan.progiciels-bpi.ca \
--to=pinard@iro.umontreal.ca \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).