From: Felix Lee <flee@teleport.com>
Subject: adaptive word scoring
Date: Thu, 28 Nov 1996 21:25:08 -0800 [thread overview]
Message-ID: <199611290525.VAA00464@kim.teleport.com> (raw)
so after using adaptive word scoring for a while, I've
decided that it's mostly useless.
say you're an avid fan of alt.sex.pictures.emacs. the word
"gif" is fairly common and mostly neutral: you can't tell if
an article is interesting based on the word "gif".
however, adaptive scoring treats "gif" as significant in an
odd way. if you kill a massive series of "vi pinup gif"s,
then adaptive scoring is going to reduce the score of "gif"
by an amount proportional to the number of articles you've
killed. this significantly affects the score of those
really sexy emacs gifs.
ok, you could add "gif" to the ignored-word list, but this
is just one instance of a more general problem.
my current thoughts are:
- adaptive scoring should try to discover _useful_
discriminants by comparing interesting v. uninteresting
articles. the ignored-word list should be unnecessary.
- rather than adjusting score by N for every article marked,
marked articles should be assigned a score target, and
adaptive-scoring elements should be adjusted to try to hit
the target.
comments? I'm not sure how to implement this, yet.
--
next reply other threads:[~1996-11-29 5:25 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
1996-11-29 5:25 Felix Lee [this message]
1996-11-29 8:09 ` Kai Grossjohann
1996-11-29 22:48 ` Felix Lee
1996-11-30 13:18 ` Lars Magne Ingebrigtsen
1996-12-01 8:39 ` Felix Lee
1996-11-29 15:45 ` Jan Vroonhof
1996-11-30 2:28 ` Felix Lee
1996-12-02 9:37 ` Steinar Bang
1996-12-02 9:40 ` Wesley.Hardaker
1996-12-05 18:49 ` Lars Magne Ingebrigtsen
1996-12-06 8:18 ` Wesley.Hardaker
1996-12-02 11:46 ` Hans de Graaff
1996-12-02 15:08 ` Robert Bihlmeyer
1996-12-05 18:50 ` Lars Magne Ingebrigtsen
1996-12-05 21:21 ` Sean Lynch
1996-12-06 10:39 ` Lars Magne Ingebrigtsen
1996-12-08 22:19 ` Sean Lynch
1996-12-11 0:44 ` Lars Magne Ingebrigtsen
1996-12-06 21:02 ` Janne Sinkkonen
1996-12-08 22:48 ` Sean Lynch
1996-12-10 22:25 ` nnspool virtual server shows funny numbers of articles C. R. Oldham
1996-12-11 0:42 ` Lars Magne Ingebrigtsen
[not found] ` <vcn2vvixpz.fsf@totally-fudged-out-message-id>
1996-12-03 13:51 ` adaptive word scoring Holger Franz
-- strict thread matches above, loose matches on Subject: below --
1996-10-31 1:34 Adaptive " Sten Drescher
1996-11-05 15:51 ` Robert Bihlmeyer
1996-11-05 17:16 ` Per Abrahamsen
1996-11-05 21:24 ` Lars Magne Ingebrigtsen
1996-11-05 21:25 ` Lars Magne Ingebrigtsen
1996-08-04 2:57 Lars Magne Ingebrigtsen
1996-08-04 17:19 ` François Pinard
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=199611290525.VAA00464@kim.teleport.com \
--to=flee@teleport.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).