From: Simon Josefsson <jas@extundo.com>
Cc: ding@gnus.org
Subject: Re: nnml/nnfolder marks faster
Date: Fri, 14 Sep 2001 15:43:34 +0200 [thread overview]
Message-ID: <ilupu8uynj0.fsf@barbar.josefsson.org> (raw)
In-Reply-To: <87lmjki4b5.fsf@uwo.ca> (Dan Christensen's message of "Wed, 12 Sep 2001 15:42:38 -0400")
Dan Christensen <jdc+news@uwo.ca> writes:
> I have a few groups that have really large .mrk files, and this is
> causing slow performance. I've noticed this more since the new
> .mrk code was introduced, and it is in particular noticeable when
> I *exit* a group with a large .mrk file. But I don't know if the
> new .mrk code is causing any problems, or whether it just makes
> them more visible.
I guess the latter, I think the problem you mention existed earlier
(because the backend marks code hasn't touched that code).
> As an example, my nnfolder:debian.big group has about 1000 articles
> in it, and the .mrk file is somehow using about 840 bytes per article,
> which seems really excessive:
Yes, too much.
> Here are a few excerpts from the .mrk file which look suspicious:
>
> There are screenfuls and screenfuls of "seen" marks that look like:
>
> (184755 . 184758) (184759 . 184762)
>
> which should be merged into a single long list I would think.
That is probably better yes, but I don't know the seen mark, it seem
to be handled specially somehow. Lars?
> Also, the score caching has stuff like:
>
> (score (185776 . -1) (185820 . -1) (185962 . -1) (185831 . -1)
> (185991 . -1)
>
> which isn't even stored in numerical order (how is lookup done??).
> I also wonder if this could be stored in a collapsed form using ranges
>
> (score ((12345 . 15000) . -1)
>
> etc. (in cases where there are consecutive articles with the same
> score.)
>
> Hmm, looking further, it is the score entries that are taking up 99%
> of the .mrk file, and most of them don't correspond to articles that
> exist in the group! Shouldn't score cache entries be removed when
> the article is deleted from the group (e.g. by the expiry mechanism)?
> I smell a bug.
I'm not sure score marks should be in the file at all. Isn't the
scores calculated each time you enter the group? Maybe the score is
cached in .newsrc/.mark?
Likewise, `downloaded', `cache' and `unsent' should probably never be
propagated to the backend.
next prev parent reply other threads:[~2001-09-14 13:43 UTC|newest]
Thread overview: 39+ messages / expand[flat|nested] mbox.gz Atom feed top
2001-09-08 19:38 Simon Josefsson
2001-09-12 19:42 ` Dan Christensen
2001-09-14 13:43 ` Simon Josefsson [this message]
2001-09-14 14:51 ` Paul Jarc
2001-09-14 17:32 ` Simon Josefsson
2001-09-14 18:08 ` Paul Jarc
2001-12-30 3:52 ` Lars Magne Ingebrigtsen
2001-12-31 10:02 ` Paul Jarc
2001-12-31 10:07 ` Lars Magne Ingebrigtsen
2001-12-31 10:28 ` Paul Jarc
2001-12-31 10:34 ` Lars Magne Ingebrigtsen
2001-12-31 10:56 ` Paul Jarc
2001-12-31 10:59 ` Lars Magne Ingebrigtsen
2001-12-31 11:20 ` Paul Jarc
2001-09-14 18:11 ` Dan Christensen
2001-09-14 18:23 ` Dan Christensen
2001-09-20 18:38 ` Simon Josefsson
2001-09-21 15:00 ` Dan Christensen
2001-09-21 16:23 ` Simon Josefsson
2001-09-22 15:29 ` Dan Christensen
2001-09-22 16:53 ` Simon Josefsson
2001-09-22 21:19 ` Paul Jarc
2001-09-22 22:50 ` Simon Josefsson
2001-09-23 6:17 ` Paul Jarc
2001-09-23 10:13 ` Simon Josefsson
2001-09-23 15:13 ` Harry Putnam
2001-09-23 16:56 ` Nevin Kapur
2001-09-23 20:46 ` Simon Josefsson
2001-09-23 20:39 ` Simon Josefsson
2001-09-23 22:50 ` Kai Großjohann
2001-09-25 3:25 ` Paul Jarc
2001-09-25 10:04 ` Simon Josefsson
2001-09-25 15:45 ` Paul Jarc
2001-09-25 16:53 ` Simon Josefsson
2001-09-25 17:08 ` Paul Jarc
2001-09-24 17:03 ` Dan Christensen
2001-09-24 17:40 ` Simon Josefsson
2001-09-25 2:28 ` Dan Christensen
2001-09-25 17:19 ` Simon Josefsson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ilupu8uynj0.fsf@barbar.josefsson.org \
--to=jas@extundo.com \
--cc=ding@gnus.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).