Gnus development mailing list
 help / color / mirror / Atom feed
From: George McNinch <gmcninch@gmail.com>
To: ding@gnus.org
Subject: nnir + namazu: always <= 999 matches ?
Date: Wed, 05 Mar 2014 16:32:59 -0500	[thread overview]
Message-ID: <82siqws54k.fsf@jazz-via-gmail.com> (raw)

Hi,

Probably I'm doing something bone-headed, but I can't work out what.

I have some nnml groups, and I've configured nnir for use with the
namazu engine. For any search, the ephemeral groups produced by nnir
seem to contain no more than 999 matches even when namazu gets more hits
than 999, and I'm not sure how to override that max.

Here are more details.  My config files contain:

,----
| (setq gnus-secondary-select-methods
|       '(...
| 	  (nnml "mail"
| 	    (nnir-search-engine namazu)
| 	    (nnml-directory "/home/george/Mail/"))))
`----

and also              
              
,----
| (setq nnir-namazu-index-directory "/home/george/Config/namazu/")
| (setq nnir-namazu-remove-prefix "/home/george/Mail/")
`----

I've created the index via mknmz.

If I run the following (in an *eshell*, or what have you) to search for
mails containing (say) the phrase "linear":

,----
| namazu -c -q -a -s linear /home/george/Config/namazu/
`----

I get the answer => 1403

(the "-c" flag means: "print the count of the number of hits").

But, if I "GG" on one of the groups associated with this nnml server,
and type "linear", I'm prompted

,----
| How many articles from nnir-82y50pqfrq.fsf (available 999, default 200):
`----

To confirm this number, note that if I eval (length nnir-artlist) in a
scratch buffer, I get 999.

So where did the remaining 1403-999 hits go?

In fact, as far as I can tell (i.e. varying the search term), *all*
namazu searches result in an ephemeral buffer containing <= 999
articles. I didn't immediately see a setting to over-ride that size
limit (sorry in advance if I'm missed something obvious...), because I
can't seem to see how/where in nnir.el it is imposed.

Note that the *Messages* buffer confirms that namazu is being called
with "the same" args [except the -c] as above; here is the line from the
*Messages* buffer:

,----
| namazu args: -q -a -s linear /home/george/Config/namazu/
`----

And subsequent to running the search ["GG linear" in the group buffer]
the " *nnir*" buffer indeed contains 1403 "entries".

Looking at the *nnir* buffer in more detail, the first 821 hits have
score >=3, and entries 822 through 1403 have score 2.  So there is no
"obvious" (to me) reason for there to be only 999 hits.

For what it is worth, after changing back to find-grep:

,----
| (setq gnus-secondary-select-methods
|       '(...
| 	  (nnml "mail"
| ;	    (nnir-search-engine namazu)
|           (nnir-search-engine find-grep)
| 	    (nnml-directory "/home/george/Mail/"))))
`----

restarting emacs, starting gnus, marking all groups in the nnml+mail
server via #, and searching with "GG linear", I get approx. 1600 hits,
and (upon request) an ephemeral buffer containing all (~1600) matching
articles.

So the issue seems to be related to use of namazu in nnir.

(Note that I'm *not* complaining here about the difference between ~1600
find-grep hits and the ~1400 namazu hits ... )

I should have tried also swish-e for comparison, but it was slow in
building the index and I got impatient...

Thanks for any help!

All the best,
gm



-- 
   ^^^^^   George McNinch <gmcninch (at) gmail.com>





             reply	other threads:[~2014-03-05 21:32 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-03-05 21:32 George McNinch [this message]
2014-03-06 17:13 ` nnir + namazu: always <= 999 matches ? ANS: commas! George McNinch
2014-03-06 18:53   ` Andreas Schwab
2014-03-07 16:58   ` Lars Ingebrigtsen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=82siqws54k.fsf@jazz-via-gmail.com \
    --to=gmcninch@gmail.com \
    --cc=ding@gnus.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).