From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/84278 Path: news.gmane.org!not-for-mail From: George McNinch Newsgroups: gmane.emacs.gnus.general Subject: nnir + namazu: always <= 999 matches ? Date: Wed, 05 Mar 2014 16:32:59 -0500 Message-ID: <82siqws54k.fsf@jazz-via-gmail.com> Reply-To: George McNinch NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: ger.gmane.org 1394058942 23382 80.91.229.3 (5 Mar 2014 22:35:42 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Wed, 5 Mar 2014 22:35:42 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M32526@lists.math.uh.edu Wed Mar 05 23:35:49 2014 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1WLKPk-0005Zl-FB for ding-account@gmane.org; Wed, 05 Mar 2014 23:35:48 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1WLKP4-00068P-Ft; Wed, 05 Mar 2014 16:35:06 -0600 Original-Received: from mx2.math.uh.edu ([129.7.128.33]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1WLJRn-0005pW-Jj for ding@lists.math.uh.edu; Wed, 05 Mar 2014 15:33:51 -0600 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx2.math.uh.edu with esmtps (TLSv1:AES128-SHA:128) (Exim 4.76) (envelope-from ) id 1WLJRc-0001ql-V9 for ding@lists.math.uh.edu; Wed, 05 Mar 2014 15:33:50 -0600 Original-Received: from plane.gmane.org ([80.91.229.3]) by quimby.gnus.org with esmtp (Exim 4.80) (envelope-from ) id 1WLJRa-0006XZ-TM for ding@gnus.org; Wed, 05 Mar 2014 22:33:38 +0100 Original-Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1WLJRZ-00028c-K0 for ding@gnus.org; Wed, 05 Mar 2014 22:33:37 +0100 Original-Received: from dhcp-130-64-79-235.medford.tufts.edu ([130.64.79.235]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 05 Mar 2014 22:33:37 +0100 Original-Received: from gmcninch by dhcp-130-64-79-235.medford.tufts.edu with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 05 Mar 2014 22:33:37 +0100 X-Injected-Via-Gmane: http://gmane.org/ Original-Lines: 106 Original-X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: dhcp-130-64-79-235.medford.tufts.edu User-Mail-Address: gmcninch@gmail.com User-Agent: Gnus/5.13001 (Ma Gnus v0.10) Emacs/24.3.50 (gnu/linux) Cancel-Lock: sha1:n4Z7wWob66O54VAZohs8B9lpEn8= X-Spam-Score: -1.0 (-) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:84278 Archived-At: Hi, Probably I'm doing something bone-headed, but I can't work out what. I have some nnml groups, and I've configured nnir for use with the namazu engine. For any search, the ephemeral groups produced by nnir seem to contain no more than 999 matches even when namazu gets more hits than 999, and I'm not sure how to override that max. Here are more details. My config files contain: ,---- | (setq gnus-secondary-select-methods | '(... | (nnml "mail" | (nnir-search-engine namazu) | (nnml-directory "/home/george/Mail/")))) `---- and also ,---- | (setq nnir-namazu-index-directory "/home/george/Config/namazu/") | (setq nnir-namazu-remove-prefix "/home/george/Mail/") `---- I've created the index via mknmz. If I run the following (in an *eshell*, or what have you) to search for mails containing (say) the phrase "linear": ,---- | namazu -c -q -a -s linear /home/george/Config/namazu/ `---- I get the answer => 1403 (the "-c" flag means: "print the count of the number of hits"). But, if I "GG" on one of the groups associated with this nnml server, and type "linear", I'm prompted ,---- | How many articles from nnir-82y50pqfrq.fsf (available 999, default 200): `---- To confirm this number, note that if I eval (length nnir-artlist) in a scratch buffer, I get 999. So where did the remaining 1403-999 hits go? In fact, as far as I can tell (i.e. varying the search term), *all* namazu searches result in an ephemeral buffer containing <= 999 articles. I didn't immediately see a setting to over-ride that size limit (sorry in advance if I'm missed something obvious...), because I can't seem to see how/where in nnir.el it is imposed. Note that the *Messages* buffer confirms that namazu is being called with "the same" args [except the -c] as above; here is the line from the *Messages* buffer: ,---- | namazu args: -q -a -s linear /home/george/Config/namazu/ `---- And subsequent to running the search ["GG linear" in the group buffer] the " *nnir*" buffer indeed contains 1403 "entries". Looking at the *nnir* buffer in more detail, the first 821 hits have score >=3, and entries 822 through 1403 have score 2. So there is no "obvious" (to me) reason for there to be only 999 hits. For what it is worth, after changing back to find-grep: ,---- | (setq gnus-secondary-select-methods | '(... | (nnml "mail" | ; (nnir-search-engine namazu) | (nnir-search-engine find-grep) | (nnml-directory "/home/george/Mail/")))) `---- restarting emacs, starting gnus, marking all groups in the nnml+mail server via #, and searching with "GG linear", I get approx. 1600 hits, and (upon request) an ephemeral buffer containing all (~1600) matching articles. So the issue seems to be related to use of namazu in nnir. (Note that I'm *not* complaining here about the difference between ~1600 find-grep hits and the ~1400 namazu hits ... ) I should have tried also swish-e for comparison, but it was slow in building the index and I got impatient... Thanks for any help! All the best, gm -- ^^^^^ George McNinch