From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/61917 Path: news.gmane.org!not-for-mail From: Andreas Seltenreich Newsgroups: gmane.emacs.gnus.general Subject: Re: nnweb + Gmane search Date: Thu, 09 Feb 2006 05:47:23 +0100 Message-ID: <871wyd9m04.fsf@gate450.dyndns.org> References: <877j9lob4v.fsf@gate450.dyndns.org> <87hd8pmtjf.fsf@gate450.dyndns.org> <87u0bqf4pn.fsf@gate450.dyndns.org> <87acdduewd.fsf@gate450.dyndns.org> <87vevqcs10.fsf_-_@gate450.dyndns.org> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1139460690 29501 80.91.229.2 (9 Feb 2006 04:51:30 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 9 Feb 2006 04:51:30 +0000 (UTC) Original-X-From: ding-owner+m10446@lists.math.uh.edu Thu Feb 09 05:51:26 2006 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from malifon.math.uh.edu ([129.7.128.13]) by ciao.gmane.org with esmtp (Exim 4.43) id 1F73mA-00063y-DE for ding-account@gmane.org; Thu, 09 Feb 2006 05:51:26 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu ident=lists) by malifon.math.uh.edu with smtp (Exim 3.20 #1) id 1F73m6-0002sv-00; Wed, 08 Feb 2006 22:51:22 -0600 Original-Received: from nas02.math.uh.edu ([129.7.128.40]) by malifon.math.uh.edu with esmtp (Exim 3.20 #1) id 1F73iQ-0002sq-00 for ding@lists.math.uh.edu; Wed, 08 Feb 2006 22:47:34 -0600 Original-Received: from quimby.gnus.org ([80.91.224.244]) by nas02.math.uh.edu with esmtp (Exim 4.52) id 1F73iN-00005r-SR for ding@lists.math.uh.edu; Wed, 08 Feb 2006 22:47:34 -0600 Original-Received: from smtp1.rz.uni-karlsruhe.de ([129.13.185.217]) by quimby.gnus.org with esmtp (Exim 3.35 #1 (Debian)) id 1F73iK-0006Hl-00 for ; Thu, 09 Feb 2006 05:47:28 +0100 Original-Received: from rzstud1.stud.uni-karlsruhe.de (rzstud1.stud.uni-karlsruhe.de [193.196.41.33]) by smtp1.rz.uni-karlsruhe.de with esmtp (Exim 4.50 #1) id 1F73iI-0002X1-Ht; Thu, 09 Feb 2006 05:47:28 +0100 Original-Received: from uwi7 by rzstud1.stud.uni-karlsruhe.de with local (Exim 3.36 #1) id 1F73iL-0003O7-00 for ding@gnus.org; Thu, 09 Feb 2006 05:47:30 +0100 Original-To: ding@gnus.org X-PGP-Key: 0x2C006B340F8C8C1B X-Face: $:F<87a[gD1?#R6S3j21cr1&C&7bd63GHC.tSdskUb}hhwG(ci*=D5kJ<_N+p9q(7-,PnG. Et.Yh Olly Betts writes: > Andreas Seltenreich writes: >> If an option to customise the number of results per page could be >> added to the Xapian-powered search, it'd be as usable via nnweb as the >> google one. > > It already exists, just pass a CGI parameter called HITSPERPAGE. The value > is clipped to the range 10-1000. Great, would it be okay to just set HITSPERPAGE to nnweb-max-hits and doing a single request per search? One could also decrease the default upper bound of 999 articles, but if this really is an issue, it would be better to hardcode a limit for the Gmane part of nnweb, since nnweb-max-hits is a user-customizable variable. >> Of course, if there was an option to output the result set as S-exps >> or nov-lines it'd be even better . > > I don't have enough information available to produce nov-lines unless some > fields aren't required (no message-id for example). It wouldn't have to be complete, just enough to present the hits to the user plus something in the xref header one can construct links to the raw articles from. Currently, nnweb parses the html output and generates nov-lines like the following: 19 (gmane.emacs.gnus.general) Re: nnweb fix (96%) Andreas Seltenreich gate450 dyndns org> 2006-01-31 05:48:02 GMT <87vevtxjjv.fsf@totally-fudged-out-message-id> 0 0 Xref: http://article.gmane.org/gmane.emacs.gnus.general/61805/raw > Other output formats aren't hard to add, but require me to update > two places if anything changes so are a bit more work to support. So > I'd prefer something more generic than S-exps. Now, does "more generic than S-exps" apply to nov-lines :-)? As stated, it already works by parsing the html output format, but it kind of pains me to waste bandwidth on useless markup, and those regexps are likely to break on markup changes. Thanks! Andreas