From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/81599 Path: news.gmane.org!not-for-mail From: David Engster Newsgroups: gmane.emacs.gnus.general Subject: Re: Does nnweb with Google work any more? Date: Wed, 14 Mar 2012 18:06:00 +0100 Message-ID: <87ipi7w1s7.fsf@randomsample.de> References: <87fwdy3lss.fsf@marauder.physik.uni-ulm.de> <87y5r8u2ad.fsf@randomsample.de> <87ty1wtwrm.fsf@randomsample.de> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1331744773 24378 80.91.229.3 (14 Mar 2012 17:06:13 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 14 Mar 2012 17:06:13 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M29879@lists.math.uh.edu Wed Mar 14 18:06:12 2012 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1S7reM-0001Ux-Sq for ding-account@gmane.org; Wed, 14 Mar 2012 18:06:11 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1S7reL-0005NW-C8; Wed, 14 Mar 2012 12:06:09 -0500 Original-Received: from mx2.math.uh.edu ([129.7.128.33]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1S7reK-0005NR-Il for ding@lists.math.uh.edu; Wed, 14 Mar 2012 12:06:08 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx2.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.76) (envelope-from ) id 1S7reF-0004hW-Tt for ding@lists.math.uh.edu; Wed, 14 Mar 2012 12:06:08 -0500 Original-Received: from randomsample.de ([83.169.19.17]) by quimby.gnus.org with esmtp (Exim 4.72) (envelope-from ) id 1S7reD-0003ie-T2 for ding@gnus.org; Wed, 14 Mar 2012 18:06:01 +0100 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=randomsample.de; s=a; h=Content-Type:MIME-Version:Message-ID:Date:References:In-Reply-To:Subject:To:From; bh=0qcCw3ubPK/eNET/HWf7yS3EDbxejuCxrYf2JFxCh3I=; b=sTfLV4H622xVPT3irzDfIa7a6LzngNUBMTxtW2l++5YP9XBKTSlxIn2vxj/NBDqPt8NGTzzTeQ3WY1p+6ka3OfjztxZ7ZA1piMHEDMZelcRBxffyD7eMqLHOSg8VwGT4; Original-Received: from dslc-082-082-177-250.pools.arcor-ip.net ([82.82.177.250] helo=spaten) by randomsample.de with esmtpsa (TLS1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1S7reD-0000It-Ch for ding@gnus.org; Wed, 14 Mar 2012 18:06:01 +0100 In-Reply-To: (Lars Magne Ingebrigtsen's message of "Wed, 14 Mar 2012 16:28:34 +0100") User-Agent: Gnus/5.110018 (No Gnus v0.18) Emacs/24.0.93 (gnu/linux) Mail-Copies-To: never Mail-Followup-To: ding@gnus.org X-Spam-Score: -2.0 (--) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:81599 Archived-At: Lars Magne Ingebrigtsen writes: > David Engster writes: > >> No, they just don't want to be crawled. A simple "-A foobar" will make >> it work. Also, adding "&output=gplain" will give raw text. > > Oh, nice. :-) > > curl -A foobar 'http://groups.google.com/group/rec.arts.sf.written/msg/eeb018dcf3c1688e?dmode=source&output=gplain' > > works fine. > > Then the only question is how to get from the Message-ID to the Google > ID. Let's see... the first URL had this snippet in the HTML: > > Michael Stemper wrote: In article<9rt27vF38...@mid.individual.net>, ...
http://groups.google.com/g/0897fef7/t/d00e330e9c82797a/d/eeb018dcf3c1688e > > Will there only be one of these URLs in the output? No idea. Maybe it would be safer to snarf the q=#eeb018... anchor from the title's target: