From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/81588 Path: news.gmane.org!not-for-mail From: Lars Magne Ingebrigtsen Newsgroups: gmane.emacs.gnus.general Subject: Re: Does nnweb with Google work any more? Date: Wed, 14 Mar 2012 16:28:34 +0100 Message-ID: References: <87fwdy3lss.fsf@marauder.physik.uni-ulm.de> <87y5r8u2ad.fsf@randomsample.de> <87ty1wtwrm.fsf@randomsample.de> NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain X-Trace: dough.gmane.org 1331738933 6886 80.91.229.3 (14 Mar 2012 15:28:53 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 14 Mar 2012 15:28:53 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M29868@lists.math.uh.edu Wed Mar 14 16:28:52 2012 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1S7q8C-0003jn-26 for ding-account@gmane.org; Wed, 14 Mar 2012 16:28:52 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1S7q88-0004ae-Jy; Wed, 14 Mar 2012 10:28:48 -0500 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1S7q87-0004aX-Ny for ding@lists.math.uh.edu; Wed, 14 Mar 2012 10:28:47 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx1.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.76) (envelope-from ) id 1S7q82-0000cV-Q5 for ding@lists.math.uh.edu; Wed, 14 Mar 2012 10:28:47 -0500 Original-Received: from hermes.netfonds.no ([80.91.224.195]) by quimby.gnus.org with esmtp (Exim 4.72) (envelope-from ) id 1S7q81-0001KI-88 for ding@gnus.org; Wed, 14 Mar 2012 16:28:41 +0100 Original-Received: from cm-84.215.51.58.getinternet.no ([84.215.51.58] helo=stories.gnus.org) by hermes.netfonds.no with esmtpsa (TLS1.0:DHE_RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1S7q7v-0008RL-6R for ding@gnus.org; Wed, 14 Mar 2012 16:28:35 +0100 Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAAGFBMVEVZNDI2HiD96MwIAQIT BgkjEBTesZyTY1mTHVyTAAACFklEQVQ4jVWTPW7jMBCFh9i12ZKIue5VaA8gHSCGKKS1YBFpFyqo lg6w4vXzhhw59lSyPs+8+Xkia0iiPFgOo6ytT0/hQo1nwL+nkmlfgHsCfnwpVYv4ErtylUOSTv6S 7wxSRiShISh9+qBfXU/5JwBcACb63XUU18ghIMzaGKvUDpYY15wUlGejtfrPIOcdqNKSoUx3BoY2 gCau9zqzKo0DOHveishBdjFeyXcAGPaMaivVhNl7dxAQJg1Qxp557PGt28G2NGtNuDAY+h0sTfPX yZ7mAWyo4MrAugMrjBAZg4DPpmla7RwDg/8HN2BX+N8fBgrvJ2XyyXtrJePIAAuxpHNCNeMYqDB9 MYhZGWctJC4k4AxtREya9wsZAWYr75t25YM5XJtOPAfpmoAUgLtDQgVaEgBWwoktTjgzyHEHbUx4 CyF3ewXcGcsoK2B5kIYPiT6HovEMWhysVuKM15SseIqS8SPCnT0AyF5GADfLZpiOspJYh1S+gnm8 CkDjaBhLqQB3+aq1VrYRwCDg4nagt6LdCXhPO0iaDXwa+oHX7j9iXeOC2Xgh8yNj3aSrSDqRaDPI pd8WK/hHsigGUzCfxdRGx1WpIJXwGcAWR2zVeXBYxe8Z5gbr6XzwFdgwsnthuOnWd350MOA7rsTf 7ewnN3gq3mYjsz2yfPGwIaFcj7K9t4EPLqGJpAl/USbhannj8yK+Abc2/PaRveAHAAAAAElFTkSu QmCC X-Now-Playing: Eurythmics's _1984 (For the Love of Big Brother)_ In-Reply-To: (David Engster's message of "Wed, 14 Mar 2012 16:24:00 +0100") User-Agent: Gnus/5.130004 (Ma Gnus v0.4) Emacs/24.0.94 (gnu/linux) X-MailScanner-ID: 1S7q7v-0008RL-6R MailScanner-NULL-Check: 1332343715.30572@5QPTGqUHRfYzlwG6TFDubg X-Spam-Status: No X-Spam-Score: -1.9 (-) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:81588 Archived-At: David Engster writes: > No, they just don't want to be crawled. A simple "-A foobar" will make > it work. Also, adding "&output=gplain" will give raw text. Oh, nice. :-) curl -A foobar 'http://groups.google.com/group/rec.arts.sf.written/msg/eeb018dcf3c1688e?dmode=source&output=gplain' works fine. Then the only question is how to get from the Message-ID to the Google ID. Let's see... the first URL had this snippet in the HTML: Michael Stemper wrote: In article<9rt27vF38...@mid.individual.net>, ...
http://groups.google.com/g/0897fef7/t/d00e330e9c82797a/d/eeb018dcf3c1688e Will there only be one of these URLs in the output? -- (domestic pets only, the antidote for overdose, milk.) bloggy blog http://lars.ingebrigtsen.no/