From: Michael Widerkrantz
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Read certain forums with Gnus (Canopus)
Date: Sat, 08 Jan 2005 20:54:18 +0100
Organization: Temple of the Moby Hack
Message-ID: <87mzvjpuhh.fsf@tim.hack.org>
Original-To: ding@gnus.org
User-Agent: Gnus/5.1006 (Gnus v5.10.6) Emacs/21.3 (berkeley-unix)

Jonas Steverud writes:

> I think it is possible using W3 and the nnweb interface - I have not
> looked into either but the forums are web based and not very
> obfuscated. The problem is that the design of the forum differs from
> place to place so if there are three forums forum.x.com,
> www.y.com/forums and foobar.z.com/chat, there is a risk we need to
> make nnforumsX, nnforumsY, and nnforumsZ. Even if all use UBBThreads
> or phpthread (or whatever the other larger project is called).

In general, I agree with you about the horridness of web forums, but
there are better ways of getting at them than web scraping:

Many of these forums are based on some database. If you can get
permission to access the database in a more direct way, you can throw
together an NNTP gateway.

There's a pretty nice framework known as Papercut to help you do this.
It's written in Python and fairly easy to work with.

I once did a plugin to it to access Gnuheter, the Swedish free
software news site. My plugin worked last I tested it, but that, I'm
afraid, was some time ago. What I did is here:

  http://hack.org/mc/hacks/papercut-gnuheter-1.1.tar.gz

-- 
M.C. Widerkrantz, http://hack.org/mc/
WAR IS PEACE. FREEDOM IS SLAVERY. ERASE IS BACKSPACE.
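
The database-to-NNTP idea in the message can be illustrated with a rough
sketch of the core mapping step: turning one forum row into a news
article with threading headers. This is not Papercut's plugin API and is
not taken from the Gnuheter plugin. The `posts` table, its columns, the
group name, and the host name are all hypothetical, and an in-memory
SQLite database stands in for whatever the forum actually runs on.

```python
import sqlite3
from datetime import datetime, timezone
from email.message import EmailMessage
from email.utils import format_datetime

# Hypothetical names, chosen only for illustration.
GROUP = "example.forum.general"
HOST = "forum.example.invalid"


def message_id(post_id):
    """Derive a stable Message-ID from the forum's primary key."""
    return f"<post-{post_id}@{HOST}>"


def row_to_article(row):
    """Turn one forum row into a news article with threading headers."""
    post_id, parent_id, author, subject, body, posted_at = row
    msg = EmailMessage()
    msg["From"] = author
    msg["Newsgroups"] = GROUP
    msg["Subject"] = subject
    posted = datetime.fromtimestamp(posted_at, tz=timezone.utc)
    msg["Date"] = format_datetime(posted)
    msg["Message-ID"] = message_id(post_id)
    if parent_id is not None:
        # A reply points at its parent so newsreaders can thread it.
        msg["References"] = message_id(parent_id)
    msg.set_content(body)
    return msg


def demo_database():
    """Build a tiny stand-in forum database (assumed schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE posts (id INTEGER PRIMARY KEY, parent_id INTEGER, "
        "author TEXT, subject TEXT, body TEXT, posted_at INTEGER)"
    )
    conn.executemany(
        "INSERT INTO posts VALUES (?, ?, ?, ?, ?, ?)",
        [
            (1, None, "alice@example.invalid", "Reading forums in Gnus",
             "Has anyone tried this?", 1105213458),
            (2, 1, "bob@example.invalid", "Re: Reading forums in Gnus",
             "An NNTP gateway works.", 1105217058),
        ],
    )
    return conn


def fetch_articles(conn):
    """Read every post in id order and convert each to an article."""
    rows = conn.execute(
        "SELECT id, parent_id, author, subject, body, posted_at "
        "FROM posts ORDER BY id"
    )
    return [row_to_article(row) for row in rows]


if __name__ == "__main__":
    for article in fetch_articles(demo_database()):
        print(article.as_string())
        print("-" * 40)
```

A real gateway would still need the NNTP side (answering commands such
as GROUP and ARTICLE), which is the part a framework like Papercut is
meant to provide. Deriving the Message-ID from the primary key keeps the
IDs stable between runs, so a newsreader does not see old posts as new.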