From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/26581 Path: main.gmane.org!not-for-mail From: Lars Magne Ingebrigtsen Newsgroups: gmane.emacs.gnus.general Subject: Re: Announce: nnwarchive Date: 10 Nov 1999 16:28:24 +0100 Organization: Programmerer Ingebrigtsen Sender: owner-ding@hpc.uh.edu Message-ID: References: <5biu3bd2dh.fsf@giga.cs.rochester.edu> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Trace: main.gmane.org 1035163762 20148 80.91.224.250 (21 Oct 2002 01:29:22 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 21 Oct 2002 01:29:22 +0000 (UTC) Return-Path: Original-Received: from lisa.math.uh.edu (lisa.math.uh.edu [129.7.128.49]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id KAA04775 for ; Wed, 10 Nov 1999 10:25:53 -0500 (EST) Original-Received: from sina.hpc.uh.edu (lists@Sina.HPC.UH.EDU [129.7.3.5]) by lisa.math.uh.edu (8.9.1/8.9.1) with ESMTP id JAB02947; Wed, 10 Nov 1999 09:25:51 -0600 (CST) Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Wed, 10 Nov 1999 09:26:09 -0600 (CST) Original-Received: from sclp3.sclp.com (root@sclp3.sclp.com [204.252.123.139]) by sina.hpc.uh.edu (8.9.3/8.9.3) with ESMTP id JAA18928 for ; Wed, 10 Nov 1999 09:25:58 -0600 (CST) Original-Received: from quimby.gnus.org (quimby.gnus.org [193.69.4.139]) by sclp3.sclp.com (8.8.5/8.8.5) with ESMTP id KAA04770 for ; Wed, 10 Nov 1999 10:25:26 -0500 (EST) Original-Received: (from news@localhost) by quimby.gnus.org (8.9.3/8.9.3) id QAA25045 for ding@gnus.org; Wed, 10 Nov 1999 16:27:36 +0100 (CET) Original-To: ding@gnus.org Original-Path: not-for-mail Original-Newsgroups: gnus.ding Original-Lines: 16 Original-NNTP-Posting-Host: quimbies.gnus.org Original-X-Trace: quimby.gnus.org 942247656 7613 193.69.4.148 (10 Nov 1999 15:27:36 GMT) Original-X-Complaints-To: usenet@quimby.gnus.org Original-NNTP-Posting-Date: 10 Nov 1999 15:27:36 GMT Mail-Copies-To: never X-Now-Playing: Chris Watson's _Outside the Circle of Fire_: "Tawny owls, calls on territorial boundaries, Bolam Lake, Northumberland" User-Agent: Gnus/5.070099 (Pterodactyl Gnus v0.99) XEmacs/21.2 (Sumida) X-Face: &w!^oO~dS|}-P0~ge{$c!h\ How about xql.el? XQL seems to be a fairly simple way to get stuff > from XML documents, and one would `only' have to normalize the HTML so > that it is valid HTML. w3 parses HTML very well; almost no matter how broken, w3 will give out reasonable results. The problem with HTML is that the elements are basically meaningless. Finding out what's what ("Is this number the number of articles in the topic or the number of authors?") is what is difficult. -- (domestic pets only, the antidote for overdose, milk.) larsi@gnus.org * Lars Magne Ingebrigtsen