From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id SAA19663; Mon, 21 Jun 2004 18:16:36 +0200 (MET DST) X-Authentication-Warning: pauillac.inria.fr: majordomo set sender to owner-caml-list@pauillac.inria.fr using -f Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id SAA19667 for ; Mon, 21 Jun 2004 18:16:35 +0200 (MET DST) Received: from mail6.speakeasy.net (mail6.speakeasy.net [216.254.0.206]) by concorde.inria.fr (8.12.10/8.12.10) with ESMTP id i5LGGXSH003239 for ; Mon, 21 Jun 2004 18:16:34 +0200 Received: (qmail 2948 invoked from network); 21 Jun 2004 16:16:32 -0000 Received: from dialup-4.242.36.41.dial1.seattle1.level3.net (HELO sherlock.localdomain) (shawnw@[4.242.36.41]) (envelope-sender ) by mail6.speakeasy.net (qmail-ldap-1.03) with SMTP for ; 21 Jun 2004 16:16:32 -0000 Received: by sherlock.localdomain (Postfix, from userid 502) id 67F2AFD15; Mon, 21 Jun 2004 09:19:23 -0700 (PDT) Date: Mon, 21 Jun 2004 09:19:23 -0700 From: Shawn Wagner To: caml-list@inria.fr Subject: Re: [Caml-list] Parse crazy HTML, output XML Message-ID: <20040621161923.GZ595@speakeasy.org> Mail-Followup-To: caml-list@inria.fr References: <20040621160328.GA28952@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040621160328.GA28952@redhat.com> User-Agent: Mutt/1.4.2i X-Miltered: at concorde with ID 40D709E1.000 by Joe's j-chkmail (http://j-chkmail.ensmp.fr)! X-Loop: caml-list@inria.fr X-Spam: no; 0.00; shawnw:01 caml-list:01 2004:99 pxp:01 ocamlnet:01 shawnw:01 ocaml:01 speakeasy:01 speakeasy:01 parser:02 wrote:03 library:03 library:03 parse:04 parse:04 Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk On Mon, Jun 21, 2004 at 05:03:28PM +0100, Richard Jones wrote: > > The problem is the parsing phase. Both PXP and XmlLight will only > parse valid XML (as far as I can see). Is there any simple pure OCaml > library for parsing HTML and producing a DOM? > There's a html parser in the ocamlnet library. -- Shawn Wagner shawnw@speakeasy.org ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners