From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wi0-x235.google.com (mail-wi0-x235.google.com [IPv6:2a00:1450:400c:c05::235]) by hurricane.the-brannons.com (Postfix) with ESMTPS id 8EE967891C for ; Sat, 12 Sep 2015 06:18:39 -0700 (PDT) Received: by wicfx3 with SMTP id fx3so91525424wic.1 for ; Sat, 12 Sep 2015 06:21:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:cc:subject:message-id:references:mime-version :content-type:content-disposition:in-reply-to:user-agent; bh=tp/JpDGyn3OHB6k0xYZ+D3gPXGMiAwfSoR6k+LPg4r8=; b=xML4OFuSs/cRapyGmin4SaTatbQ1TJzqETndVyLcmTKOcl5Bop6EyuIzRylnfQ3kJB 9+4jWa9Njeu/KgHvhCj/Dgw1gtlJwmZ9c3Zsyd8l0sq/SMOVO7cSXXIbFviAzg8SPhzl 6E16qzbNWDNI42X9nXi4Cov0JUSFv6NSuKBIYhi8nF3mZLSMuvko2kZA+oqmLihReX+0 u+jxUv05efwGQAiAzETQPRDipQBCDyBagacd2GK/YtZsyCZCOuYZJp7WVIoFH7jEQKY4 aumRZ0fk9Zm6MRYwbib6/cfR/9iYs5dQqA2D6ikhNlEFLQRRr8A0/+25DFtEPsXt4svC cQGQ== X-Received: by 10.180.89.41 with SMTP id bl9mr6419652wib.85.1442064080750; Sat, 12 Sep 2015 06:21:20 -0700 (PDT) Received: from toaster.adamthompson.me.uk (toaster.adamthompson.me.uk. [2001:8b0:1142:9042::2]) by smtp.gmail.com with ESMTPSA id x10sm4305451wiy.6.2015.09.12.06.21.19 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sat, 12 Sep 2015 06:21:19 -0700 (PDT) Date: Sat, 12 Sep 2015 14:21:17 +0100 From: Adam Thompson To: Chris Brannon Cc: edbrowse-dev@lists.the-brannons.com Message-ID: <20150912132117.GE29720@toaster.adamthompson.me.uk> References: <20150911181459.GC29720@toaster.adamthompson.me.uk> <87r3m4u0id.fsf@mushroom.localdomain> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="XuV1QlJbYrcVoo+x" Content-Disposition: inline In-Reply-To: <87r3m4u0id.fsf@mushroom.localdomain> User-Agent: Mutt/1.5.23 (2014-03-12) Subject: Re: [Edbrowse-dev] Possible tidy5 alternative X-BeenThere: edbrowse-dev@lists.the-brannons.com X-Mailman-Version: 2.1.20 Precedence: list List-Id: Edbrowse Development List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 12 Sep 2015 13:18:40 -0000 --XuV1QlJbYrcVoo+x Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Fri, Sep 11, 2015 at 02:08:58PM -0700, Chris Brannon wrote: > Adam Thompson writes: >=20 > > Hi all, > > > > Following my previous email, I managed to locate another html5 parsing = library > > from google namely: > > Gumbo >=20 > I've also had libhubbub suggested to me. It's actively being used in > another independent browser project (netsurf), so it could be promising. My issue with libhubbub is exactly the fact that it's tied into netsurf's development, so much so that the Debian package for it was removed because = it's now packaged as part of netsurf. I appreciate the fact that being part of another browser means it's battle-tested, but after spidermonkey I'd rather have a library which is explicitly suppos= ed to be a library rather than something which can be used as a library but is developed as part of another browser. Cheers, Adam. ps: If none of the html options out there turns out to be of any help, at least html5 has standardised the parsing algorithm so we can make our pa= rser algorithmically correct, and we have a more maintainable design out of all = this. --XuV1QlJbYrcVoo+x Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQEcBAEBAgAGBQJV9CbNAAoJELZ22lNQBzHOpzAIAJhC9NH2Ia5wGD6qZdXh1mvt E71lzraFX/yXCXeo1e10KcMK0hSnpd7JUpt+j5d7xWCx+50OBrsSLJZKzjvE5EWL LkqguN7rA00uWnEkpCE7vb/+aMCt0xySI0NLELognA4IzwRO9WiNZHN/wXoXRDbD YzUpa+koKi0uLj12kwsVcXbZOi8uhYcU2Yydk6RgCZETtw2SuFAnH0O2yyoq0k+h 7bV/3Jzi29RKBs+7Ff71rX/kQznlMyJtRDW//ZYWZXJnvuTSDp9cWVLVFZHIoD+w 4nLNKRxmwykrrGqvfvQh0DoSbOzwD70fQ+8A6HqVnrcl4jFgodw4KXseuWkYKbQ= =k9D7 -----END PGP SIGNATURE----- --XuV1QlJbYrcVoo+x--