From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-0.1 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 1026 invoked from network); 19 Oct 2022 09:13:46 -0000 Received: from hurricane.the-brannons.com (2602:ff06:725:1:20::25) by inbox.vuxu.org with ESMTPUTF8; 19 Oct 2022 09:13:46 -0000 Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by hurricane.the-brannons.com (OpenSMTPD) with ESMTP id 4abdf3d2 for ; Wed, 19 Oct 2022 02:13:45 -0700 (PDT) Received: from resqmta-a1p-077724.sys.comcast.net (resqmta-a1p-077724.sys.comcast.net [2001:558:fd01:2bb4::5]) by hurricane.the-brannons.com (OpenSMTPD) with ESMTPS id faec3425 (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256:NO) for ; Wed, 19 Oct 2022 02:13:39 -0700 (PDT) Received: from resomta-a1p-077060.sys.comcast.net ([96.103.145.238]) by resqmta-a1p-077724.sys.comcast.net with ESMTP id l544oRxJfI22gl58qow31P; Wed, 19 Oct 2022 09:13:36 +0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=comcast.net; s=20190202a; t=1666170816; bh=SyvKfxbIz8D0rV2XO8oYbtWxH7YiCjOpQJvlwzJ2JzU=; h=Received:Received:To:From:Reply-To:Subject:Date:Message-ID: MIME-Version:Content-Type; b=BuwUFQ8Mlfsb//GVpLgKahSipS0X/HMYu+OZEodUnij6pklfzK+mbZg9miFwRaDTj /McjtSveM+1qAh3xyNK6rR0tXTiuhTZwYoJrIBTIhAZ0VsKl8v3VdJGe3kKvE2/dfg YhwlrAm+xBmHZsdhtHVhXcEK1iasaSmm0IHytAWGIWWiU4WPzNIXR4R5kHq+V5iAj4 S25eVRgjXHWtZwC/Xccj+sEaiMa+aez1c5WA+F4a7EdDGvrmfQ/Ak4UmH/vzk6YGWM dzdrY6IuojQlM88WbhTkuMRVnQnp1fUtmfPT7o4U68sngQszg11QeprC3ySZuF/gTp EGRAh8wezuWOA== Received: from unknown ([IPv6:2601:408:c500:8ff0::27ed]) by resomta-a1p-077060.sys.comcast.net with ESMTPSA id l58ooNw0tbG4yl58po9t7D; Wed, 19 Oct 2022 09:13:36 +0000 X-Xfinity-VMeta: sc=0.00;st=legit To: edbrowse-dev@edbrowse.org From: Karl Dahlke Reply-To: Karl Dahlke References: <20220912185105.eklhad@comcast.net> User-Agent: edbrowse/3.8.5+ Subject: I don't know shit about xml Date: Wed, 19 Oct 2022 05:13:34 -0400 Message-ID: <20220919051334.eklhad@comcast.net> X-BeenThere: edbrowse-dev@edbrowse.org List-Id: Edbrowse Development List MIME-Version: 1.0 Content-Type: text/plain; format=flowed; delsp=no Content-Transfer-Encoding: 7bit Others have also pointed me to sgml, just, you know, if we want to understand the evolution of things. > garbage which people wrote (and continue to write) and browsers somehow turn > into something sane. Yes tidy did a lot of this for us, I didn't realize how much until I wrote my own html scanner. Ugh. I'm still making tweaks now and then. And yet, my scanner isn't much bigger than the interface code that connected to the tidy library, so there ya go. > the current direction seems to make sense. Yes I think so. Thank you. xml as received through xhr is now parsed as xml, and that may make a difference to some websites. We also do some of the cdata parsing and representation, which tidy would not be able to do for us. So this is the right path. Karl Dahlke