From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: * X-Spam-Status: No, score=1.9 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,FREEMAIL_FROM,PDS_OTHER_BAD_TLD autolearn=no autolearn_force=no version=3.4.4 Received: (qmail 24820 invoked from network); 2 Sep 2021 14:13:19 -0000 Received: from 4ess.inri.net (216.126.196.42) by inbox.vuxu.org with ESMTPUTF8; 2 Sep 2021 14:13:19 -0000 Received: from mail-pj1-f48.google.com ([209.85.216.48]) by 4ess; Thu Sep 2 08:39:50 -0400 2021 Received: by mail-pj1-f48.google.com with SMTP id l3so1247219pji.5 for <9front@9front.org>; Thu, 02 Sep 2021 05:39:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=a/QUzKfK38hnNq204o+K9u8vnZ3LWO6UnHmamZKJI5Y=; b=ljrqDG7qeSn2/xZ7uKWmZ2OgIVS37Vkx7kXRmXH+XlxdqaHLXjyjbF3cstn76ZJlap i04x9TnynsbkGKpfKhm7G73DOXfx+5u/WdSsvWAZGG+Oo5n5xX7NOdEGVsR+rLo0yEOQ /iM75QIfRJhtBX+TPjq5MPfwvhVFbScT/pqbBQMMdbMTj9EKgZVeF/wmyey+svwz98r4 x46HVJ6u+cCLcjdev0KkiJBdxXlXJoR1slvpyE4oSGkNJQjAAE6fiabZiBA7R1f/7p5V jEkg2G3vwHm21Kw3Vu2XkTfbUMRBbyxQk8cs+6bvPBi61ksoNFm4XF03EiRCijXjhHgz YgKA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=a/QUzKfK38hnNq204o+K9u8vnZ3LWO6UnHmamZKJI5Y=; b=CPbiS1VpS5GiueVCtdR1W/xq73LP2v1uH9vd9TOTCw9JF4wOCKUEMXyee4ML7/k+dY d/jDF0SW83co+0KdxNBNMVCvLOrVLWgzK/t54dCY9by3Cow6419SjUnqgSHQ3DHeZCDJ b/+4o4cREOH60Xy1r0HW7nZFA0+ZD9xDBnbjE2P38wnoq8CUqKDsEt3sC+ocWrwGDFVC LNbLiv4mQzY9TSheGGgVX73GA6xU5Ttr7MUCtOm790e/MfVm6v8Ge+VR3A9ynvyY2MFH sqpQKBMhTVLzuvy2knsDWXFfFXpPSk/mOmHNdiEXuMACCeOQu62EZNVVeL0m4NN8H7mI 1LWg== X-Gm-Message-State: AOAM533Us11Uev3Bd54rdZotvwpimI+5jdmjPdN97uiSn4/OctiTHFD1 y/9/8nHUiHPROCffH7nOaDTgQbqJq7U5lbjFhcCXjfIEWuo= X-Google-Smtp-Source: ABdhPJwwTBIP6YmGCPgu1J/on+o1Y31EsfndtI3+NbJ3lvrHAIaycdoJQcJGGNMBHVxf2d91/dXa8kUnPS3zWsowy0Q= X-Received: by 2002:aa7:8199:0:b0:3fc:ae11:badd with SMTP id g25-20020aa78199000000b003fcae11baddmr3006785pfi.12.1630585947388; Thu, 02 Sep 2021 05:32:27 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:a05:6a20:3f9c:b0:56:fd24:af with HTTP; Thu, 2 Sep 2021 05:32:26 -0700 (PDT) In-Reply-To: References: <3C2596FB-B368-4A1D-8974-F8F63A99E671@gmail.com> From: hiro <23hiro@gmail.com> Date: Thu, 2 Sep 2021 14:32:26 +0200 Message-ID: To: 9front@9front.org Content-Type: text/plain; charset="UTF-8" List-ID: <9front.9front.org> List-Help: X-Glyph: ➈ X-Bullshit: encrypted map/reduce SVG over SQL method injection-aware replication blockchain Subject: Re: [9front] htmlfs Reply-To: 9front@9front.org Precedence: bulk after writing my last email i googled xpath, and realize that i only ever got to see a subset of it's insane complexity (the simple path notation that includes a way to specify this n-th-element of a type, which i have seen used a lot in practice by adblockers and anything that needs to scrape content from websites that don't supply meaningful element names. contrary, i have indeed seen that some websites randomize their element names to prevent this kind of javascript-free processing. so yes, our low effort will not help with websites that really don't want to be scraped... On 9/2/21, hiro <23hiro@gmail.com> wrote: > On 8/31/21, Pavel Renev wrote: >> I have a half-backed DOMfs: >> http://git.nsmpr.xyz/domfs/files.html >> but it just represents documents as a flat list of numbered nodes (the >> way >> rio serves its windows) and their hierarchy is provided through a >> separate >> file. >> >> The challenge with xml/html is that unlike traditional file trees their >> elements do not have unique names and instead addressed by their order. >> Additionaly, element's attributes often play bigger role than text data >> they >> contain. >> Style also can override tree hierarchy when it comes to rendering, and >> when >> it comes to javascript, programs look up needed elements via global >> search >> by id and usually only care about element's immediate parent/children. >> >> TL;DR: the tree is a lie. >> Maybe serving html via some kind of database query interface would be >> better. >> > > why not do like xpath? numbers can signify order. > we don't support javascript anyway, so the tree wouldn't really change > under our feet... >