From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32097 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: ChrisD Newsgroups: gmane.text.pandoc Subject: Re: How to process chunkedhtml output with Lua Date: Thu, 26 Jan 2023 10:41:50 -0700 Message-ID: <621a843e-049e-1a2b-1c60-df3158b6dc2e@intielectronics.com> References: <35211aad-9b34-1c74-b25f-c2c3777da632@intielectronics.com> <84b97b97-8fe6-fb71-7d97-6ee0733b5763@intielectronics.com> <3F114306-007A-47CB-A067-3F7EE07900B0@gmail.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="39657"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.7.0 To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDHMHIETYAGRBY7WZKPAMGQEWJKDQEA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu Jan 26 18:42:00 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lj1-f190.google.com ([209.85.208.190]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1pL6G8-000A9E-8I for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 26 Jan 2023 18:42:00 +0100 Original-Received: by mail-lj1-f190.google.com with SMTP id 17-20020a05651c009100b0028f23beb02bsf600615ljq.13 for ; Thu, 26 Jan 2023 09:42:00 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1674754919; cv=pass; d=google.com; s=arc-20160816; b=M0hJ6VbQdEMA2PDdBzWcVdLTGROQOQVg4kGcyNsUu91kvnDbBvaAAQgqul0iijtrGD 3Ii1Q4gVPHTEAvy/pxGoI1Bt4LJ9nyBtv4JOQGFfYwVeJ764s7mXcF/hdlxMOkxJ38ZX rEUROOX/VKS5sxLcuTrTWbOHY0JtGiAk6BWHn3UcI60K/pC24cmpxbiL4oqMdw4ScE+v 1OCMtMf1dkAeRxdUMfrqOuXDVyuXACROrE0580dKfP+ZEubzaLvqxteYXQi8adRLPB/k 1NmcGT2QUyCO803UhzaXbTeuVBFTbgEt3S+CTZlXPvznHXYvjqGpV40cOaly+xzfu6Rw 9aBw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :in-reply-to:from:references:to:content-language:subject:user-agent :mime-version:date:message-id:sender:dkim-signature; bh=jvLGeAV3djt+hP0iK9/8hgvyza4eqPkE7ysvDEoHwKI=; b=lq0LOAAFmTJ4XGiOnNdM76aLQadOXVxvjiCDSbeEIICM0lCY/LPt1RzHcUSiIF/MY8 dq02VInyh5CWL8jPYjg7bB4kD1PVOGKwXDlRtjgjsdRqMhJ5ii16JYVhYvBArz5LFrLk rXXef4mjrb1KOwF2YbYTVMOBbHYf6X89J3qk63gT/l4MiGQO5vcZWaJibMpVhZXdgiGY wHnOAXiod3Ny94QVWOJ5+o2y4jOpgoJHIkwWsH/V4ssmKDMc3Tt61i3o8zL099tJ0AOy sqveexwVraDObrEmGf2cq1ApOMSEmY/KmXJXIH5/fSISBZHVQ4WYUbF5UWPF5Shecybg 7EtQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender :content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :sender:from:to:cc:subject:date:message-id:reply-to; bh=jvLGeAV3djt+hP0iK9/8hgvyza4eqPkE7ysvDEoHwKI=; b=t/mxzqXtpyeEM+fOxOv0utTU09dNjldKPvfGc95gdN8O1RqqfRK8yQ9ddCyTOLX6kv a2ERAUguJOs8XKiPC5E9+rxlC35Y7ONv3nuu9bYYECVu3taLkwxeM10t1YbhZB5CkcE2 4Rrp1fkuj3BDqDPZu36UvQjWtqM0CP55DxPMoZJFEcbSZxTir56AVz8DkPCOe4RaJdR/ J68nNFL5H5BdoGuA7FAZTLLfoCzb2aQU3lrKKRfl5VB53LATij1y+NK8fOuxAAZ4Nzsm xEdvjdzmahjCQGRhawK5rX5qtEQikQ3Eio14l X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender :content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id :x-gm-message-state:sender:from:to:cc:subject:date:message-id :reply-to; bh=jvLGeAV3djt+hP0iK9/8hgvyza4eqPkE7ysvDEoHwKI=; b=TB3atqvrL5ans3OgEQseQ91KvkaeygDoFYM2CKd4GcMAVd+NYv4SCjisRJtTe5OjIs 5tWDxVWYtS3k6IIU5weZV/P3p54rYcS7+wVYUBknkMVoqBxVPOW1rZOGGW96J01opyKh TOnSOxnZsEnS+FUPAcrJvHrpW38FUmisLVBnZSlrnCUplXQBwmqah2+xWlWJDoxUtTao QsazrbcBPqxeekjr2G/C6DW6Ia6k3QSxQvzGvgKvARwkouw4VoPZv1YoH/kB Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AFqh2kpM2c+chpUt593/Ltp0u9A39pZA5z+6V1g79XqaVSTyAN4W7HR+ ndUacTpW0y4DzzwCllgz5lg= X-Google-Smtp-Source: AMrXdXuSqhuS72RuYwrbs8jangFht2ZItAMzuiJEFD0wZOD9XqOiylsdVK1YCdw7Lqt0ADiCjLBEKw== X-Received: by 2002:a05:6512:3ca6:b0:4cc:53e2:5382 with SMTP id h38-20020a0565123ca600b004cc53e25382mr2129534lfv.220.1674754919549; Thu, 26 Jan 2023 09:41:59 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:ac2:4891:0:b0:4d1:8575:2d31 with SMTP id x17-20020ac24891000000b004d185752d31ls1904026lfc.0.-pod-prod-gmail; Thu, 26 Jan 2023 09:41:54 -0800 (PST) X-Received: by 2002:a19:430c:0:b0:4d8:4f53:37b2 with SMTP id q12-20020a19430c000000b004d84f5337b2mr517808lfa.1.1674754914750; Thu, 26 Jan 2023 09:41:54 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1674754914; cv=none; d=google.com; s=arc-20160816; b=dkEFbVavcqaCk9hj54ZEJTwR+DvmQtD8nZfE8wA6VxPEIQUdS/1DU0rTmkQNVkTNcM qOGdZqbhB7S9i5IUsOJdrnv/8uvagBeKEiBjgjj5OTdSlUU581hhtuliQmVyfcou4UnV 5moYMHVfnjRhMH4geywyeY5sVuL5i/l4tup0fkO8jF6JihNav1QrjBXEi84oSoxvSIOy deNbDiHKxPSpT+bDXTsq1LvuTdRDBt9kez+4L/W/E6F0NH6sBXAWFlJHLUoievQ8a4OG deO2i+5cSP/5f4tWWkaCB29d8lBSw+ewkC3kyGZM/5ekt5ehFvvsFi7pPhwZv+u8x5PU cRvw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id; bh=oMtuxh0d+tBevLE+NC4dlrcc2GnbU9aQLCRnsCuAsLk=; b=ye9pyplxPvkMNuzRPU+sBPilPiydvmQ1MzlXc7rqrqllXZ1NrMvcYy2WPSx/Y8Wwhl 2/ZJLDZvwLqZBf2cW/2Bs5vEWnyi37P4XUnmKY1Yws4ZAznOl5s05fYkpihOpimYYwuQ oU+EoOe8zB4F2Z7XHCuinpB94sayOft4jJnPiCi0tSZDi8GFmxc1+uwSU8ytSEJ/+83B kB1Jv/BkUawE6z6LjYhOFYMi+abcbJVOi9KRj6zG4HZYcs0IvFJu4jVL6ZSPQYB21KfY PlQ1ZBcgpBDKe3D5nK7dHYqGC4gK4nOT22lneSTgJNwsyp9gn++aTdPbfabYFXsEA6zC +Szg== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org Original-Received: from hamza.pair.com (hamza.pair.com. [209.68.5.143]) by gmr-mx.google.com with ESMTPS id x8-20020a056512130800b004ce3ceb0e80si104161lfu.5.2023.01.26.09.41.54 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 26 Jan 2023 09:41:54 -0800 (PST) Received-SPF: pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) client-ip=209.68.5.143; Original-Received: from hamza.pair.com (localhost [127.0.0.1]) by hamza.pair.com (Postfix) with ESMTP id 8832C33E65 for ; Thu, 26 Jan 2023 12:41:52 -0500 (EST) Original-Received: from [10.104.138.18] (static-198-54-133-88.cust.tzulo.com [198.54.133.88]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by hamza.pair.com (Postfix) with ESMTPSA id 4E37333E0B for ; Thu, 26 Jan 2023 12:41:52 -0500 (EST) Content-Language: en-US In-Reply-To: <3F114306-007A-47CB-A067-3F7EE07900B0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> X-Scanned-By: mailmunge 3.10 on 209.68.5.143 X-Original-Sender: cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32097 Archived-At: On 1/25/2023 5:50 PM, John MacFarlane wrote: > 3) Is there a simple way to get a list of files (including the image file= s) that will be included in the chunked html output folder? Maybe I can gen= erate this from the ChunkedDoc, but it's going to take some parsing. > It should be easy to get the non-image files from the ChunkedDoc. Then t= here=E2=80=99s index.json. What is index.json? If you mean sitemap.json, that doesn't exist yet, and i= t doesn't include the image files. > Image files, not so easy. I'm thinking this task may be easier to do as a post-processing step, rathe= r than as a filter. I'll have sitemap.json, and I can generate a list of fi= les from the output folder or zip file. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/621a843e-049e-1a2b-1c60-df3158b6dc2e%40intielectronics.com.