From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32059 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: ChrisD Newsgroups: gmane.text.pandoc Subject: How to process chunkedhtml output with Lua Date: Sat, 21 Jan 2023 06:42:04 -0700 Message-ID: <35211aad-9b34-1c74-b25f-c2c3777da632@intielectronics.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="25296"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.6.1 To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDHMHIETYAGRBLGXV6PAMGQEJ3OPECQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Sat Jan 21 14:42:11 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lf1-f60.google.com ([209.85.167.60]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1pJE8J-0006RW-5c for gtp-pandoc-discuss@m.gmane-mx.org; Sat, 21 Jan 2023 14:42:11 +0100 Original-Received: by mail-lf1-f60.google.com with SMTP id d2-20020a0565123d0200b004d1b23f2047sf3264075lfv.20 for ; Sat, 21 Jan 2023 05:42:11 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1674308530; cv=pass; d=google.com; s=arc-20160816; b=gTpVblF8jz/SCljgRRvO4wETZn8PUKH7EJeg8hqB+F2/m23lLqq4NDJ6+PuGUJBQdB qMSQCjoTX/LaI3mwZ5/EkycIz3hhtKGHgcc1VeWRqXfJfUN8ZG8jcL8dhIXNTvjGAAeb DGwi2UgoEzxgI10kQth+WwUKBc7O+t1i9EsipOMGjuE+TWJT/WEIWiahdWyFzyTZSYWi +Mp1UmEkld7FiTPRk+q3mQdOj7o6NzCjn8kfOEv4VZ5HUhvojieTj0vl5dy8tGrtA/Fa FLiGfIvJ9usZu5aYbLhCDEwO+HmscvIMmY07PLVM1QIjV6iWlhxrlvXhe8Drs5eslkSV aTmQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :subject:from:content-language:to:user-agent:mime-version:date :message-id:sender:dkim-signature; bh=teJHQRAFIawEtWt8XhHaV6D+2LwTmvSsai25LvtkqQ8=; b=GEAYlG2qacxQ/fn7V6u5jDhrp3/KJcaFQ7d38o8hWsbHszzQrrDh2KVS1QrceMc6RE cW2u+6cfJZeTO/jUHjDVk+BR6cOoHgrQ6X/a2iTIcoheY6ZaKJe3M+zq3tJdd9mKK4/M X3NPb2pjh5LFVg9ICqZfdCd9AVRVImI/7A5aKmCearAxRgdjF2vyeL1BczYNfqpGD9O3 oaoXqpJMlUNi2ADWgIPjgH+CAkf31vsrYkf84rFxqx3TSfVv1/GbSl4UB/tZPt9tNDHx GUVYOj4WXQkkbn4lB6HcTxvriwDxldLDCS9qPTOOtMIUcD23EZcRIuITf7yhlDlHRgTB gAtg== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender :content-transfer-encoding:subject:from:content-language:to :user-agent:mime-version:date:message-id:sender:from:to:cc:subject :date:message-id:reply-to; bh=teJHQRAFIawEtWt8XhHaV6D+2LwTmvSsai25LvtkqQ8=; b=JeD57au3BgG09flnHEgyeuPh9t2PSTAs3dbsuXOmQr/vtE8hsFEYctpCtG0s7iosFk rzsCrd2i0XJybBGIwLFNO5WeZEP5mDgmOjE4ByKM+BxOA/oWw2X5VHvbv+BCHs3XtnRF D7PPMtcRNmxfNWiECncV48rdiiIB347jg4ttPmKy9dyfGTxXJQH66yhzu7il+qBgrNrh LuFs4/xaODuIDel2OE5ROCv7pEWgpPSm4iT3uQfLzbnX97jmbPPNlUgxvBJ7KTUsgzW1 TluTFULE4qBCGo+gjV2p3WFwPVkUzMGVqbE7K9TqyKSEH6StljwWaT83XvbX X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender :content-transfer-encoding:subject:from:content-language:to :user-agent:mime-version:date:message-id:x-gm-message-state:sender :from:to:cc:subject:date:message-id:reply-to; bh=teJHQRAFIawEtWt8XhHaV6D+2LwTmvSsai25LvtkqQ8=; b=7h/sFlA+IBU5bIRcW+khNYWujm3d2oSbFCwz6nFuF0mlDWmvgbSN3isFcW0UDDrPeM Igqi1llxkFvNQLtvPGN+kijVlILK7jl4I8KTniPr3lkH+yBfSXBySqjTb+KdVKGGEBha 5uJgOsnf0dr6NhjAieKhWcbD8dR3D/+d4GxgqdC+D6rIUoOfM08I6TuBDoQbTY7JlU3s QNddLzOaCvsosH8gyFeLDbcRT8H3m8kjXpb32tALeeaWDuRbKvXw1+80hzu3F5L3tP7V ulCZwf1AnHqwX2j Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AFqh2krSKlq133zlZEN5fuU/lShldZPyhxzoPjCPhoxmzYcQOEApxvQM YqUgXDP6UbfDwmGU3P5aq2U= X-Google-Smtp-Source: AMrXdXvbqA09j+KIs/Z8gfMiida99EBezo2dPclGoLHQwOgd79FYyrw3/JHjfuRb2qLEaTa7UN2RkQ== X-Received: by 2002:a05:6512:1184:b0:4cc:db38:d84e with SMTP id g4-20020a056512118400b004ccdb38d84emr966380lfr.180.1674308530477; Sat, 21 Jan 2023 05:42:10 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6512:214f:b0:4c8:8384:83f3 with SMTP id s15-20020a056512214f00b004c8838483f3ls2296285lfr.3.-pod-prod-gmail; Sat, 21 Jan 2023 05:42:03 -0800 (PST) X-Received: by 2002:ac2:5209:0:b0:4d5:7be5:ba22 with SMTP id a9-20020ac25209000000b004d57be5ba22mr4862434lfl.58.1674308523112; Sat, 21 Jan 2023 05:42:03 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1674308523; cv=none; d=google.com; s=arc-20160816; b=OODBYaaWK+XGwB4BwZGU7G5mnyZCCIwVzd6t8OA/Peg4+UHTvvehon0NpGIyTsYhWo zt8fUs5wc/eWR0QVw8fcam/j+PRFvMHfie6JlIofNDUgSzELFWY+iKgW0d5hiSGvGEuD 7VqciwgUJWLNkpLQS8t5zRBZ/NUVYIAK3fSJ/dkzcxHUlZ9Fujd51r58LrwkIM9H9Pl7 KQ8MJsIaCIXINQDzah3nFOJw/f+f5DJztg68lZquFluOwX+SZyuaDNFePc3yVfcIr0ms MwCoFSqlqvJsLnsK5ra+zlI0ypgnoSHXcIoPrm31f4eWckwHqInnuwt5KvKBxdOycCEm CIUQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:subject:from:content-language:to :user-agent:mime-version:date:message-id; bh=VBd2YxgrLbdvQdjc0m0WUfjK/6p8fHTiVb/9NK9jD+M=; b=Pf/Zzl3lVDaukOdKQsrQX5OxenPAjTuy0Q2dkVB0PLNpczHOZPFTMvFJNzqjLEHN2C O2PUvfcVZ9u9y0ytK3UUkuPw0Yu1bPlf/qG+RYaYiRtdnNaenowcJipXH5UdwqCPus0y j6aa1hYPiPIRs1gzIdeRazyiJ1NLXkPg6VnOh9WiIqzIkAxl3Ohmq6yf9CLyRgLLK706 McLNEqi72G759POcSo4/X15wgT0MiUNT85loM2eczXbbop0Tj5HcYotP5SJDfmR6kmRd VYdLx3EsdobKDFqFPZd7K/az0BbtYXOH+VyGL7V0dZ/vWhzyd3aSw+0LF4Z7mVXdS44P NzHA== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org Original-Received: from hamza.pair.com (hamza.pair.com. [209.68.5.143]) by gmr-mx.google.com with ESMTPS id s15-20020a056512214f00b004b58f5274c1si1623454lfr.1.2023.01.21.05.42.02 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 21 Jan 2023 05:42:02 -0800 (PST) Received-SPF: pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) client-ip=209.68.5.143; Original-Received: from hamza.pair.com (localhost [127.0.0.1]) by hamza.pair.com (Postfix) with ESMTP id E814433E86 for ; Sat, 21 Jan 2023 08:42:00 -0500 (EST) Original-Received: from [10.104.138.18] (static-198-54-133-152.cust.tzulo.com [198.54.133.152]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by hamza.pair.com (Postfix) with ESMTPSA id AA5B733E7A for ; Sat, 21 Jan 2023 08:42:00 -0500 (EST) Content-Language: en-US X-Scanned-By: mailmunge 3.10 on 209.68.5.143 X-Original-Sender: cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org designates 209.68.5.143 as permitted sender) smtp.mailfrom=cd34-gg-4SSc53hpTiu9TMao6EloiEEOCMrvLtNR@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32059 Archived-At: The thread "Lua filter to process chunkedhtml output" has gotten kind of si= de tracked with a discussion of the logging module and how data is represen= ted. Which is fine, this is good info and I appreciate the improvements to = logging. But I'd like to get back to the question of how to process chunked html out= put into other formats with Lua. I don't understand what data is available,= when it's available, and what data structures are used. 1. Lua filter: I'm looking for the list of all files that will exist in the= output folder, and the table of contents (essentially the data in sitemap.= json). Is that data even available at the time a filer runs? If so, how do = I access it? 2. Lua custom writer: If it can't be done in a filter, can it be done using= a custom writer? Where would I find the relevant data? 3. Post-processing with Lua: Pandoc can now be run as a Lua interpreter. If= neither (1) nor (2) is possible, I'm thinking I could run pandoc normally = to produce a chunked html output folder, and then run pandoc again with a L= ua script that finds all the files in the output folder, and reads sitemap.= json. This is the same approach as doing the post processing in some other = language, except that we don't have to have any additional tools installed.= Is there anything that would prevent this approach? Thanks, --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/35211aad-9b34-1c74-b25f-c2c3777da632%40intielectronics.com.