From: Albert Krewinkel <albert+pandoc-9EawChwDxG8hFhg+JK9F0w@public.gmane.org>
To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
Subject: Re: Filtering entire files based on markdown list item
Date: Fri, 11 Sep 2020 18:19:29 +0200
Message-ID: <87h7s4l20e.fsf@zeitkraut.de>
In-Reply-To: <2dcd5988-9b74-47a9-80bd-ce4a64fee169n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
I believe that this would be possible, but not straightforward.
Pandoc doesn't preserve information about which file an element
originated in, so one would have to iterate over all files
manually.
E.g., maybe pass the files via a metadata value as a comma-
separated list. Then step through all file names, read their
contents, and decide whether to keep or discard the result.
Non-functioning pseudo code:
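function Pandoc (doc)
  -- categories to look for; helper still has to be written
  local categories = find_categories(doc.blocks)
  -- metadata values may be AST objects, so convert to a plain string
  local files = pandoc.utils.stringify(doc.meta.input_files)
  -- gmatch yields only the match itself, one file name per step
  for f in files:gmatch '[^,]+' do
    local fh = assert(io.open(f, 'r'))
    local contents = fh:read '*a'
    fh:close()
    local file_doc = pandoc.read(contents, 'markdown')
    if not contains_category(file_doc, categories) then
      doc = append_doc(doc, file_doc)
    end
  end
  return doc
end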
The functions `append_doc`, `find_categories`, and `contains_category`
would have to be written.
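A rough, untested sketch of what those helpers might look like. It
assumes the layout from your example, i.e. a "Group(s)" header directly
followed by a paragraph with a comma-separated list; the helper
`parse_categories` is a name I made up for illustration, and the
metadata of the appended files is simply dropped.

```lua
-- Hypothetical helpers for the pseudocode above. All names and the
-- "Group(s)" layout are assumptions based on the quoted question.

-- Split "Category1, Category3" into a set:
-- {Category1 = true, Category3 = true}.
function parse_categories (text)
  local set = {}
  for name in text:gmatch '[^,]+' do
    name = name:match '^%s*(.-)%s*$'  -- trim surrounding whitespace
    if name ~= '' then set[name] = true end
  end
  return set
end

-- Collect the categories listed in the paragraph that directly
-- follows a "Group(s)" header.
function find_categories (blocks)
  local set = {}
  for i, blk in ipairs(blocks) do
    local nxt = blocks[i + 1]
    if blk.t == 'Header' and nxt and nxt.t == 'Para'
       and pandoc.utils.stringify(blk) == 'Group(s)' then
      local listed = parse_categories(pandoc.utils.stringify(nxt))
      for name in pairs(listed) do
        set[name] = true
      end
    end
  end
  return set
end

-- True if the document lists at least one of the given categories.
function contains_category (doc, categories)
  for name in pairs(find_categories(doc.blocks)) do
    if categories[name] then return true end
  end
  return false
end

-- Append the blocks of `other` to `doc`; the metadata of `other`
-- is not merged.
function append_doc (doc, other)
  for _, blk in ipairs(other.blocks) do
    doc.blocks:insert(blk)
  end
  return doc
end
```

Only `parse_categories` is plain Lua; the other three rely on the
`pandoc` module and so only work when run inside a Lua filter.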
Cheers,
Albert
Henrik Klang writes:
> Hi,
>
> I have a number of markdown files that I concatenate with Pandoc. Each
> markdown file has a second level top header (##). The markdown files are
> specified in the variable $documents_list.
>
> pandoc --from=markdown --wrap=auto --to=markdown $documents_list >
> output.md
>
> In the next step I create a PDF:
>
> pandoc -f markdown+table_captions --pdf-engine=xelatex --listings -o
> output.pdf output.md
>
> What I want to do is to filter out certain markdown files in the first
> step. The filter shall be based on a list under a specific category in the
> markdown file:
>
> ### Group(s)
> Category1, Category3
>
> E.g. if *Category1* exists in the markdown file I want to filter the entire
> markdown file out.
>
> Do you think I could achieve this with Lua filters?
>
> Thank you.
>
> / Henrik
--
Albert Krewinkel
GPG: 8eed e3e2 e8c5 6f18 81fe e836 388d c0b2 1f63 1124