From: Ken Perry <kperry-rhKcP+tiLR7by3iVrkZq2A@public.gmane.org>
To: pandoc-discuss <pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
Subject: Parsing a list
Date: Wed, 15 Feb 2023 20:50:11 -0800 (PST) [thread overview]
Message-ID: <39a03df1-6228-449f-8f20-1b5b92248cecn@googlegroups.com> (raw)
[-- Attachment #1.1: Type: text/plain, Size: 3513 bytes --]
We use Pandoc as a backend for a Braille word processor. to support file
types that we don't currently support. We have most of the filter working
for our file type but the lists are not working right I can't seem to get
the nested lists to come out right. I am trying to get the attributes of a
list in a LUA filter to give me the same information I can see when I
output to JSON. Currently I am using an ugly list like this to test with:
Broken list
1. Item 1
2. Item 2
3. Item 3
1. item 7
2. Item 8
a. Item a
b. Item b
i. Item i
ii. Item ii
1. Item I
2. Item ii
3. Item 9
4. Item 10
When I run it in the native json I get :
,Header 3 ("broken-list",[],[]) [Str "Broken",Space,Str "list"]
,OrderedList (1,Decimal,Period)
[[Para [Str "Item",Space,Str "1"]]
,[Para [Str "Item",Space,Str "2"]]
,[Para [Str "Item",Space,Str "3"]]]
,OrderedList (7,Decimal,Period)
[[Para [Str "item",Space,Str "7"]]
,[Para [Str "Item",Space,Str "8"]
,OrderedList (1,LowerAlpha,DefaultDelim)
[[Para [Str "Item",Space,Str "a"]]
,[Para [Str "Item",Space,Str "b"]
,OrderedList (1,LowerRoman,DefaultDelim)
[[Para [Str "Item",Space,Str "i"]]
,[Para [Str "Item",Space,Str "ii"]
,OrderedList (1,Decimal,DefaultDelim)
[[Para [Str "Item",Space,Str "I"]]
,[Para [Str "Item",Space,Str "ii"]]]]]]]]
,[Para [Str "Item",Space,Str "9"]]
,[Para [Str "Item",Space,Str "10"]]]]
I am trying to parse this with LUA with the function:
function OrderedList(items)
local newItems = {}
local listItems = ''
local itemCtr = 1
beginTag = '<CONTAINER bb:type="LIST" bb:listType="NORMAL"
bb:listLevel="0">'
endTag = '</CONTAINER>'
for _, item in pairs(items) do
if nil ~= item and string.len(item) > 0 then
item = removeTags(item)
-- listItems = listItems .. '<BLOCK bb:type="LIST_ITEM"
bb:itemLevel="0">'.. itemCtr .. '. ' .. item .. '</BLOCK>'
print ('fuck' .. itemCtr .. '. ' .. item)
table.insert(newItems,'<BLOCK bb:type="LIST_ITEM"
bb:itemLevel="0">'.. itemCtr .. '. ' .. item .. '</BLOCK>')
itemCtr = itemCtr + 1
end
end
newItems.bullet = true
return BulletList(newItems)
end
I know I can adjust the above LUA if I cna just figure out how to get the
start value, and the style out of the native into the LUA. Can someone
tell me the line of code I need to ge the attributes. I have tried things
like:
items.attributes
items.listAttributes
I even checked the item before I output the json and saw that tthe
attributes are on the main list. I can't seem to get the LUA function to
see the things like starting value and I don't know what I have to do so
that I can parse the levels correctly. All help is welcome. Heck if
someone has something that can take the above docx list and output it as a
text list that is all I need. I can wrap that in our tags. I have been
looking for any examples of LUA filters that make multi level broken lists
into anything and I can not find an example.
Thanks for any help.
--
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/39a03df1-6228-449f-8f20-1b5b92248cecn%40googlegroups.com.
[-- Attachment #1.2: Type: text/html, Size: 4496 bytes --]
next reply other threads:[~2023-02-16 4:50 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-02-16 4:50 Ken Perry [this message]
[not found] ` <39a03df1-6228-449f-8f20-1b5b92248cecn-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2023-02-16 6:38 ` Bastien DUMONT
2023-02-18 7:41 ` kperry-rhKcP+tiLR7by3iVrkZq2A
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=39a03df1-6228-449f-8f20-1b5b92248cecn@googlegroups.com \
--to=kperry-rhkcp+tilr7by3ivrkzq2a@public.gmane.org \
--cc=pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).