public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed
From: Ken Perry <kperry-rhKcP+tiLR7by3iVrkZq2A@public.gmane.org>
To: pandoc-discuss <pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
Subject: Parsing a list
Date: Wed, 15 Feb 2023 20:50:11 -0800 (PST)	[thread overview]
Message-ID: <39a03df1-6228-449f-8f20-1b5b92248cecn@googlegroups.com> (raw)


[-- Attachment #1.1: Type: text/plain, Size: 3513 bytes --]


We use Pandoc as a backend for a Braille word processor. to support file 
types that we don't currently support.   We have most of the filter working 
for our file type but the lists are not working right   I can't seem to get 
the nested lists to come out right.  I am trying to get the attributes of a 
list in a LUA filter to give me the same information I can see when I 
output to JSON.  Currently I am using an ugly list like this to test with:

Broken list 

 

1. Item 1

2.  Item 2

3.  Item 3

1.  item 7

2.  Item 8

a. Item a

b. Item b

                     i.   Item i

                   ii.   Item ii

1. Item I

2. Item ii

3. Item 9

4. Item 10



When I run it in the native json I get :


,Header 3 ("broken-list",[],[]) [Str "Broken",Space,Str "list"]
,OrderedList (1,Decimal,Period)
 [[Para [Str "Item",Space,Str "1"]]
 ,[Para [Str "Item",Space,Str "2"]]
 ,[Para [Str "Item",Space,Str "3"]]]
,OrderedList (7,Decimal,Period)
 [[Para [Str "item",Space,Str "7"]]
 ,[Para [Str "Item",Space,Str "8"]
  ,OrderedList (1,LowerAlpha,DefaultDelim)
   [[Para [Str "Item",Space,Str "a"]]
   ,[Para [Str "Item",Space,Str "b"]
    ,OrderedList (1,LowerRoman,DefaultDelim)
     [[Para [Str "Item",Space,Str "i"]]
     ,[Para [Str "Item",Space,Str "ii"]
      ,OrderedList (1,Decimal,DefaultDelim)
       [[Para [Str "Item",Space,Str "I"]]
       ,[Para [Str "Item",Space,Str "ii"]]]]]]]]
 ,[Para [Str "Item",Space,Str "9"]]
 ,[Para [Str "Item",Space,Str "10"]]]]


I am trying to parse this with LUA with the function:


function OrderedList(items)
  local newItems = {}
  local listItems = ''
  local itemCtr = 1
  beginTag  = '<CONTAINER bb:type="LIST" bb:listType="NORMAL" 
bb:listLevel="0">'
   endTag    = '</CONTAINER>'
  for _, item in pairs(items) do
    if nil ~= item and string.len(item) > 0 then
      item = removeTags(item)
      -- listItems = listItems .. '<BLOCK bb:type="LIST_ITEM" 
bb:itemLevel="0">'.. itemCtr .. '. ' .. item .. '</BLOCK>'
      print ('fuck' .. itemCtr .. '. ' .. item) 
      table.insert(newItems,'<BLOCK bb:type="LIST_ITEM" 
bb:itemLevel="0">'.. itemCtr .. '. ' .. item .. '</BLOCK>')
      itemCtr = itemCtr + 1
    end
  end
  newItems.bullet = true
  return BulletList(newItems)
end

I know I can adjust the above LUA if I cna just figure out how to get the 
start value, and the style out of the native into the LUA.  Can someone 
tell me the line of code I need to ge the attributes.  I have tried things 
like:


items.attributes

items.listAttributes


I even checked the item before I output the json and saw that tthe 
attributes are on the main list.  I can't seem to get the LUA function to 
see the things like starting value and I don't know what I have to do so 
that I can parse the levels correctly.  All help is welcome.  Heck if 
someone has something that can take the above docx list and output it as a 
text list that is all I need.  I can wrap that in our tags.    I  have been 
looking for any examples of LUA filters that make multi  level broken lists 
into anything and I can not find an example.  


Thanks for any help.



 

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/39a03df1-6228-449f-8f20-1b5b92248cecn%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 4496 bytes --]

             reply	other threads:[~2023-02-16  4:50 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-02-16  4:50 Ken Perry [this message]
     [not found] ` <39a03df1-6228-449f-8f20-1b5b92248cecn-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2023-02-16  6:38   ` Bastien DUMONT
2023-02-18  7:41     ` kperry-rhKcP+tiLR7by3iVrkZq2A

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=39a03df1-6228-449f-8f20-1b5b92248cecn@googlegroups.com \
    --to=kperry-rhkcp+tilr7by3ivrkzq2a@public.gmane.org \
    --cc=pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).