docx reader not capturing document's metadata

public inbox archive for pandoc-discuss@googlegroups.com

* docx reader not capturing document's metadata
From: dorice @ 2017-04-13  9:57 UTC
  To: pandoc-discuss


Hello,

I recently discovered Pandoc, and I am trying to use it to parse and
convert my docx documents.
In my document I have set some metadata (title, author, and custom
properties).

Converting the document to pandoc's native format yields a file with the
following meta block:
Pandoc (Meta {unMeta = fromList []})

Can anyone tell me how to get pandoc to capture the document's
metadata?

Thanks

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/e3f3103b-9da7-4c8f-b543-c46e8d64db75%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


* Re: docx reader not capturing document's metadata
From: John MacFarlane @ 2017-04-15  9:48 UTC
  To: pandoc-discuss

See this issue:
https://github.com/jgm/pandoc/issues/3034

@jkr may be able to confirm this, but I think currently the
docx reader takes title, author, abstract, etc. from
paragraphs with Title, Author, Abstract, etc. styles, rather
than from document metadata.
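
As a possible workaround, the properties can be read directly from the
.docx container, which is a ZIP archive whose core properties (title,
creator, and so on) normally live in docProps/core.xml. The following is
only a minimal sketch, not part of pandoc: the function name
read_core_properties is hypothetical, it looks only at the Dublin Core
elements of the core properties part, and it ignores custom properties.

```python
import zipfile
import xml.etree.ElementTree as ET

# Dublin Core namespace used for title, creator, subject, description
# in the core properties part of an OOXML package.
DC_NS = "http://purl.org/dc/elements/1.1/"


def read_core_properties(docx):
    """Return the Dublin Core fields found in docProps/core.xml.

    `docx` may be a file path or a binary file-like object.
    Hypothetical helper for illustration; returns {} when the
    package has no core properties part.
    """
    with zipfile.ZipFile(docx) as archive:
        try:
            xml_bytes = archive.read("docProps/core.xml")
        except KeyError:
            # The archive has no core properties part.
            return {}

    root = ET.fromstring(xml_bytes)
    prefix = "{" + DC_NS + "}"
    props = {}
    for child in root:
        # Keep only non-empty Dublin Core elements, keyed by local name.
        if child.tag.startswith(prefix) and child.text:
            props[child.tag[len(prefix):]] = child.text.strip()
    return props
```

The values returned this way could then be handed to pandoc explicitly,
for example through its -M/--metadata option, when converting the file.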

+++ dorice [Apr 13 17 02:57 ]:
>   Hello,
>   I recently discovered Pandoc, and I am trying to use it to parse and
>   convert my docx documents.
>   In my document I have set some metadata (title, author, and custom
>   properties).
>   Converting the document to pandoc's native format yields a file with the
>   following meta block:
>   Pandoc (Meta {unMeta = fromList []})
>   Can anyone tell me how to get pandoc to capture the document's
>   metadata?
>   Thanks

* Re: docx reader not capturing document's metadata
From: dorice @ 2017-07-24 14:29 UTC
  To: pandoc-discuss


Hello John,
Sorry for the late reply.
Thanks a lot for the clarification.
Dorice

On Saturday, April 15, 2017 at 11:49:08 AM UTC+2, John MacFarlane wrote:
>
> See this issue: 
> https://github.com/jgm/pandoc/issues/3034 
>
> @jkr may be able to confirm this, but I think currently the 
> docx reader takes title, author, abstract, etc. from 
> paragraphs with Title, Author, Abstract, etc. styles, rather 
> than from document metadata. 
