From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/22089 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: Nyoman Bennyamino Newsgroups: gmane.text.pandoc Subject: Re: docx -> markdown & Citavi Content Control Date: Tue, 12 Feb 2019 10:47:22 -0800 (PST) Message-ID: <36b188d6-6558-4458-832c-c6ba8ec6c584@googlegroups.com> References: <87d9e5ab-3b83-46cf-a538-a6f2308454d1@googlegroups.com> <87o97gvrgw.fsf@jhu.edu> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_1906_2036633246.1549997242332" Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="227716"; mail-complaints-to="usenet@blaine.gmane.org" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCY2337C5EMBBO5JRTRQKGQEWPJTJUY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Feb 12 19:47:26 2019 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-ot1-f62.google.com ([209.85.210.62]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.89) (envelope-from ) id 1gtd5Z-000x65-PD for gtp-pandoc-discuss@m.gmane.org; Tue, 12 Feb 2019 19:47:26 +0100 Original-Received: by mail-ot1-f62.google.com with SMTP id o13sf3444901otl.20 for ; Tue, 12 Feb 2019 10:47:25 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=VcNhERCIAz8xMIYKtYdCQFPvMJoOtddhBa6kdYt2CfY=; b=YWtxITmix0ivPGSJ2jTchrkke3ZfwNWDxOGjAXAwDvPQgUegn5BTm+eAhWiVet8N3I MXT3OwkaSUiqlYoO0auZLXRi65wAVDbUk3TiARst8Tg++1Lpc6T48Xw1jh3DZWAzDYmw 2vKiT1HmyRJJrNR9UlcmzLNaiM4Hzo4pkxmWvsYI8GrulskDVy8EeOlgj/V7YovjyATL 2d8xtLylKDoMySzDfFQVQ34wAT0k+IuN4AqoJxnXxD1S8uaUYRom9KWtFKTGz/Rm7ew+ nMhib6jiZRhF+aziGGhsLymWrttBb3Rahg2G8yezV/hYnT+jDieRw7SGgcF5ajEqySS5 ppmw== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=VcNhERCIAz8xMIYKtYdCQFPvMJoOtddhBa6kdYt2CfY=; b=aCl/ruwwjmDnAeWzlnv20uO3WSfzdecF5zYoxFvkiMKaowfgFkJN6xBF1KuC8231vz Po1kLSPJWVuANtB51SSq2tRX6nEU9ond0RqSWbZmL4u4wOKfHaPnyVaizzID0IVE9umI V6Vjbws3lvbQQJhUraBp2XuN3dHC7AcqdThc1s1IYKP3FYvYcKmx3IH/pn6j/M42gpt+ Es5r21J40VSx5E9ff2kwGX8Hn8FM4dEISgDfcBZYo3Yk9ImLHjpLU7NaevIDpKG+b6Tb rC1iN/CwJ367nfJHBBz1xuKp1KTJx45p9q0dl7q2yr+LhFvMoHbwJ1LQDUiyl1Ck0jOU E5Zw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:in-reply-to :references:subject:mime-version:x-original-sender:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=VcNhERCIAz8xMIYKtYdCQFPvMJoOtddhBa6kdYt2CfY=; b=DEr6W2v/2DIRSdwGgNqVHEeQ7Vr6jbxQPdAmZ7mKDoVmszTLGfmbswf8DsKAdFlNWE H4dCM2sYuCOREz3nudyz7Wifv1yOw/xa+8FxFjLmIe36Wsvi2zHsJYSVKZTYnA/1vtHp Tr12bgn+kl8C9kpxaCXgt/GLZLqwOFn1MleeK73lmDZ8x6kRxaaZh5TzqDKcyCq1KdjZ 4rHFI3C2OUl4XqzB/wWmRdfus9TpivLDAayahZk11lBchCaRRTL+olgGBgIYvrOFRg2i X+vqias3LRKkP+AAcwLqnUx7In1nFWDW7B4yQIe1n6Bi9LB7IgASZnqoG6sc2Qt9bp1M HIUA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AHQUAuYFQVwYGGVoCqQk/J6bcnLAPj7MO2spMak5ZxB7UffYUhkdaDpl Z0q3pDL3m7QZs3NTkx1god8= X-Google-Smtp-Source: AHgI3IbaeC5UWFRADYWvWGZaL0iFCX4NeusFQ/nHkXO5bk5uv0fhuunm0Ki5gkDwhgC8FUMYh50M+g== X-Received: by 2002:a9d:2c22:: with SMTP id f31mr128458otb.4.1549997244577; Tue, 12 Feb 2019 10:47:24 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a9d:5619:: with SMTP id e25ls11230640oti.9.gmail; Tue, 12 Feb 2019 10:47:23 -0800 (PST) X-Received: by 2002:a9d:19a4:: with SMTP id k33mr129329otk.5.1549997243199; Tue, 12 Feb 2019 10:47:23 -0800 (PST) In-Reply-To: <87o97gvrgw.fsf-4GNroTWusrE@public.gmane.org> X-Original-Sender: nyoman.bennyamino-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:22089 Archived-At: ------=_Part_1906_2036633246.1549997242332 Content-Type: multipart/alternative; boundary="----=_Part_1907_2041921516.1549997242333" ------=_Part_1907_2041921516.1549997242333 Content-Type: text/plain; charset="UTF-8" Thank you all very much! I've submitted a bug report on GitHub. Cheers! Am Dienstag, 12. Februar 2019 18:36:50 UTC+1 schrieb Jesse Rosenthal: > > The instructions are in `instrtext` inside `fldchar`, which we do > support (at least as far as parsing text). So this looks like a bug. It > would be great if you could submit it to the github issue tracker. > > When you do sumbit a bug report, please make sure to note what version of > pandoc you're using (support for this was only added last year), as well > as a copy of your input docx. > > --Jesse > > Nyoman Bennyamino > writes: > > > Hello, > > > > I'd like to use pandoc to convert a MS Word 365 (*.docx) file to > markdown. > > My reference management program (Citavi) is using Word Content Control > > fields to inject references into footnotes. Unfortunately, pandoc is > > omitting the all references instead of at least converting the > references > > into plain text. > > > > For example: > > > > One of my footnote is: "92. See Author, New York 2018, p. 18" > > > > The source code for this footnote (from Word's footnotes.xml file): > > > > > w:rsidR="000B6667" w:rsidRDefault="000B6667" > > w:rsidP="00F42B65"> > w:rsidRPr="002C10AE"> > w:val="Funotenzeichen"/> > w:val="baseline"/> > xml:space="preserve"> > > > w:val="CitaviPlaceholder#ecdec0e6-4e94-491b-a6d1-74308e00ccd9"/> > w:val="827022648"/> > > w:val="B8F488EF9BC34FE5B5D83FE3A9CB28BD"/> > > w:fldCharType="begin"/>ADDIN > > CitaviPlaceholder{ey....AifQ==} > w:fldCharType="separate"/> > xml:space="preserve">*See* > > w:rsidRPr="00BB051A">Author > > w:rsidR="00BB051A" w:rsidRPr="00BB051A">*, **New York > 2018* > w:rsidR="00BB051A" w:rsidRPr="00BB051A"> > w:val="superscript"/>81 > w:rsidRPr="00BB051A">*, p. 18.* > w:fldCharType="end"/> > > > > Output after converting: > > > > pandoc -f docx "test.docx" -w markdown_strict --reference-location > "block" > > > > "92. " > > > > > > Any ideas how to fix this? > > > > > > Thanks, > > > > Nyoman > > > > -- > > You received this message because you are subscribed to the Google > Groups "pandoc-discuss" group. > > To unsubscribe from this group and stop receiving emails from it, send > an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org . > > To post to this group, send email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org > . > > To view this discussion on the web visit > https://groups.google.com/d/msgid/pandoc-discuss/87d9e5ab-3b83-46cf-a538-a6f2308454d1%40googlegroups.com. > > > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/36b188d6-6558-4458-832c-c6ba8ec6c584%40googlegroups.com. For more options, visit https://groups.google.com/d/optout. ------=_Part_1907_2041921516.1549997242333 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Thank you all very much!

I've submi= tted a bug report on GitHub.

Cheers!

Am Dienstag, 12. Februa= r 2019 18:36:50 UTC+1 schrieb Jesse Rosenthal:
The instructions are in `instrtext` inside `fldchar`, which= we do
support (at least as far as parsing text). So this looks like a bug. It
would be great if you could submit it to the github issue tracker.

When you do sumbit a bug report, please make sure to note what version = of
pandoc you're using (support for this was only added last year), as= well
as a copy of your input docx.

--Jesse

Nyoman Bennyamino <
nyoman.b...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> Hello,
>
> I'd like to use pandoc to convert a MS Word 365 (*.docx) file = to markdown.=20
> My reference management program (Citavi) is using Word Content Con= trol=20
> fields to inject references into footnotes. Unfortunately, pandoc = is=20
> omitting the all references instead of at least converting the ref= erences=20
> into plain text.
>
> For example:
>
> One of my footnote is: "92. See Author, New York 2018, p. 18&= quot;
>
> The source code for this footnote (from Word's footnotes.xml f= ile):
>
> <w:footnote w:id=3D"92"><w:p w14:paraId=3D"= ;315F39A7" w14:textId=3D"2167C15F"=20
> w:rsidR=3D"000B6667" w:rsidRDefault=3D"000B6667&quo= t;=20
> w:rsidP=3D"00F42B65"><w:pPr><w:pStyle w= :val=3D"Funoten"/></w:pPr><w:r=20
> w:rsidRPr=3D"002C10AE"><w:rPr><w:rStyle= =20
> w:val=3D"Funotenzeichen"/><w:b/><w:vert= Align=20
> w:val=3D"baseline"/></w:rPr><w:footnote= Ref/></w:r><w:r><w:t=20
> xml:space=3D"preserve">=20
> </w:t></w:r><w:r><w:tab/></w:r>= <w:sdt><w:sdtPr><w:alias w:val=3D"Don't edit=20
> this field"/><w:tag=20
> w:val=3D"CitaviPlaceholder#ecdec0e6-4e94-491b-a6d1-= 74308e00ccd9"/><w:id=20
> w:val=3D"827022648"/><w:placeholder><w:= docPart=20
> w:val=3D"B8F488EF9BC34FE5B5D83FE3A9CB28BD"/>= ;</w:placeholder></w:sdtPr><w:sdtContent><w:r>= <w:fldChar=20
> w:fldCharType=3D"begin"/></w:r><w:r>= <w:instrText>ADDIN=20
> CitaviPlaceholder{ey....AifQ=3D=3D}</w:instrText></w= :r><w:r><w:fldChar=20
> w:fldCharType=3D"separate"/></w:r><w:r = w:rsidR=3D"00BB051A"><w:t=20
> xml:space=3D"preserve">*See* </w:t></w:r>= <w:r w:rsidR=3D"00BB051A"=20
> w:rsidRPr=3D"00BB051A"><w:rPr><w:smallC= aps/></w:rPr><w:t>Author</w:t></w:r><w:r= =20
> w:rsidR=3D"00BB051A" w:rsidRPr=3D"00BB051A">= ;<w:t>*, **New York 2018*</w:t></w:r><w:r=20
> w:rsidR=3D"00BB051A" w:rsidRPr=3D"00BB051A">= ;<w:rPr><w:vertAlign=20
> w:val=3D"superscript"/></w:rPr><w:t>= 81</w:t></w:r><w:r w:rsidR=3D"00BB051A"=20
> w:rsidRPr=3D"00BB051A"><w:t>*, p. 18.*</w:t&= gt;</w:r><w:r><w:fldChar=20
> w:fldCharType=3D"end"/></w:r></w:sdtCon= tent></w:sdt></w:p></w:footnote>
>
> Output after converting:
>
> pandoc -f docx "test.docx" -w markdown_strict --referenc= e-location "block"
>
> "92. "
>
>
> Any ideas how to fix this?
>
>
> Thanks,
>
> Nyoman
>
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-discus...@googlegroups.com.
> To post to this group, send email to pandoc-...@googlegroups.com.
> To view this discussion on the web visit
https://groups.= google.com/d/msgid/pandoc-discuss/87d9e5ab-3b83-46cf-a538-a6f2308= 454d1%40googlegroups.com.
> For more options, visit https://groups.go= ogle.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/36b188d6-6558-4458-832c-c6ba8ec6c584%40googlegroups.co= m.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_1907_2041921516.1549997242333-- ------=_Part_1906_2036633246.1549997242332--