From: Jesse Rosenthal
Newsgroups: gmane.text.pandoc
Subject: Re: docx -> markdown & Citavi Content Control
Date: Tue, 12 Feb 2019 12:36:47 -0500
Message-ID: <87o97gvrgw.fsf@jhu.edu>
References: <87d9e5ab-3b83-46cf-a538-a6f2308454d1@googlegroups.com>
To: Nyoman Bennyamino, pandoc-discuss

The instructions are in `instrtext` inside `fldchar`, which we do support
(at least as far as parsing text). So this looks like a bug. It would be
great if you could submit it to the GitHub issue tracker.

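For anyone putting together such a report, here is a minimal sketch (not part of pandoc) of how one might check what the field actually contains before filing. It walks a WordprocessingML part in document order and pairs each field's instruction text (`w:instrText`, between the `begin` and `separate` `w:fldChar` markers) with its visible result text (`w:t`, between `separate` and `end`). The function names `list_fields` and `fields_in_docx` are hypothetical, and the namespace below assumes a transitional (non-strict) docx.

```python
import zipfile
import xml.etree.ElementTree as ET

# Assumption: transitional OOXML namespace; strict-mode files use another URI.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def list_fields(xml_data):
    """Return (instruction, result_text) pairs for each complex field."""
    root = ET.fromstring(xml_data)
    fields = []
    stack = []  # currently open fields, innermost last
    for el in root.iter():  # iter() visits elements in document order
        if el.tag == f"{{{W}}}fldChar":
            kind = el.get(f"{{{W}}}fldCharType")
            if kind == "begin":
                stack.append({"instr": [], "result": [], "in_result": False})
            elif kind == "separate" and stack:
                stack[-1]["in_result"] = True
            elif kind == "end" and stack:
                f = stack.pop()
                fields.append(("".join(f["instr"]).strip(),
                               "".join(f["result"])))
        elif el.tag == f"{{{W}}}instrText" and stack and not stack[-1]["in_result"]:
            # Field code, e.g. "ADDIN CitaviPlaceholder{...}"
            stack[-1]["instr"].append(el.text or "")
        elif el.tag == f"{{{W}}}t" and stack and stack[-1]["in_result"]:
            # Visible text a reader sees in Word for this field
            stack[-1]["result"].append(el.text or "")
    return fields


def fields_in_docx(path, part="word/footnotes.xml"):
    """Read one part out of a .docx (a zip archive) and list its fields."""
    with zipfile.ZipFile(path) as z:
        return list_fields(z.read(part))
```

Calling `fields_in_docx("test.docx")` should list each field in the footnotes part; if the result text shows up here but not in pandoc's output, that is useful detail to include in the issue.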
When you do submit a bug report, please note which version of pandoc
you're using (support for this was only added last year) and attach a
copy of your input docx.

--Jesse

Nyoman Bennyamino writes:

> Hello,
>
> I'd like to use pandoc to convert a MS Word 365 (*.docx) file to markdown.
> My reference management program (Citavi) is using Word Content Control
> fields to inject references into footnotes. Unfortunately, pandoc is
> omitting all the references instead of at least converting the references
> into plain text.
>
> For example:
>
> One of my footnotes is: "92. See Author, New York 2018, p. 18"
>
> The source code for this footnote (from Word's footnotes.xml file):
>
> w:rsidR="000B6667" w:rsidRDefault="000B6667"
> w:rsidP="00F42B65"> w:rsidRPr="002C10AE"> w:val="Funotenzeichen"/> w:val="baseline"/> xml:space="preserve">
> w:val="CitaviPlaceholder#ecdec0e6-4e94-491b-a6d1-74308e00ccd9"/> w:val="827022648"/> w:val="B8F488EF9BC34FE5B5D83FE3A9CB28BD"/> w:fldCharType="begin"/>ADDIN
> CitaviPlaceholder{ey....AifQ==} w:fldCharType="separate"/> xml:space="preserve">*See* w:rsidRPr="00BB051A">Author w:rsidR="00BB051A" w:rsidRPr="00BB051A">*, **New York 2018* w:rsidR="00BB051A" w:rsidRPr="00BB051A"> w:val="superscript"/>81 w:rsidRPr="00BB051A">*, p. 18.* w:fldCharType="end"/>
>
> Output after converting:
>
> pandoc -f docx "test.docx" -w markdown_strict --reference-location "block"
>
> "92. "
>
> Any ideas how to fix this?
>
> Thanks,
>
> Nyoman