From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/17947 Path: news.gmane.org!.POSTED!not-for-mail From: Melroch Newsgroups: gmane.text.pandoc Subject: Re: Pandoc selectively transfers glyphs from LuaLaTeX to DOCX Date: Mon, 24 Jul 2017 13:34:05 +0200 Message-ID: References: Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="001a1147ccd6aaefc205550e97de" X-Trace: blaine.gmane.org 1500896052 9730 195.159.176.226 (24 Jul 2017 11:34:12 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Mon, 24 Jul 2017 11:34:12 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCWMVYEK54FRBLVW27FQKGQEDYQIHFI-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Jul 24 13:34:07 2017 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-ua0-f186.google.com ([209.85.217.186]) by blaine.gmane.org with esmtp (Exim 4.84_2) (envelope-from ) id 1dZbcf-000201-NB for gtp-pandoc-discuss@m.gmane.org; Mon, 24 Jul 2017 13:34:02 +0200 Original-Received: by mail-ua0-f186.google.com with SMTP id 80sf5777504uas.0 for ; Mon, 24 Jul 2017 04:34:07 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1500896047; cv=pass; d=google.com; s=arc-20160816; b=hWWGXc19+QfPVG5hpmj9WQ25W404ah1TqvLKBGNpSvgRJCuV3724LXNxOwtt7v53ct 2eMdgpAX+abEqRgXdmYBZcL3iRaNpo5PNjotjLtNZc6715TnyOIrmWjCBevAoLF5BdiQ T84dy4igzIm2cxYeLx7X/fPmwBGig4KKveyRP/tw8i7z2Jihu/IJcvSCDQ85Qdw5sJLD 5zjt8ACBPcwxtlVZQhVt6DwYjKJAzIj2BObjtBjGI/9HVdZDWgJfD2LoUKN8911yP5Lt HuD7igo7wn/SmCQQDjJoDZ3LZqkl/k1bDGEbPF8Pv62Fj1C+Ivx0VHbuHOco9ulNw8AU XV/g== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:references:in-reply-to:mime-version:arc-authentication-results :arc-message-signature:sender:dkim-signature:dkim-signature :arc-authentication-results; bh=mo5TdyWJo+2nTSa9oakRg12+LNSMqhZlxKnxMpRtA/8=; b=b48WzvYSE/LKa6jMHk2rx+tvkt8OFQmgADANHHnrPr+khS1FIYBkAawFWFpSxajw5f MBbvfuyj9K0hGaLyV8d3YRAfgzMAvYgskdOD4he7nBZLzn8TnGQGctmWmVFGAH4HEkCX xom6x6AAQrOe/EmXisry56xGq1XgZRfCOW6DKYOGxEpN/7OxIRjZG82Dhp1yNNTgLalV 2hKNPGKRiJtseHhj6gVEGHPC3XJmhmD3Hnzd9mn/gO3y7EXVV5Mq97jFwGiF4e7G6QwI zdEK4nvrbM40O5dvi4YyuGnLMfU7w0WxiiMiTeDn+jpIm/1m6y7aVgiw7MDO/Zfwic65 96tQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=Uyv6bFx9; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4002:c05::229 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:mime-version:in-reply-to:references:from:date:message-id :subject:to:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=mo5TdyWJo+2nTSa9oakRg12+LNSMqhZlxKnxMpRtA/8=; b=ACnp6en076BwYu/cngIKVqIY1VewiWjW6Ue3jAmE03bDm+I9KOmfmcZr0sx43tKe1r mgP3hEtlaV5w2TntAVjKUPenKlmnhOjPc8x0dtbim7cDN627/pc9t+UPXtUF1I/GVOJX jsk0bsawYFE2AZca1v2tT4zCofPuM5ytUcWO73m5/+AxPHgB7GtSYZB5wVZCQDX2R9fr GyBbLeXhsh/wXNUAncHrhhWSOm8P0WojH0+1Q/R97As5FP5p1hAhpLsVLOmNHkFxfQmb Zvdbi86E0CNmc1h+IJVBdsLmxFHlCzfFGXzPsxyIHc55b72ClAZXWZ1jFujZGRVHui9n lsIg== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=mo5TdyWJo+2nTSa9oakRg12+LNSMqhZlxKnxMpRtA/8=; b=V7YD5nvMoBXv4PRPqbJYGfU7jmQaSiThju16lgcxYdAjMbMbQVGPbkGXM5rz84VN00 2abATTRG70Rkn+mR5JXZaO4UusDXEx7XgvgYxRuhsCKKy6/rELTR/hwZVxlYn7+ijPl7 yQnzQYP6W2hsDSqGNZfm7vBrRl7OTfCHm5rBd0YXxRmEu25wtKkpSjr5cMh14pntF+t5 ubtbSlRveiuKAiKzg09VC/qEmbQPW3msNFVEejDnbDM7kUottr/3CdMVqOfMnhUK9ywZ fHUMEmonq+usFaa+Fy8ISUM9CB4EpU7rhh7Sq01HaeCzQz5wuOMCl2ZemYoqCjTLmp1O +UaA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:mime-version:in-reply-to:references:from :date:message-id:subject:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=mo5TdyWJo+2nTSa9oakRg12+LNSMqhZlxKnxMpRtA/8=; b=FdufWIOmDlhgnjvyiUXoCzvx2Y/i6epJekafi+alK9zSrDqvKkPCPTvq9N8SOtj4fT WVslmRW4xtoT55y5wsCEAkHKIBeYrQbHNW4JAbnlB+Z4JQiNn9BkDY2MTfMtjWARIp6B 9BfTe9M17OXY0eQ7ySg7OoIKamMz60v9Uo73tBEnrCPAVM5JneWTQxlaxafPMHb1e5or Y2/R+rRuWsfvL8e11NdWJJhHMzPdsJJES1WLDxlwW5Prn0SDMpN0kb7Qy/7pNUBFptIf notVIvg7X+6HhgdiYSHBkiZsS6SpnPFjxDQNsrOlDvCA1qxLNUlPf7dEeAAj9PSqfGBc y+gw== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AIVw110eTIA9wBCiiL6At/nqlfYuURuWCVFxnaR4hI3DB29ncsvZpdFG oIxcA1h9n531gg== X-Received: by 10.36.89.7 with SMTP id p7mr253398itb.4.1500896047269; Mon, 24 Jul 2017 04:34:07 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.36.88.208 with SMTP id f199ls3360958itb.18.canary-gmail; Mon, 24 Jul 2017 04:34:06 -0700 (PDT) X-Received: by 10.237.59.122 with SMTP id q55mr11077128qte.4.1500896046456; Mon, 24 Jul 2017 04:34:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1500896046; cv=none; d=google.com; s=arc-20160816; b=GuU5t/iEqcMdHMi9PhSHWOZdAwgRSlSsRncsHwXYkPwmqk6pAwTDq7aMB7BmFAvJOj 4lOwa3l86eGd4EHJw/DD7N3IIsEvZz++F4dKhzvL0Dw5R/Gxv8P6aXdf2ooySM2oXqZ2 QnMA7ls0z45MI9CbOIwcenvsIlF5IvvgRlB3V1sWIfWYNztiT45CaseG+mAmnkyh0cHu tCjO2SD+yHjsybxRTkEhrKG6nXLex4uEccb2KtGDcVUWhYHFF1p+fuJiPI+erv7z05iU wXMi01z864aEd3tXK9PoEntJw6+GZrGdU6RiiKc+MW5u0VMkToMyowOBzWWBD+3I7prO xECA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:references:in-reply-to:mime-version :dkim-signature:arc-authentication-results; bh=Zre+Q9ukYjrFf5XgM5H6eCaXavEhWP2AA6d0Rjq3iUA=; b=Gtxc/zpaSWN6fx//sCaya+ixoRDr0d9evh786v/iTiAJRFloogiXys3hWYhDNy/Lej aHfiO5EYKfbg4bXYgH4met2Ajo+GrDL07Pi7WU0wbAYAJOpFfMm8w8yC87I9FrWnInkZ AANwaaec9e78ixEb58zJPXnhb6vsCVDG2plNQRiCGUM0QjCRFMoiCc5D68sqix5/33k/ K1H2chHsT9vd/BmT4/GxEha114A2vALVdGW1fcVsJPDWXoI4yQ9+FdMKEMk1CiViK0Xl 54KtTWRuXZXZiC8jMc/GGV9BYoSoqj0MyvJgh90PJ8Fu+N4UysFuPWnLyIWtWMGpWUsP ZEww== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=Uyv6bFx9; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4002:c05::229 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Original-Received: from mail-yw0-x229.google.com (mail-yw0-x229.google.com. [2607:f8b0:4002:c05::229]) by gmr-mx.google.com with ESMTPS id e186si273783ywh.4.2017.07.24.04.34.06 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 24 Jul 2017 04:34:06 -0700 (PDT) Received-SPF: pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4002:c05::229 as permitted sender) client-ip=2607:f8b0:4002:c05::229; Original-Received: by mail-yw0-x229.google.com with SMTP id h189so21286477ywf.2 for ; Mon, 24 Jul 2017 04:34:06 -0700 (PDT) X-Received: by 10.37.52.151 with SMTP id b145mr11349229yba.261.1500896046017; Mon, 24 Jul 2017 04:34:06 -0700 (PDT) Original-Received: by 10.37.101.7 with HTTP; Mon, 24 Jul 2017 04:34:05 -0700 (PDT) Original-Received: by 10.37.101.7 with HTTP; Mon, 24 Jul 2017 04:34:05 -0700 (PDT) In-Reply-To: X-Original-Sender: melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=Uyv6bFx9; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4002:c05::229 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:17947 Archived-At: --001a1147ccd6aaefc205550e97de Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Word was quite good at stacking diacritics already about a decade ago when I last looked, so if you just succeed in writing a newcommand which unpacks the LaTeX diacritics into the proper Unicode diacritics you should be good. It should be possible at least to combine some newcommands with a filter which translates the accents. Feel free to contact me offlist and I'll try to work something out. /bpj Den 22 jul 2017 18:38 skrev "Sean Winslow" : > I am trying to convert a dissertation from LaTex to Word, in order to > comply with publisher requirements. Part of why I used LaTeX is my need f= or > complicated diacritics in transcriptions, which XeLaTeX/LuaLaTeX and the > dblaccent package made easy. Now, when I use pandoc to output to docx, > certain glyphs are missing. See, for example, \b{q} in Maqala and \v{\d{C= }} > in Chelaqwot: > > > LuaLaTeX (or XeLaTeX) produces this: > > > > > But this is what I see in Word: > > > > > > Here is my MWE: > > %!TEX TS-program =3D lualatex > %!TEX encoding =3D UTF-8 Unicode > > \documentclass[a4]{memoir} > > %packages > \usepackage{fontspec} > \usepackage{dblaccnt} > > \usepackage{savesym} > \savesymbol{U} > \savesymbol{T} > \usepackage{semtrans} > > %newcommands > \newcommand{\schwa}{=C7=9D} > \newcommand{\mekele}{M\"{a}\b{q}\"{a}l\"{a}} > \newcommand{\chelekot}{\d{\v{C}}el\=3D{a}qwot S\schwa{}lasse} > > \defaultfontfeatures{Mapping=3Dtex-text} > \setromanfont[Mapping=3Dtex-text]{Brill} > > \begin{document} > > The two research locations visited were \mekele{} and \chelekot{}.\par > > \end{document} > > and the pandoc command I am using to convert it: > > pandoc test.tex \ > > --from=3Dlatex \ > > --to=3Ddocx \ > > --output=3Dtest.docx \ > > --latex-engine=3Dlualatex \ > > --reference-docx=3Dtest_ref.docx \ > > -S \ > > -R > > The reference-docx is just the output, but changed to use Brill as the > font. > > Is there any way to have pandoc pass along the special diacritics I need? > Re-doing all of them by hand will be a nightmare, and is a lot of the > reason I am learning pandoc. > > -- > You received this message because you are subscribed to the Google Groups > "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/ > msgid/pandoc-discuss/b4abf81b-74e7-490a-8cb9-f6a313c651e0% > 40googlegroups.com > > . > For more options, visit https://groups.google.com/d/optout. > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/CADAJKhDmsP1O67vC%3DxV5bUmpcJ%2BXh-AaV5rUN7JJ%2BMFE_d3Osg%40= mail.gmail.com. For more options, visit https://groups.google.com/d/optout. --001a1147ccd6aaefc205550e97de Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Word was quite good at stacking diacritics already about = a decade ago when I last looked, so if you just succeed in writing a newcom= mand which unpacks the LaTeX diacritics into the proper Unicode diacritics = you should be good. It should be possible at least to combine some newcomma= nds with a filter which translates the accents. Feel free to contact me off= list and I'll try to work something out.

/bpj

Den 22 jul 2017 18:38 skrev "Sean Winslow" <mrspot-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>:

I am trying to= convert a dissertation from LaTex to Word, in order to comply with publish= er requirements. Part of why I used LaTeX is my need for complicated diacri= tics in transcriptions, which XeLaTeX/LuaLaTeX and the dblaccent package ma= de easy. Now, when I use pandoc to output to docx, certain glyphs are missi= ng. See, for example, \b{q} in Maqala and \v{\d{C}} in Chelaqwot:

=

LuaLaTeX (or XeLaTeX) produces this:

But this is what I see in Word:


Here is my MWE:=C2=A0

=
%!TEX TS-p= rogram =3D lualatex
%!TEX encoding =3D UTF-8 Unicode

\documentclass[a4]{memoir}

%pack= ages
\usepackage{fo= ntspec}
\usepackage= {dblaccnt}

\usepackage{savesym}
\savesymbol{U}
=
\savesymbol{T}
\usepackage{semtrans}
<= div class=3D"m_7659280582344345582subprettyprint">
%newcommands
\newcommand{\schwa}{=C7=9D}
\newcommand{\mekele}{M\"{a}\b= {q}\"{a}l\"{a}}
\newcommand{\chelekot}{\d{\v{C}}el\=3D{a}qwot S\schwa{}las= se}

\defaultfontfeatures{Mapping= =3Dtex-text}
\= setromanfont[Mapping=3Dtex-text]{Brill}

\begin{document}

The = two research locations visited were \mekele{} and \chelekot{}.\par

\end{document}
<= /div>
and the pandoc command I am using to convert it:

pandoc test.tex \

=C2=A0 =C2=A0 --from=3Dlatex \

=C2=A0 =C2=A0 --to=3Ddocx \

=C2=A0 =C2=A0 --output=3Dtest.docx \

=C2=A0 =C2=A0 --latex-engine=3Dlualatex \

=C2=A0 =C2=A0 --reference-docx=3Dtest_ref.docx = \

=C2=A0 =C2=A0 -S \

=C2=A0 =C2=A0 -R


The = reference-docx is just the output, but changed to use Brill as the font.

Is there any way to have pandoc pass along the speci= al diacritics I need? Re-doing all of them by hand will be a nightmare, and= is a lot of the reason I am learning pandoc.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe@googlegroups.com.
To post to this group, send email to pandoc-discuss@googlegroups.com. To view this discussion on the web visit https:= //groups.google.com/d/msgid/pandoc-discuss/b4abf81b-74e7-490a-8cb= 9-f6a313c651e0%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://g= roups.google.com/d/msgid/pandoc-discuss/CADAJKhDmsP1O67vC%3DxV5bUmpcJ%2BXh-= AaV5rUN7JJ%2BMFE_d3Osg%40mail.gmail.com.
For more options, visit http= s://groups.google.com/d/optout.
--001a1147ccd6aaefc205550e97de--