From: "'Jason Seeley' via pandoc-discuss"
To: pandoc-discuss
Newsgroups: gmane.text.pandoc
Subject: Re: ligatures in html
Date: Mon, 21 Sep 2015 07:50:09 -0700 (PDT)
References: <7d633ff1-c25d-436c-a66f-9a8456699db6@googlegroups.com>

Hello,

Ligatures like \ae are specific to the LaTeX (and thus PDF) writer, so
they don't work in any other format. Pandoc just passes the command
through unchanged as raw TeX, and the other writers drop it. For HTML
output, you can use an entity: `&AElig;` or `&aelig;`, for upper case or
lower case. Another option is to use the Unicode character directly (how
you do this depends on your system and text editor: in Windows, hold Alt
and type 0230 on the number pad; in vim, type CTRL-K a e; or use a
character-map app, etc.). This should work for most output formats. It'll
work with LaTeX if you use XeLaTeX or LuaLaTeX, as those allow Unicode
input.
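
On the filter question: below is a minimal sketch of what such a filter might look like, not a tested recipe. It assumes pandoc's JSON AST represents nodes as `{"t": ..., "c": ...}` objects and that a `RawInline` carries `[format, text]`, which matches the native output quoted below. The names `LIGATURES`, `replace_ligatures` and `run_filter` are made up for illustration, and the table is deliberately small.

```python
import json
import sys

# Raw TeX command -> Unicode character. Small, non-exhaustive table.
LIGATURES = {
    r"\ae": "\u00e6",  # lower-case ae ligature
    r"\AE": "\u00c6",  # upper-case AE ligature
    r"\oe": "\u0153",  # lower-case oe ligature
    r"\OE": "\u0152",  # upper-case OE ligature
}


def replace_ligatures(node):
    """Recursively replace raw TeX ligature commands with Str nodes."""
    if isinstance(node, list):
        return [replace_ligatures(child) for child in node]
    if isinstance(node, dict):
        if node.get("t") == "RawInline":
            # Assumed layout: "c" holds [format, text].
            fmt, text = node["c"]
            # The reader keeps the trailing space ("\\ae "), so strip it.
            char = LIGATURES.get(text.strip())
            if fmt in ("tex", "latex") and char is not None:
                return {"t": "Str", "c": char}
        return {key: replace_ligatures(value) for key, value in node.items()}
    # Strings, numbers, booleans and None are returned untouched.
    return node


def run_filter(in_stream, out_stream):
    """Read a JSON AST from in_stream and write the rewritten AST to out_stream."""
    document = json.load(in_stream)
    json.dump(replace_ligatures(document), out_stream)
```

To try it as a real filter, the script would end with `run_filter(sys.stdin, sys.stdout)`, be made executable, and be passed to pandoc with `--filter`. Any raw TeX it does not recognise is left alone, so the LaTeX writer still sees it.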

Jason

On Monday, September 21, 2015 at 5:57:37 AM UTC-5, Chris Wright wrote:
>
> I want to publish a document with an \ae ligature to html and to pdf. The
> latex form "\ae robic" converts to the appropriate form and displays
> properly in pdf, but the html just drops the ligature.
>
> Simple test case:
>
> chriswri$ cat > test.txt
> \ae robic
> chriswri$ more test.txt
> \ae robic
> chriswri$ pandoc -t native test.txt
> [Para [RawInline (Format "tex") "\\ae ",Str "robic"]]
> chriswri$ pandoc -t html test.txt
> <p>robic</p>
>
> What's the best way around this - write a filter? finding some docs that
> will help? (I've found that ... is automatically converted to an ellipsis
> - so \dots isn't necessary).
>
> with thanks
>
> Chris

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/bbaae9b2-c139-415f-9063-86a887358b4c%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.