From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/23435 Path: news.gmane.org!.POSTED.blaine.gmane.org!not-for-mail From: nopria Newsgroups: gmane.text.pandoc Subject: U+200B and LaTeX Date: Wed, 18 Sep 2019 03:05:49 -0700 (PDT) Message-ID: <45688658-4762-4910-b8d1-a28a23efd91c@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_71_1400588643.1568801149327" Injection-Info: blaine.gmane.org; posting-host="blaine.gmane.org:195.159.176.226"; logging-data="141597"; mail-complaints-to="usenet@blaine.gmane.org" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDRYH4WSQIOBB7UCRDWAKGQESJKXZCA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Sep 18 12:05:53 2019 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-ot1-f58.google.com ([209.85.210.58]) by blaine.gmane.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.89) (envelope-from ) id 1iAWqO-000afU-Aw for gtp-pandoc-discuss@m.gmane.org; Wed, 18 Sep 2019 12:05:52 +0200 Original-Received: by mail-ot1-f58.google.com with SMTP id 9sf3322026otc.21 for ; Wed, 18 Sep 2019 03:05:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=n4xYofH/G4YyJRpRsKsJwxcYoZ94PlrBoTLLtLWjOEk=; b=R1CS/xoUgVFP7sF5sjJa4Y1GDrTogpa+HhJUkLcaOs93Fm4ogBJLpAITrFzcwCBme7 iwrA418NISY38Cpi72rlkr9qbW/jKGOfWXYUqbbbzbuPiMbAvB+6kJ+GtN3TWfZoA2lv /7NtdprBzzeQagbkkcXvL6AMjvR9b2R9O3n6evK0qDlQQ2ztTXqY+h4GnGYX752xsiQ/ q9K9I1fzpYXp3H1qUBXvdUR/vNBDSd3jrzvL3mUJoCnYE/SNSTJ/nl0+UfVN3qAAkiWb TWgRyGyQPNJBgRnigIF2dWLChnSKqf/rqKLBhwUVPZhb3++mDCR2S8tLgxGpDjWXNd4Z syvQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:subject:mime-version:x-original-sender :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=n4xYofH/G4YyJRpRsKsJwxcYoZ94PlrBoTLLtLWjOEk=; b=cqFDrrWSxTsYO9qvhSYXHKwVMcO1lh0s+JgNDhd/3uH8ypcC2nSPDg3PZc2eNLTg4i tWuRRaIjtiyzxbLlKQfduhWBe4G3bp//CMb38gVPobgqwZkUCrsrYDQHjNMMYG/r+1CK Ef6rRLBYx0PB1cJZkodUp8eqAooFIEYdDGiEWW003eW1ZvKokMiXgEMzVcSWY0zZPGPT kgOqmM4+M+Y3Zz3OOl5dmrT+MEADMGr3+AdkHvPejgKzRDGzqS3bBX8ArJCFGBY3zBOJ DWfdeu4sprvdwUQ8AzqV1bvXOprKhXlYzk7hvef6KgGhkcRWg8rvpATloVuK7Goz9gVi cOjg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=n4xYofH/G4YyJRpRsKsJwxcYoZ94PlrBoTLLtLWjOEk=; b=qYxT3wFqciyzLRNaHCQVmbH3bhsNN8GQvqSgmFHkBocNKyThP/wNk5TUO8w8Fncf5l g3HqaWKw/dlitpGQSqWTP5ulfuZOgd1DC9qpjwr9vdg/JmKLNXKJWEZ49BFx5zFIU/cn fv1CnhhyVivHrD0Kc7CvjkGn48aNmLG/WYbP7RQdaygM1fFlJ4FiWYCjj2DP21bYcW+i Exb3Cu3NyEQ2pqE9601nBytsiM9gCUNkMe4qseth8/Cu+42dEn59bpBP1RNjTfHYaXDg 5RtyDgtfQeAS8Af/8gyhAlidytLZZAmTQVpQKkuVAWjC9CMNqZA09UKMzYwgcMmd+KuF N+zA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: APjAAAXcLjCEqVHx99wD+72UYI9tJlLEkxpdljz8qXKBDD2jMOeBuIwj 7EQ5M9X7BzwbMJ7aAzKjHKk= X-Google-Smtp-Source: APXvYqw0a04iheRbFywFJYocYYl3UV+XlB+NgzF3mpJNlwFiVibtShNkfiDxOc9mT7Jfn0fbV4hamg== X-Received: by 2002:aca:a9c3:: with SMTP id s186mr1638177oie.60.1568801150915; Wed, 18 Sep 2019 03:05:50 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:aca:ed43:: with SMTP id l64ls68061oih.3.gmail; Wed, 18 Sep 2019 03:05:50 -0700 (PDT) X-Received: by 2002:aca:3ad6:: with SMTP id h205mr1607246oia.129.1568801149913; Wed, 18 Sep 2019 03:05:49 -0700 (PDT) X-Original-Sender: mmj529-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:23435 Archived-At: ------=_Part_71_1400588643.1568801149327 Content-Type: multipart/alternative; boundary="----=_Part_72_1491120441.1568801149328" ------=_Part_72_1491120441.1568801149328 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Converting from docbook to LaTeX I came across a possible uncorrect=20 management of U+200B when converting to LaTeX. The following docbook MWE (the simple string "...abc")
…​abc is converted to LaTeX \ldots=E2=80=8Babc with a (invisible but detectable in the real output) zero-width-space=20 between "\ldots" and "abc". I think that the correct LaTeX output should be \ldots abc with a standard space after `\ldots`, because if you try to produce a PDF= =20 you get [WARNING] Missing character: There is no =C3=94=C3=87=C3=AF (U+200B) in fon= t [lmroman10- regular]:mapping=3Dtex-text;! because of the presence of the zero-width-space, whereas with the standard= =20 space you get the correct output ("...abc" and not "... abc") in PDF (and= =20 no warnings). --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/45688658-4762-4910-b8d1-a28a23efd91c%40googlegroups.com. ------=_Part_72_1491120441.1568801149328 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Converting from docbook to LaTeX I came across a poss= ible uncorrect management of U+200B when converting to LaTeX.
The= following docbook MWE (the simple string "...abc")

<?xml version=3D&qu= ot;1.0" encoding=3D"= UTF-8"?>
=
<?asciidoc= -toc?>
<?asciidoc-numbered?>
<article xmlns<= /span>=3D<= span style=3D"color: #080;" class=3D"styled-by-prettify">"http://docbo= ok.org/ns/docbook" xmlns:xl=3D"= ;http://www.w3.org/1999/xlink" version=3D"5.0" xml:lang= =3D"e= n">= ;
<simpara>= &#8230= ;&#8203;abc</simpara>

is converted to LaTeX

\= ldots=E2= =80=8Babc<= /span>

with a (invisible but detectable in= the real output) zero-width-space between "\ldots" and "abc= ".

I think that the correct LaTeX output shou= ld be

\ldots abc

with a = standard space after `\ldots`, because if you try to produce a PDF you get<= /div>

[WARNING] Missing character: There is no =C3=94= =C3=87=C3=AF (U+200B) in font [lmro= man10-regular= ]:mapping=3Dtex-text;!

because of the= presence of the zero-width-space, whereas with the standard space you get = the correct output ("...abc" and not "... abc") in PDF = (and no warnings).

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/45688658-4762-4910-b8d1-a28a23efd91c%40googlegroups.co= m.
------=_Part_72_1491120441.1568801149328-- ------=_Part_71_1400588643.1568801149327--