From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/29897 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Leonard Rosenthol Newsgroups: gmane.text.pandoc Subject: Re: "double emphasis" bug when converting to asciidoc? Date: Tue, 4 Jan 2022 14:50:22 -0500 Message-ID: References: <3f7b920b-c982-5be5-fa04-9025e008e518@tuxad.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="00000000000049a4bd05d4c6f3ad" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="38790"; mail-complaints-to="usenet@ciao.gmane.io" Cc: Frank Bergmann To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCDIL7E46MGBBC6K2KHAMGQE34ACSJA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Jan 04 20:50:38 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-wm1-f62.google.com ([209.85.128.62]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1n4ppN-0009tg-IJ for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 04 Jan 2022 20:50:37 +0100 Original-Received: by mail-wm1-f62.google.com with SMTP id o2-20020a05600c4fc200b00346251961besf546383wmq.0 for ; Tue, 04 Jan 2022 11:50:37 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1641325837; cv=pass; d=google.com; s=arc-20160816; b=TTbtjsAJvtPT99hNFLKM/8douJoLnBrhOq9OsDqIH5nL5LDNWt/VVcMmw5JKbaaAGB 3nBeT5Z0d3LO18JBftGlU35B59pPIoEVoTNtAsjvjmYkrSkPF81uBnkjCImqfVH0aDCy JKW5I8xIqLrf6MIqnri7yJ3HfDxZP3urOhGeabDwhEHCjcU9Ew4niqw3awgSXUanvTEV m4UdX5aobAYBk7CNc/4iEpEgNaNymY3VIP2NCyL8/lTnywWmOK+ecAC0DnvUomwslepo uWj8zqRuIoBcnb9D8QkhL5fsXLR7z45/x/dcfocwtQg1S7iF0gyC49HrIsMdoZYDOGOr Cq0A== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:cc:to:subject:message-id :date:from:in-reply-to:references:mime-version:sender:dkim-signature; bh=5DkAebUNj7LjjTi9nan8wl/UxAXGERKWog/aCU2Wj6o=; b=VTjdq0tmUs9yNEh69M5hgp8JkQsmERsbjxtER+NXA+xo2G2D8rR8ERPAGEzidP0z8f ung0sco1uHUpDobpydpEu8QuoUhxzhIMkjR6dINUL6APP09Saa/tv/8knFGo1aBsN6Cc PqrESKSu1/v33A4PooTVJCg2k6HVsd+rwnrMHes5D5kGaNO45L+doZPb6bRq/XGbzOqs NDPBwwdk96SlF1VOiko9c1t1aMavPVu3NIcWl5hJ0MYsgYpmERJCiNNFuceA+k8rLOE9 jUck+DWC6w5fePxOOD7p4YqL5lWARN9EkDQJAZSIuuUgg1+57kFjOhg4ESQc73q/n7gD y1ZQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@lazerware-com.20210112.gappssmtp.com header.s=20210112 header.b="lvirK/qV"; spf=neutral (google.com: 2a00:1450:4864:20::336 is neither permitted nor denied by best guess record for domain of leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org) smtp.mailfrom=leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=sender:mime-version:references:in-reply-to:from:date:message-id :subject:to:cc:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=5DkAebUNj7LjjTi9nan8wl/UxAXGERKWog/aCU2Wj6o=; b=Wxi+hW16zr4aVNMn+gA6fA48qqlsg26sHx9VAYIOAMlLQLkJA3DypaRBhE+UgGQucB AV2Gz/vgbgEbnf185yWnWrzGyWHGgIPZT04m4eW2q7z8ZSQPpbPPRyXaAHX69wjIkvwp 1J+eSNjy8eU5cMXTrLQ6Zjcl8mu8/spWbgDnOFhnzApGjYM7PMy97VkBTVC8jrfW3x/f 9XpR76RpSRdlZZAyW1bNBG7rsE5YBjkeN4Z3AAsx2eNt9Rmcf7j7I6cUfr7c0AHMoFVV CVFCzor/Kdx5RbK5jXNgxjqJ2EeTbFx7TYxX2/DoBXYKWQyimEZ/JeaE8glcmeHHIOYr hKSQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=sender:x-gm-message-state:mime-version:references:in-reply-to:from :date:message-id:subject:to:cc:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=5DkAebUNj7LjjTi9nan8wl/UxAXGERKWog/aCU2Wj6o=; b=dL/4g36V+SvHvgifMxJ1FxmDo+jbsmU83mQ2Qh6na7e/ki7xpY91GS3ygYkRkwNHWh j7h6rPSE9NAQ8eNq1d+WQHoKob/L/4IowATMVkd9oX8SP8pt32ES2U27YvX75lbvmgrx nhnpKyLIUjTEywLOyVuYm8wV5CLgLIEW3yrc6T+81XIjiwchEE9WDuSLgRISE6cRlOfb zfZjD5Yvg/+/GHbmD1so7uplu4X/MlhSj4kkEptZVvGFj8PBZUfXedA0fEPqOJrQbLXp Gw/JVPig+Z/mamvP8J4azHclYu+7Tng6Wn+gLIqXZbjd4e90dXPWpKrVkX25PG7twLnm 0hAA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM532uN5ZxJRa+5cqGvuYDH5WYJ9dCWDICyXv2jxJ1bIj0FBuJ49Bf vALwd81p7Wcd8RFw3zXFBw4= X-Google-Smtp-Source: ABdhPJxghAX8d6Ompe5MjEhA7OtULv5dWj66uYmGsaRuf9wRB0bPBhm6Xdv32iV64oEJFgiTS2olQQ== X-Received: by 2002:a05:600c:3489:: with SMTP id a9mr42993130wmq.45.1641325836996; Tue, 04 Jan 2022 11:50:36 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:adf:d21b:: with SMTP id j27ls526640wrh.3.gmail; Tue, 04 Jan 2022 11:50:34 -0800 (PST) X-Received: by 2002:adf:efc6:: with SMTP id i6mr42648137wrp.428.1641325834646; Tue, 04 Jan 2022 11:50:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1641325834; cv=none; d=google.com; s=arc-20160816; b=0rgBMNQduOkbOQKqk4ajGS823cT+XKSApE7EVgXJ4VXeYA28YbO3TjwoY3XOUBS6wT 0ntBdjTuEtfMwZ5T+pzXtI6AtKrc76D6ou2ATAQFe1FVMahucD/c7tUZueSkjN5Gz5Fj fcgwkldoJ8L5hnLmCINuuuClSD7/EaGE0d81sv+UVDKN2hPg2IcGauxGxrbHaTxNemn7 b6tpXV0aYD4PAQQ85JnFiScPfklaFywfGAm9RwnlS2q9zuWMP1GQhQRQmcwGE3CwB9hJ tu2pefTUSX2CmzYNUKQa/PGsTrWk8S+FatMFfR5j3mGTwiCC1YNC+IKr4ckSeqw97uvr jhWQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:dkim-signature; bh=v0OIUoMjkJa/cVgcs3e+HPQm+Tg5JJqEoGbgMdepl5Y=; b=kH5IyDY65sy5uqMfwEivSMiXrqvZvSxVa3xxjMNHgb5aS9OnNkWbRR5EzgnFFkLoET OtieFrd5EljAEBAPGBO/Sqdk+XiXRZPxSKfISSp4vrfa4hEBVxXbGCi5ySb9ukDAY8fB e6DWYZJM+wkjpo19MomfBG+JIZPCekV+vvzF2dmv1Yfh8suflGxDeBFk+GdZy5nIG8Te FuI+uzWZ9bMZHbqisDPTbsIbtbP/Um1DKvw7G2AZHmYWTuuISEUTNAyCDpLlgC3ijHE2 AKyRkJyjHaLHe3UPY68VXJuNfKUyDxeI3Y0xIYuL4kfdhZpGpZRPFNaCo7cMn3MvUmJl O+Aw== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@lazerware-com.20210112.gappssmtp.com header.s=20210112 header.b="lvirK/qV"; spf=neutral (google.com: 2a00:1450:4864:20::336 is neither permitted nor denied by best guess record for domain of leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org) smtp.mailfrom=leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org Original-Received: from mail-wm1-x336.google.com (mail-wm1-x336.google.com. [2a00:1450:4864:20::336]) by gmr-mx.google.com with ESMTPS id 126si40567wme.2.2022.01.04.11.50.34 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 04 Jan 2022 11:50:34 -0800 (PST) Received-SPF: neutral (google.com: 2a00:1450:4864:20::336 is neither permitted nor denied by best guess record for domain of leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org) client-ip=2a00:1450:4864:20::336; Original-Received: by mail-wm1-x336.google.com with SMTP id b186-20020a1c1bc3000000b00345734afe78so2039746wmb.0 for ; Tue, 04 Jan 2022 11:50:34 -0800 (PST) X-Received: by 2002:a05:600c:1c02:: with SMTP id j2mr43559412wms.1.1641325833850; Tue, 04 Jan 2022 11:50:33 -0800 (PST) In-Reply-To: X-Original-Sender: leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@lazerware-com.20210112.gappssmtp.com header.s=20210112 header.b="lvirK/qV"; spf=neutral (google.com: 2a00:1450:4864:20::336 is neither permitted nor denied by best guess record for domain of leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org) smtp.mailfrom=leonardr-bM6h3K5UM15l57MIdRCFDg@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:29897 Archived-At: --00000000000049a4bd05d4c6f3ad Content-Type: text/plain; charset="UTF-8" The AsciiDoc specs on emphasis/italics is here - https://docs.asciidoctor.org/asciidoc/latest/text/italic/ - and talks about the intraword scenario. Hope that helps. Leonard On Tue, Jan 4, 2022 at 2:13 PM John MacFarlane wrote: > > I see this in the code: > > inlineToAsciiDoc opts (Emph lst) = do > contents <- inlineListToAsciiDoc opts lst > isIntraword <- gets intraword > let marker = if isIntraword then "__" else "_" > return $ marker <> contents <> marker > > > So apparently in asciidoc you use __ for intraword emphasis > (I don't use asciidoc myself). > > The problem may be that this is a context asciidoc doesn't > consider "intraword." > > Anyway, please submit a bug report and link here. > > Frank Bergmann writes: > > > Hi, > > > > I found a strange behaviour when converting some HTML files to asciidoc. > > > > Versions used: > > asciidoc 9.1.0 > > pandoc 2.16.2 > > > > Example input: > > > > > > > > > > Xx > > > > > > Xx, > > > > > > > > With "pandoc --wrap=none -f html -t asciidoc" I get this asciidoc output: > > > > link:x.htm[_Xx_]__,__ > > > > The double underscores look "suspicious" and with "asciidoc -b > > docbook;xmllint" I get: > > > > z.xml:10: parser error : Unescaped '<' not allowed in attributes values > > link:x.htm > role="Xx">, > > > The related docbook line which was created by asciidoc: > > > > link:x.htm > role="Xx">, > > > > *Is this a known bug?* > > > > > > If I add a space before comma... > > > > Xx , > > > > then I get > > > > link:x.htm[_Xx_] _,_ > > > > which causes no issue. Also adding a space before the emphasis... > > > > Xx , > > > > create an asciidoc file which can be rendered: > > > > link:x.htm[_Xx_] _,_ > > > > > > > > Does someone know this? Does a fix already exist? > > > > > > cheers, > > Frank > > > > -- > > You received this message because you are subscribed to the Google > Groups "pandoc-discuss" group. > > To unsubscribe from this group and stop receiving emails from it, send > an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > > To view this discussion on the web visit > https://groups.google.com/d/msgid/pandoc-discuss/3f7b920b-c982-5be5-fa04-9025e008e518%40tuxad.com > . > > -- > You received this message because you are subscribed to the Google Groups > "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit > https://groups.google.com/d/msgid/pandoc-discuss/m2v8yzpb5x.fsf%40MacBook-Pro-2.hsd1.ca.comcast.net > . > -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/CALu%3Dv3JE2PPCY8%3DagCY9wtvwrMKXAidpSVFN650oc%2BHge8J3dw%40mail.gmail.com. --00000000000049a4bd05d4c6f3ad Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
The AsciiDoc specs on emphasis/italics is here -=C2=A0https://d= ocs.asciidoctor.org/asciidoc/latest/text/italic/ - and talks about the = intraword scenario.

Hope that helps.

<= div>Leonard

On Tue, Jan 4, 2022 at 2:13 PM John MacFarlane <<= a href=3D"mailto:jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org">jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org> wrote:

I see this in the code:

inlineToAsciiDoc opts (Emph lst) =3D do=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0
=C2=A0 contents <- inlineListToAsciiDoc opts lst=C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 isIntraword <- gets intraword=C2=A0 =C2=A0
=C2=A0 let marker =3D if isIntraword then "__" else "_"= =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0
=C2=A0 return $ marker <> contents <> marker=C2=A0 =C2=A0 =C2= =A0 =C2=A0


So apparently in asciidoc you use __ for intraword emphasis
(I don't use asciidoc myself).

The problem may be that this is a context asciidoc doesn't
consider "intraword."

Anyway, please submit a bug report and link here.

Frank Bergmann <pa= ndoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org> writes:

> Hi,
>
> I found a strange behaviour when converting some HTML files to asciido= c.
>
> Versions used:
> asciidoc 9.1.0
> pandoc 2.16.2
>
> Example input:
>
> <!DOCTYPE HTML>
> <html>
> <head>
> <title>Xx</title>
> </head>
> <body>
> <a href=3D"x.htm"><i>Xx</i></a><i= >,</i>
> </body>
> </html>
>
> With "pandoc --wrap=3Dnone -f html -t asciidoc" I get this a= sciidoc output:
>
> link:x.htm[_Xx_]__,__
>
> The double underscores look "suspicious" and with "asci= idoc -b
> docbook;xmllint" I get:
>
> z.xml:10: parser error : Unescaped '<' not allowed in attri= butes values
> <simpara>link:x.htm<emphasis><phrase
> role=3D"<emphasis>Xx</emphasis>">,</phrase= ></
>
> The related docbook line which was created by asciidoc:
>
> <simpara>link:x.htm<emphasis><phrase
> role=3D"<emphasis>Xx</emphasis>">,</phrase= ></emphasis></simpara>
>
> *Is this a known bug?*
>
>
> If I add a space before comma...
>
> <a href=3D"x.htm"><i>Xx</i></a><i= > ,</i>
>
> then I get
>
> link:x.htm[_Xx_] _,_
>
> which causes no issue. Also adding a space before the emphasis...
>
> <a href=3D"x.htm"><i>Xx</i></a> <= i>,</i>
>
> create an asciidoc file which can be rendered:
>
> link:x.htm[_Xx_] _,_
>
>
>
> Does someone know this? Does a fix already exist?
>
>
> cheers,
> Frank
>
> --
> You received this message because you are subscribed to the Google Gro= ups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send= an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
> To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/3f7b920b-c982-5be5-fa04-9025e008e518%40tuxad.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/msgid/p= andoc-discuss/m2v8yzpb5x.fsf%40MacBook-Pro-2.hsd1.ca.comcast.net.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://g= roups.google.com/d/msgid/pandoc-discuss/CALu%3Dv3JE2PPCY8%3DagCY9wtvwrMKXAi= dpSVFN650oc%2BHge8J3dw%40mail.gmail.com.
--00000000000049a4bd05d4c6f3ad--