From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/29896 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: "double emphasis" bug when converting to asciidoc? Date: Tue, 04 Jan 2022 11:12:58 -0800 Message-ID: References: <3f7b920b-c982-5be5-fa04-9025e008e518@tuxad.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="15804"; mail-complaints-to="usenet@ciao.gmane.io" To: Frank Bergmann , pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCJZJHG45QDBBR5Y2KHAMGQEYVBC7AQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Jan 04 20:13:13 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-il1-f186.google.com ([209.85.166.186]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1n4pFB-0003wm-E9 for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 04 Jan 2022 20:13:13 +0100 Original-Received: by mail-il1-f186.google.com with SMTP id j6-20020a056e02218600b002b261165281sf20072442ila.22 for ; Tue, 04 Jan 2022 11:13:13 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1641323592; cv=pass; d=google.com; s=arc-20160816; b=wVCRVtRiTF+plXAjtNQGrli761Kpfr5HtJ3ZwmVkXiDr2tIcJUQByo8vRwaXCqul/T jKi9f9hAbz6HEjgYCwMaWJdNKv8stbyr5mLFrhaxPysPYJtbECMFeCs9mqyzbRXeYhdq ZD9pQ6MpkJgR5rQ9QfMIeefjxok7uX45OjkzfBmbuhp6YUJNBJWMWhTlxS8ZC+bnSuP5 rEAj3N3kI/XHL1NLad6FEzNJo78iqsqEQ89E8u80i7GJgqP9iZBetXDEg6P3/8orILIN ioLkbYzGSxoFfjhE3Bdrt7WXt58PbLcbG1dtQ07G7ngu6GlR2lD1qS0m60hUMA9lwSsh K70A== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:mime-version:message-id :date:references:in-reply-to:subject:to:from:sender:dkim-signature; bh=ZGpnM9lD3d9PnsnmrcYltWAe6qkYPRUtG3wWPlUGfgE=; b=VW1W3AmB8pWs8E4E8OpgxrAoMRyeRMT/Xa6h4/p8S65PVy6hrWLF0Ij320eh9pDtzk yuuWF4ntS3DHhMOvdw0hss+ffg1oVJOClUqEXbR2a2DTOZR+yILFpUoAxbJsOZpWE//j t/Dj8HQFA/5TMcDG/sIludERxhc4uPGl9Eknp2Y+x8X0RycjtdWoEz8hx4tjMzqM7IhJ b61pT4cOKJFPPLXurdRBV9QeEhnOhyJ1WVAfPm6fh77MzEpFejXQvd6GkLWm61cT12j4 QA0mtPzKyx3wbor/pimXSD6OIEKhuggoVaKLTP1e7Lq2jbs6ii4MmU6D69w44ig3TcL8 GTGg== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20210112.gappssmtp.com header.s=20210112 header.b=aXxabxqa; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=sender:from:to:subject:in-reply-to:references:date:message-id :mime-version:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=ZGpnM9lD3d9PnsnmrcYltWAe6qkYPRUtG3wWPlUGfgE=; b=J8xMKsyFQNzOWK72b5Fmvfh/LlVH1xl6/kZK5yE4Y4/GoFFm3HZyzJaWQAzSqXoI2Q ADeP27K/kpa1DmX5mLARFwG3xG4TE03+mw6nTNZZGr7iglu7Q9GlFhHZ/nzTWJL7IkVg JnZdsU0lhQkOSuY2GNsYvy94GRxl5F7xwJv6Ma6S2HTIJIpr+5/qyYcBUdkY3prIQYT2 OIZyd0K67cbjfSLzBfurNIY4+78/gRU1Fd84BAgPGHG5xbHiN/x3aEMO7b3Y8S6QT0ML eMWRmH4nqcGGRePlVdZ5ZhtbSgPXZ2cwm0Sb/PN8B3U7MKsL9DbC5D6EnFs17ZJ1EWjO XFkw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=sender:x-gm-message-state:from:to:subject:in-reply-to:references :date:message-id:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=ZGpnM9lD3d9PnsnmrcYltWAe6qkYPRUtG3wWPlUGfgE=; b=SdRibo1zjoioyXFjNr8C97iRd8QPMaxnsmvV4JkDetGGYmshzdl4rw4BQmJYtB2hrg s5tqpM0LgJY1mmwslapGel/p59JjUnKqXaOa2jdqgvd6lXM7vGmQhRZgj0yueazCqV1I 6Mmp4epE5lpePj1CgyEhKQW6R9/qvK2THFPSGbXlkZ3ZgOrzJNEN3CWQhDS8fkwV9QfT h80AO+RKREVAcXIH3FUI/v2mTzWz7ubNL/jM6FGqCEG/O8cYYWcJvKA93PtK1WBDBvmj b6k3mLaacrZjQfiJOilM5nBQVMS0SbAvsA/FSbR/bUpS/eKLugz5bf1XoDsN320/7KZY QQxw== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM531YnN92OBaZf543urGKT7tP3ADj83plce5G2bYZisIkZ7P+Zlir 9zb/n8827dRQ3gZc+7m9uKk= X-Google-Smtp-Source: ABdhPJzNcnEAt7shH4FVAXCSR9VM7PP+H0deZu9BLNKbukvPqSi/2YlCxuULSyRfii522hd3DOE1fQ== X-Received: by 2002:a05:6638:2513:: with SMTP id v19mr24068191jat.140.1641323592434; Tue, 04 Jan 2022 11:13:12 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6638:3783:: with SMTP id w3ls5035145jal.5.gmail; Tue, 04 Jan 2022 11:13:11 -0800 (PST) X-Received: by 2002:a05:6638:387:: with SMTP id y7mr22952140jap.135.1641323591092; Tue, 04 Jan 2022 11:13:11 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1641323591; cv=none; d=google.com; s=arc-20160816; b=F0SAzOkT4JSlnHz1N8OzExglnfHFeMFp88iKyr9shWb9s2NUs5EVNS9i/T6sK1LByE JgpyRkb/B/wYiR/z9Avu4onhPH19PrfEbHWd0gjjZIdFXZqzX/DoBOw+TLPwuU6y/3kl loElcYS8EAMvQMnvoSyEvRXDG8wHGyziKO8CrQ6VCrLRFVx0HLTM7roEdUqZJomMXto4 0k+FhGI9kqvkY6Ec3n7Gyeelbwu0ebEjQ7U3zqEXv7Ny96Ri4qxdPmpZO29j4cw7byG7 Z7V0PZGDeC716OvIqLKKuP07YAA9HI4HtxHn1rxrMrLjA+VeDo265ijWSqqEJB0bpm/N y7nA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=mime-version:message-id:date:references:in-reply-to:subject:to:from :dkim-signature; bh=eWnZv+HozHcIe28B/oZKapbf1T2TAF+ycpqw8oGNG88=; b=MyvtEvM/ORciNjW++PGHO3+dYR2AlzwuvcBPGJ6yw8XqMTwlZRMB+1ATN3nfcRI0CK 948EG1zRUwA/eZ1ARypWyiOD+BhS3dcsEV/HMKArmSn6jJBLBtrv4LdEa7Qf6KniKMIS drD2/zdJOCeCKDEu5BzNyZRJN4YljWHSuvjADjuQgCqxoDQwH2JLKOrKT5nC/ZcpszJ+ EA9Z+APKgedlJv6eBbf324yrSQjQf41XxS6Mxz9chv3yatFUlxQ+az5hwm293o5g5dqv WP4s8xohEXkzfAHTPjTVMxpokygiZTYoANg9YO/I2ayrZ4OwY8pckVglOazBmnvoCpY/ YQ/Q== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20210112.gappssmtp.com header.s=20210112 header.b=aXxabxqa; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Original-Received: from mail-pj1-x102d.google.com (mail-pj1-x102d.google.com. [2607:f8b0:4864:20::102d]) by gmr-mx.google.com with ESMTPS id c11si2332482ilm.5.2022.01.04.11.13.11 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 04 Jan 2022 11:13:11 -0800 (PST) Received-SPF: pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) client-ip=2607:f8b0:4864:20::102d; Original-Received: by mail-pj1-x102d.google.com with SMTP id lr15-20020a17090b4b8f00b001b19671cbebso4143466pjb.1 for ; Tue, 04 Jan 2022 11:13:11 -0800 (PST) X-Received: by 2002:a17:902:6944:b0:149:6fd4:4916 with SMTP id k4-20020a170902694400b001496fd44916mr41615548plt.150.1641323590109; Tue, 04 Jan 2022 11:13:10 -0800 (PST) Original-Received: from johnmacfarlane.net (li55-134.members.linode.com. [74.82.3.134]) by smtp.gmail.com with ESMTPSA id z13sm26047668pgi.75.2022.01.04.11.13.09 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 04 Jan 2022 11:13:09 -0800 (PST) Original-Received: by johnmacfarlane.net (Postfix, from userid 1000) id 9D9FCA29D; Tue, 4 Jan 2022 14:12:58 -0500 (EST) In-Reply-To: <3f7b920b-c982-5be5-fa04-9025e008e518-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org> X-Original-Sender: jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20210112.gappssmtp.com header.s=20210112 header.b=aXxabxqa; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:29896 Archived-At: I see this in the code: inlineToAsciiDoc opts (Emph lst) = do contents <- inlineListToAsciiDoc opts lst isIntraword <- gets intraword let marker = if isIntraword then "__" else "_" return $ marker <> contents <> marker So apparently in asciidoc you use __ for intraword emphasis (I don't use asciidoc myself). The problem may be that this is a context asciidoc doesn't consider "intraword." Anyway, please submit a bug report and link here. Frank Bergmann writes: > Hi, > > I found a strange behaviour when converting some HTML files to asciidoc. > > Versions used: > asciidoc 9.1.0 > pandoc 2.16.2 > > Example input: > > > > > Xx > > > Xx, > > > > With "pandoc --wrap=none -f html -t asciidoc" I get this asciidoc output: > > link:x.htm[_Xx_]__,__ > > The double underscores look "suspicious" and with "asciidoc -b > docbook;xmllint" I get: > > z.xml:10: parser error : Unescaped '<' not allowed in attributes values > link:x.htm role="Xx">, > The related docbook line which was created by asciidoc: > > link:x.htm role="Xx">, > > *Is this a known bug?* > > > If I add a space before comma... > > Xx , > > then I get > > link:x.htm[_Xx_] _,_ > > which causes no issue. Also adding a space before the emphasis... > > Xx , > > create an asciidoc file which can be rendered: > > link:x.htm[_Xx_] _,_ > > > > Does someone know this? Does a fix already exist? > > > cheers, > Frank > > -- > You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/3f7b920b-c982-5be5-fa04-9025e008e518%40tuxad.com.