From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/29899 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Frank Bergmann Newsgroups: gmane.text.pandoc Subject: Re: "double emphasis" bug when converting to asciidoc? Date: Wed, 5 Jan 2022 09:39:12 +0100 Message-ID: <695408e4-ba64-571f-42d2-be6fda24a8b1@tuxad.com> References: <3f7b920b-c982-5be5-fa04-9025e008e518@tuxad.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="15783"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:91.0) Gecko/20100101 Thunderbird/91.4.1 Cc: Frank Bergmann To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDB4NK6F5EBBBO5S2WHAMGQE3QQNGMA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Jan 05 09:39:26 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lj1-f187.google.com ([209.85.208.187]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1n51pO-0003w6-C1 for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 05 Jan 2022 09:39:26 +0100 Original-Received: by mail-lj1-f187.google.com with SMTP id g20-20020a2eb5d4000000b0022e0a6d890dsf5164496ljn.15 for ; Wed, 05 Jan 2022 00:39:26 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1641371965; cv=pass; d=google.com; s=arc-20160816; b=cMMleIQxrojLV/c/VB/VNIX58wO75gwwxCGNJIXwnuwD1HmV6cV6t+e7NrmETkwshL gqU2nncnCuwhhJEKja2Iq0ylP9LTrgV51p2gJ8a8v4rEul9GshNKJAzOfd8ZXKGC4p6w INN58bqgLuQomPJgvAYh5Lcbv23kHrFxb7XRrKhK+teUebD159YS4XaODKXt1+TjqmWO 60P0LAFetF/KfuoAOM9MrehZe5xrNitnIfYvBGlkMbU4lWlkN6jPbiigEB2/MWar/lRG nBNIaXKPWDcgKk5loRC2Z20G2kBWECyDYTstCcz9QdWC3UmyJx0ddUs/Knit+S06s6Vj SmxQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :in-reply-to:from:cc:references:to:content-language:subject :user-agent:mime-version:date:message-id:sender:dkim-signature; bh=8a5KqLS8GX5rgrdxAwpRhQIiTNtpRP2A79sLf7qqIQ8=; b=n3rSdxPSMQ+3pmg/G4ncoMupzD+cPiDZobeozFnjR72aVaRvAhEPzfbqvO3oNcHbvR fWdBgp1U95gdYQ3JqEY7P+6Y4l7PNVGiYAJ3DQiGBzCbLeFlPzSV1k2XiO52JCPAl9Kn cu2p2rM7vBQhT/inPaqvZf++dWNHtvd520aM/ba5GcdNw5gn4Dwl3xj0WomgDxSeTQRM rtpFNKwZLq6GonRtCyDqR+38WU3ASV79xQGG879KQlqfbsld1FZHVp5YUYdTuBw5fdsv KcprNxtBU5LVzd2K9rS/tDpoLNZsbpBvKPQ1XsEjE6sbRMYWt8oTASiGPkJat9g8QQM+ A71Q== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=sender:message-id:date:mime-version:user-agent:subject :content-language:to:references:cc:from:in-reply-to :content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=8a5KqLS8GX5rgrdxAwpRhQIiTNtpRP2A79sLf7qqIQ8=; b=HcBRIaXq7V+O8LmxsJi9Pn0hmzu0ZauD7329a0ZinRPKiiiCKfpvBQKkYMwD1vg6v9 HxsxWJBdvGJfOVJBlG5VelkNhTBq9zzBE8uw+BKuOc9UWSgRymHyek3i34iy4toeXR+V CfpQGWkv8m4BXVLilvmwEQin8iZYTRT4KDAeI4iqcBdQfimzk8SwYKT3lPELhD2L3NFg 6NiP7VvvYvdItAu9PYNrte83xu+kLZ4wmFiVBG+2ZOeqqiHdsimrT+vt+Xtxs0Q/KBVl zxwcOhd4090PCdIKKN1CsBz/1fuyFXGQKhBM8w4DqFgRfyfpHSOgVa/WBmYtGBt2x/qb X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=sender:x-gm-message-state:message-id:date:mime-version:user-agent :subject:content-language:to:references:cc:from:in-reply-to :content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=8a5KqLS8GX5rgrdxAwpRhQIiTNtpRP2A79sLf7qqIQ8=; b=YDiLaat/QtbTacOsUR2+qIlt/v4QHEKDQwtNxHMdrYbkEzKhjvY8+B2/ebsjI9SI9R aBy3Z95pJDOGiwuYsEnZTPVZ+QdK2Yxef1b3QmIyvtdzmTEgjvoVbe2nGMpYDkVyhIj8 PwWjPObsOyrNpTs4dn0pa/m/UAXPi32HnTCIgA1Hi0RcW2TEP+sIjP/ux/ilNJpf1kQe +XxwsmI2uuzTQRp6rmq3IGqwBtXYjxEUSBD0+ZG+S7IjaNJiH8KrbV/QT0ENDX48AjrC q7d1CK1jboXl5kg6JFEv0iq2AivYw5XH2 Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM531wDOQE8m11XnTdw5dhZxtasc6cIALMVhAyZccLEPesuRPz05Zj g6O54VkKZ0nvzSXeNiCVEoo= X-Google-Smtp-Source: ABdhPJxnzOr0ivgqTqFyFHdIMp4+m21HDtNM1/LnNjNpW5A52L+KTRDNdmmtRX7pfkFFqVglXMNnAw== X-Received: by 2002:ac2:5b9a:: with SMTP id o26mr45735550lfn.479.1641371965756; Wed, 05 Jan 2022 00:39:25 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6512:158e:: with SMTP id bp14ls2465292lfb.2.gmail; Wed, 05 Jan 2022 00:39:23 -0800 (PST) X-Received: by 2002:a05:6512:3986:: with SMTP id j6mr45044903lfu.170.1641371963041; Wed, 05 Jan 2022 00:39:23 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1641371963; cv=none; d=google.com; s=arc-20160816; b=xGf6wDDPy+EvT8EO5N4moIZl0W8LCyV/uV0sGeDPP9MJGXADdn3MH0rguqMGh1GsI9 CS20QldPsqEF5hawXyCaFvBh8XKS5ulGRBUhXB480d9RY/0DzLdM+o8YrMc2IaT78zGR QX9HgjobKruC/JfP2f+/1AdDM2RQD/3lOQ9dABUwstmEnCwm6kCVWpP3wqRqEj3fEnHy xhrVi1o+pxzxD88mzFUZ6uITVUTv/2ShLGp9VmZPKehWBP6sRepsFZeEGr6oVLvGQ6hM /8d0cXRcgHsFDNU2Cj4iKShAVMlBunNNhxbdxaBBJD3gSgBIzqvOvomJYkm5D0mijk0g gXtg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:cc:references:to :content-language:subject:user-agent:mime-version:date:message-id; bh=sHQmfRJCjFZHs9vpupRgGjBv6rY9UY7szkpAZ7jYOY0=; b=i4ZSlXsnywn1rKgCmzI9BsZCt70zsC7gEwgUg1M7rWfXOGc179Urc3n1a9Qj8LuVA7 TCoXCrnLYzKU8eJmXN5uzmA3y1YBYsvh9UfIzhnMH2RhoFPJzSqw0m2a3TI9otBK4b3H ihlhbIH3hUMo1hEXmFgvUNALUjKnpqsUh9dx48sHFgSDbhJkF+M8rHvDoh3bBBy9omt0 Pxfl2Bdc54Y63nL/v5m9Pkez1DslCtdMHyjgQxlvaGVvyYs/pI/NJYVmJcLX9jI1AJVz RPd273lLYb8M1/Ni6rjkX4IVk9m57lAGiV2zkrEtCVHSxqmZulFBpbe/JA3ff0+C6ZiN /nnw== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org Original-Received: from mail.tuxad.com (treferpol.tuxad.net. [81.89.239.233]) by gmr-mx.google.com with ESMTPS id k19si2097017lfv.12.2022.01.05.00.39.21 for (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Wed, 05 Jan 2022 00:39:21 -0800 (PST) Received-SPF: pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) client-ip=81.89.239.233; Original-Received: from [192.168.101.166] (rhtec.tuxad.net [62.216.165.252]) (using TLSv1.3 with cipher AEAD-CHACHA20-POLY1305-SHA256 (256/256 bits)) (No client certificate requested) (Authenticated sender: frankb) by mail.tuxad.com (Postfix) with ESMTP id A08BF5633C; Wed, 5 Jan 2022 09:39:20 +0100 (CET) Content-Language: en-US In-Reply-To: X-Original-Sender: pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:29899 Archived-At: Hi, that's awesome - thank you for the quick and great answer, John. I was not aware of the intraword marking. I'll check the specs if it is=20 an undesired behaviour of asciidoc or maybe pandoc and then submit a bug=20 accordingly. cheers, Frank On 04.01.22 20:12, John MacFarlane wrote: > I see this in the code: > > inlineToAsciiDoc opts (Emph lst) =3D do > contents <- inlineListToAsciiDoc opts lst > isIntraword <- gets intraword > let marker =3D if isIntraword then "__" else "_" > return $ marker <> contents <> marker > > > So apparently in asciidoc you use __ for intraword emphasis > (I don't use asciidoc myself). > > The problem may be that this is a context asciidoc doesn't > consider "intraword." > > Anyway, please submit a bug report and link here. > > Frank Bergmann writes: > >> Hi, >> >> I found a strange behaviour when converting some HTML files to asciidoc. >> >> Versions used: >> asciidoc 9.1.0 >> pandoc 2.16.2 >> >> Example input: >> >> >> >> >> Xx >> >> >> Xx, >> >> >> >> With "pandoc --wrap=3Dnone -f html -t asciidoc" I get this asciidoc outp= ut: >> >> link:x.htm[_Xx_]__,__ >> >> The double underscores look "suspicious" and with "asciidoc -b >> docbook;xmllint" I get: >> >> z.xml:10: parser error : Unescaped '<' not allowed in attributes values >> link:x.htm> role=3D"Xx">,> >> The related docbook line which was created by asciidoc: >> >> link:x.htm> role=3D"Xx">, >> >> *Is this a known bug?* >> >> >> If I add a space before comma... >> >> Xx , >> >> then I get >> >> link:x.htm[_Xx_] _,_ >> >> which causes no issue. Also adding a space before the emphasis... >> >> Xx , >> >> create an asciidoc file which can be rendered: >> >> link:x.htm[_Xx_] _,_ >> >> >> >> Does someone know this? Does a fix already exist? >> >> >> cheers, >> Frank >> >> --=20 >> You received this message because you are subscribed to the Google Group= s "pandoc-discuss" group. >> To unsubscribe from this group and stop receiving emails from it, send a= n email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >> To view this discussion on the web visit https://groups.google.com/d/msg= id/pandoc-discuss/3f7b920b-c982-5be5-fa04-9025e008e518%40tuxad.com. --=20 Frank Bergmann, P=C3=B6dinghauser Str. 5, D-32051 Herford, Tel. +49-5221-92= 49753 SAP Hybris & Linux LPIC-3, E-Mail tx2014-VEyjnN4Vo9k@public.gmane.org, USt-IdNr DE237314606 http://tdyn.de/freel -- Redirect to profile at freelancermap http://www.gulp.de/freiberufler/2HNKY2YHW.html -- Profile at GULP --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/695408e4-ba64-571f-42d2-be6fda24a8b1%40tuxad.com.