From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/29902 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Frank Bergmann Newsgroups: gmane.text.pandoc Subject: Re: "double emphasis" bug when converting to asciidoc? Date: Wed, 5 Jan 2022 18:19:13 +0100 Message-ID: <16c258f8-d4fb-4d9f-a9a6-c855e83dfa4a@tuxad.com> References: <3f7b920b-c982-5be5-fa04-9025e008e518@tuxad.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="10128"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:91.0) Gecko/20100101 Thunderbird/91.4.1 To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDB4NK6F5EBBBHNG26HAMGQEN7PDIMQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Jan 05 18:19:29 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lf1-f58.google.com ([209.85.167.58]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1n59we-0002TH-BT for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 05 Jan 2022 18:19:28 +0100 Original-Received: by mail-lf1-f58.google.com with SMTP id m8-20020a0565120a8800b00425edb1a456sf9475895lfu.16 for ; Wed, 05 Jan 2022 09:19:28 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1641403167; cv=pass; d=google.com; s=arc-20160816; b=kv6+epJSNRA5l6HCs3wcmLJJ49PFlNXs09rgTWW7N61m0ZUDZK5/foAlyJuvot5bfR DlekAtgDcY0m2+B+C5/Dg82GX9jgrUHetIGwycZLwzUKRBYxCYEUPAtlmshZxGixNDPX 3oFSE4MaT8CV3u1PkiXmco1Qhh7H0QfIoTin2CivrYbeM7xtanF6883YtvcpoW7InNXa UTijOKCYDfJjx8uz7sHFhPb6zIv9b/BmJRuPkPvpdDjSbReiMQAhADqySiPDzth0+Txq wuf1P4bhz8mqEeTCVQu7a30BMgx9sqpS/Q0hdSea/HILZ8az9ZfvPj6SrhoGEAFzfIH9 v4Pw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :in-reply-to:from:references:to:content-language:subject:user-agent :mime-version:date:message-id:sender:dkim-signature; bh=YPJQpZccm1RI6T0tzO7/KNRyxr6FqkGI4319EQlf1pA=; b=hSWtI10129osXVBt+OBUXpe7iZsG6Iz1GF+IseF5BYkHec7RFo6E2Lgmi7j/rs6Pc7 wMF0XJBP19Ep3dvgqeulNV8kdtD41saU2MbVrqEmWftU5vGgEv8Zaz+jI6MwfQaLtyLK dz+d6qz4zNhjY4zGUfnyIV05BVADPx/20pdkfKT149+vwFkiYYeqHKMQRH48lQ9lYBIc M0o03WXygOcrvz44Mshbw795YdhXYehVcc7XzruEa17Z3rYzIc0SsA862ty3LUGM/j3q piJAvETbKDbNKNtMx2e3sFsTrIvlqqPBUOTEd6xquWLVLTAjGM/uplk6fEwhcfQSwPE3 0TMQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=sender:message-id:date:mime-version:user-agent:subject :content-language:to:references:from:in-reply-to :content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=YPJQpZccm1RI6T0tzO7/KNRyxr6FqkGI4319EQlf1pA=; b=VydCmz8vNhc6yWqYQAnMZJOk6jMiEFvWOCxBaDB/6dPchbr+LWwZg45A8/UEWmnrgS HhjE8ozEFm9lyKw4JucaPxBXRRDczO1bLKqehDO2OxHQLtu1IPBOdGH6Phabutq4mJOQ Bqcs/WgMNnbskT6VHrAaAF47rfKnOKsnun+NaPaHsfBWraBblXMFukpZ1f8OcyEVxFGS uD2T0+L1dvEu9soWT9mGto/JFT1m57M6y95XmNu8FjwM40SpuJnBkcawlOXQjtFRFfjH 32vGpi9bfGNDkFDkf5Sc2bqJnQZsn2Asb2TGzsuIsYcC971dLEZKPZbktWiYzq07WCiQ qig X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=sender:x-gm-message-state:message-id:date:mime-version:user-agent :subject:content-language:to:references:from:in-reply-to :content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=YPJQpZccm1RI6T0tzO7/KNRyxr6FqkGI4319EQlf1pA=; b=xw2/4mnnTQbOmt3uwG2S3+Ryv02DVKTyWV7QkuiInWDyTTYOyXY5NhV6mBC8gxoSFm 22XHw+5EMzjXNb4IOgdMwzslyn41gjoG8N3g8oC8AbmLLlClEeoov66DX14ROHDEAmgk zttX94FJcLnotCUDTRhWbJuUFMvrnYp3ifwxr/aZDCdDdP+31M03DlNZeK+Mns06nAAI 6YIl9e7CeTS12o+C57/KajQn2TZvHBNwVoZbbJzKOdb3IyC7ANdWmoeCwE3AaxaqdkQ7 IJ/SQt0ucCOF4EKGZV2xVOW3Wh2/QvwSf8qf Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM532XXatuV+GwHji0wO0w1nqvMXENZYG1VLSgtH6AWymSu+5GD5YR OsXB4fMtBRebQau/IZjXn3M= X-Google-Smtp-Source: ABdhPJwt7A7jyN70ZSNyWJBMn1TZB2mazyswF1E3m7AX+CDiovRelglcbUa8NdQ7rGlArIPT8bvi0g== X-Received: by 2002:a2e:9cc7:: with SMTP id g7mr38667934ljj.128.1641403167236; Wed, 05 Jan 2022 09:19:27 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6512:3d2a:: with SMTP id d42ls3376788lfv.0.gmail; Wed, 05 Jan 2022 09:19:24 -0800 (PST) X-Received: by 2002:a19:644f:: with SMTP id b15mr47533813lfj.76.1641403164315; Wed, 05 Jan 2022 09:19:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1641403164; cv=none; d=google.com; s=arc-20160816; b=QdRO/gJChQGtEEtxbJQ8jNVJi49YNapiRCVBaGSD8i1kj4L2XoK8Srlx+AZ1PREad1 HNG8rNBezgNOKvJC6YEXM4QdsR4hvpo1sP5RqALsXEByG2l9Oj4i+clVYrZJhtighlJP iFkeBO0NOqu6bYgTsutU+vD7AgBD7tJvm9ASjiL8Sv8S0tXkxULDy4IYIlIfHQgwg7Xa o83meNafhnx/HrisRK/Oj0RoCiXX7p+cZxwMuWEV+Q7eRk0IwOFpHz/havSqUbHxXiAH VQ0RyaxGVi/Qtwvy9ceYt08GvE9n+K8cUbmmIuemE91tsegzxeYO8s8GzbgpefRxQmur iCLg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:to :content-language:subject:user-agent:mime-version:date:message-id; bh=J1Yxyvd4c8HZP85SU+g5vyiYlg5kdXDSlgMtro+4LEg=; b=lja3BA0WNb9+u54sudvmuPzC7WDxnJEuaYE14lYNJIpLQG3GTRh9NLOk+p7PyrMtAu v9l1FuMPbByAvZWCMrXI0VOe9Ma5c/mqXmvJVZ5bquJc0hOnWHLUfI2hIw0UpuJD2sKJ 4yNGmr+km0UIiawHcl84FBcIZPsceC608legqTDrVkztGCOu9A527kMT7qm35SjmLB7J gIl7ajg7JVmviNJ9Kt25iV9F1HxmLsgxfi9RS3t11jCCVNz24dsIj/6vyp7SwYJVX6fx KNTfDlH8rPt36BYXd0EfrGraEd1m5jSKVAxpGyPAjP6pECnaLNtnehG8skLhEif+WjsH xAdg== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org Original-Received: from mail.tuxad.com (treferpol.tuxad.net. [81.89.239.233]) by gmr-mx.google.com with ESMTPS id r5si1659738lfp.1.2022.01.05.09.19.22 for (version=TLS1_3 cipher=TLS_CHACHA20_POLY1305_SHA256 bits=256/256); Wed, 05 Jan 2022 09:19:22 -0800 (PST) Received-SPF: pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) client-ip=81.89.239.233; Original-Received: from [192.168.101.166] (rhtec.tuxad.net [62.216.165.252]) (using TLSv1.3 with cipher AEAD-CHACHA20-POLY1305-SHA256 (256/256 bits)) (No client certificate requested) (Authenticated sender: frankb) by mail.tuxad.com (Postfix) with ESMTP id A099E5633C for ; Wed, 5 Jan 2022 18:19:21 +0100 (CET) Content-Language: en-US In-Reply-To: X-Original-Sender: pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org designates 81.89.239.233 as permitted sender) smtp.mailfrom=pandoc-eSlkCAlw8VwAvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:29902 Archived-At: Hi, thank you both, John and Leonard. I read the asciidoc specs, did some tests with asciidoc's intraword=20 emphasize, created a few test cases and will now submit a bug with my=20 findings. Frank On 05.01.22 04:17, John MacFarlane wrote: > That's helpful. Please submit a bug report at > https://github.com/jgm/pandoc/issues > > Leonard Rosenthol writes: > >> The AsciiDoc specs on emphasis/italics is here - >> https://docs.asciidoctor.org/asciidoc/latest/text/italic/ - and talks ab= out >> the intraword scenario. >> >> Hope that helps. >> >> Leonard >> >> On Tue, Jan 4, 2022 at 2:13 PM John MacFarlane wrote: >> >>> I see this in the code: >>> >>> inlineToAsciiDoc opts (Emph lst) =3D do >>> contents <- inlineListToAsciiDoc opts lst >>> isIntraword <- gets intraword >>> let marker =3D if isIntraword then "__" else "_" >>> return $ marker <> contents <> marker >>> >>> >>> So apparently in asciidoc you use __ for intraword emphasis >>> (I don't use asciidoc myself). >>> >>> The problem may be that this is a context asciidoc doesn't >>> consider "intraword." >>> >>> Anyway, please submit a bug report and link here. >>> >>> Frank Bergmann writes: >>> >>>> Hi, >>>> >>>> I found a strange behaviour when converting some HTML files to asciido= c. >>>> >>>> Versions used: >>>> asciidoc 9.1.0 >>>> pandoc 2.16.2 >>>> >>>> Example input: >>>> >>>> >>>> >>>> >>>> Xx >>>> >>>> >>>> Xx, >>>> >>>> >>>> >>>> With "pandoc --wrap=3Dnone -f html -t asciidoc" I get this asciidoc ou= tput: >>>> >>>> link:x.htm[_Xx_]__,__ >>>> >>>> The double underscores look "suspicious" and with "asciidoc -b >>>> docbook;xmllint" I get: >>>> >>>> z.xml:10: parser error : Unescaped '<' not allowed in attributes value= s >>>> link:x.htm>>> role=3D"Xx">,>>> >>>> The related docbook line which was created by asciidoc: >>>> >>>> link:x.htm>>> role=3D"Xx">, >>>> >>>> *Is this a known bug?* >>>> >>>> >>>> If I add a space before comma... >>>> >>>> Xx , >>>> >>>> then I get >>>> >>>> link:x.htm[_Xx_] _,_ >>>> >>>> which causes no issue. Also adding a space before the emphasis... >>>> >>>> Xx , >>>> >>>> create an asciidoc file which can be rendered: >>>> >>>> link:x.htm[_Xx_] _,_ >>>> >>>> >>>> >>>> Does someone know this? Does a fix already exist? >>>> >>>> >>>> cheers, >>>> Frank >>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>> Groups "pandoc-discuss" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >>>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/pandoc-discuss/3f7b920b-c982-5be5-fa0= 4-9025e008e518%40tuxad.com >>> . >>> >>> -- >>> You received this message because you are subscribed to the Google Grou= ps >>> "pandoc-discuss" group. >>> To unsubscribe from this group and stop receiving emails from it, send = an >>> email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/pandoc-discuss/m2v8yzpb5x.fsf%40MacBo= ok-Pro-2.hsd1.ca.comcast.net >>> . >>> >> --=20 >> You received this message because you are subscribed to the Google Group= s "pandoc-discuss" group. >> To unsubscribe from this group and stop receiving emails from it, send a= n email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >> To view this discussion on the web visit https://groups.google.com/d/msg= id/pandoc-discuss/CALu%3Dv3JE2PPCY8%3DagCY9wtvwrMKXAidpSVFN650oc%2BHge8J3dw= %40mail.gmail.com. --=20 Frank Bergmann, P=C3=B6dinghauser Str. 5, D-32051 Herford, Tel. +49-5221-92= 49753 SAP Hybris & Linux LPIC-3, E-Mail tx2014-VEyjnN4Vo9k@public.gmane.org, USt-IdNr DE237314606 http://tdyn.de/freel -- Redirect to profile at freelancermap http://www.gulp.de/freiberufler/2HNKY2YHW.html -- Profile at GULP --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/16c258f8-d4fb-4d9f-a9a6-c855e83dfa4a%40tuxad.com.