From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32460 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Bastien DUMONT Newsgroups: gmane.text.pandoc Subject: Re: Issues with Quotation Marks in Pandoc When Mixing Japanese and English Texts Date: Mon, 10 Apr 2023 07:40:31 +0000 Message-ID: References: <4a0eafdc-b4a2-4a6a-9488-d2a1c9ef8351n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="32810"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDCINCES2QJRB472Z2QQMGQET7J4RFY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Apr 10 09:40:40 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lj1-f186.google.com ([209.85.208.186]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1plm8m-0008N9-2N for gtp-pandoc-discuss@m.gmane-mx.org; Mon, 10 Apr 2023 09:40:40 +0200 Original-Received: by mail-lj1-f186.google.com with SMTP id f12-20020a05651c02cc00b002a76e69f496sf585667ljo.9 for ; Mon, 10 Apr 2023 00:40:40 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1681112439; cv=pass; d=google.com; s=arc-20160816; b=KGF/U40uCA+reuhuxIuD7FwH4ETrlKg7HwZQvkF4U/jVIcyLhar4kQr06PJDWZBVIU t8TJSF9WyJe5wsDHDv5RlC3rfCR25y4jOa5Hes+zgx2sdzJt1lIz7WINAL0SeW4nBRH9 GVvh3qiqz3dlUCf1XiKTFoCJRgZumTJ1Yh4KOfFoSt4al4Gu50G7pTvfQZt3nKlDf6Uq a87q0RoD8Az4joBCgs8oTnbC5g/rCKGWocHp/hpjWTladCy+9PNMuT88oxCg0X/aU/QW UHUQfAGq9i5t374ful/wVB9hO/TBmO/iXKe3EYKhTmZ4Avsb2p67/q8SR1QYzd21KZJP dauQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:to:from:date:sender:dkim-signature; bh=ZNcUDJ1yt3uGLLM97qwqsEVTePIjWYn4aMCls5xtD7A=; b=MhZL6KA27FDNcVnF13aJC+F5GTrc47T+JQnFsUbeBrSvShG5cTgsLhPSA3EFLk5Qye Z4moaIBAHQjw4Tm0Xg9GrKG7kxrD+gsDdKhz+xHfo8S0e0cweN0PG1P78I+PFe4xV1q6 si6WNnyuJbY9TT+lODgbTS8sZCuuBIrOCEJP7xt4zschiATo9+fxf/0kyjTi2Cl4TAe1 6P6XzQvIVvw8yRcm4szbOFhPHm1pDdB0XELkYKqv9cf+7MwF8qC5r+Q6z/mt1BvXD2z0 sNex+wSzQExrpKIAK8lf/L1zGUNrMHOF5rwBIepvRr9o7AlmhP/bQB5VwPHa8ob84L6T JbyQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@posteo.net header.s=2017 header.b=YJfHy+60; spf=pass (google.com: domain of bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org designates 185.67.36.66 as permitted sender) smtp.mailfrom=bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=posteo.net DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20221208; t=1681112439; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:to:from:date:sender:from:to:cc :subject:date:message-id:reply-to; bh=ZNcUDJ1yt3uGLLM97qwqsEVTePIjWYn4aMCls5xtD7A=; b=BAFMJ5Kqx/JbP5kZO2nHwnf6wa5TJztRXd79pvXajsV6O/1flzMwqj+ggzRuD+68xU SB7BpEMZKFvyR0XNDa/r31gkquESQ0TQBggpn23rl5mNklkG187EKq4FGsHsGK4B0U9n f74WICEtcILiX9JiLSqGqsaj8U+51u6n6E8nru84odSYzpE2hdV1Dzq8dKoDfp0yUD5/ 2zU3Xqz63jbZA2OolgVhzNwvu2C1XWXC7UupQS1/jHjKIoKFATjP35LCaVs7pwqskijB AvKqAPxmxBgOYP5SO0mofGUPRhJsSyo X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; t=1681112439; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:to:from:date:x-beenthere :x-gm-message-state:sender:from:to:cc:subject:date:message-id :reply-to; bh=ZNcUDJ1yt3uGLLM97qwqsEVTePIjWYn4aMCls5xtD7A=; b=h84eZHvaz3/JS98iRC3ws7aqUeJp4s6NEiZUkCCz++auzmn9w3AlrFbaciXd1zH+yj blXCunz00YwUF4FE7nfHMF4ZwbtoxyrJc7SLUn+iE+qr5BW5K25zFpRrXXRghg5enMEF WDAjh/kpwOUpDsg3oCr5j5jnXHymDyR+MXPNaE8kdXk+SvMNp1xjLmaW+k15P3h7/iAu 1JaxKALgmvZF+ux7k0XTPWnje5QhwQYBv7JYyXC3Ik Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AAQBX9dY7YvIZRJ2L8lOhB7/4wzJgdC7ufjxKNmCOwBvjVmkCKWluqWa fg7NYQSQt24exLCR2xg0/Pk= X-Google-Smtp-Source: AKy350Z7CYtCrlbM/9OAMtFtxQh36f/GV1/JpSeGTiPhQkkGcoWnJd3eqqzRviQsN3N59pSDn2qIwg== X-Received: by 2002:ac2:4d15:0:b0:4e9:d7b3:97a6 with SMTP id r21-20020ac24d15000000b004e9d7b397a6mr1749938lfi.8.1681112439411; Mon, 10 Apr 2023 00:40:39 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6512:3ba4:b0:4ec:7b3a:89b8 with SMTP id g36-20020a0565123ba400b004ec7b3a89b8ls444074lfv.2.-pod-prod-gmail; Mon, 10 Apr 2023 00:40:34 -0700 (PDT) X-Received: by 2002:ac2:52ab:0:b0:4df:830d:4a3a with SMTP id r11-20020ac252ab000000b004df830d4a3amr3001602lfm.23.1681112434487; Mon, 10 Apr 2023 00:40:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1681112434; cv=none; d=google.com; s=arc-20160816; b=fn3acoHu+HdR/u+UISONP+eQvmLneXbsiRyTfCI7htPRsR0tPL7w6tpps+ZlfiPStD DJfl2Ck/E1d43zaDCE+UHmxTq5BOY2ftL2MIqFtEPQB1lPaA2Xh5A34MDVkHyKMlyiGq HxhEgKp2GRuulnQqzjAbKKNlT/yuwdNfpIm/YxszCUKoaHoDio5AC+eMJVFR/UmtGt3g KjZyFOBllhiyNI/rYhQoG3HhnyQ8wIrqUyR6hWcJs7rLOBE3gGxk9Uw+A4fhEuzm1VSG OyuUJZckjJM8mea0fvFq/FH457IgVhSG4qQ6fSHQ/NjACRF1VEC/A6h0yJIZEKLzrqHy wWZg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=in-reply-to:content-transfer-encoding:content-disposition :mime-version:references:message-id:subject:to:from:date :dkim-signature; bh=l7DtUlKyJYpql+I9WIILE3cDF6FP8SApVBdQoEoFjQI=; b=GK8gIukXn06UAjke4ExYVs5U2xraL+LTmxUDNJQVl4gT0UXoqRB2njIrGqk4Dfz4QB Y/SQHmKEzR4aabP1UUsZ3Hp9ULxPuycmKUzzd7e8i1NoUwtOHIpnlEMgEfqtD3WmRCR0 n0zIhWrpejIqvsc0ZJNZNKZbUpyxhM09MjkD7qEm1verBwRlfA2MoOhreqxMkJBiqVR5 9Hv1zzdWUH6IdP0aTKqd4kylL3h5in2taDk8JwFWjAGKkCqNjUR5/sJOJvzC/g9VXUHb EH3rqCUXzPbbt2DVOMPBJqa65vxYAaN3Wqjj7rFBWEhvooZEOnjhaP6NTWrD+PLd2HCn VTjg== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@posteo.net header.s=2017 header.b=YJfHy+60; spf=pass (google.com: domain of bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org designates 185.67.36.66 as permitted sender) smtp.mailfrom=bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=posteo.net Original-Received: from mout02.posteo.de (mout02.posteo.de. [185.67.36.66]) by gmr-mx.google.com with ESMTPS id l21-20020a2ea815000000b002a61d615a07si413675ljq.3.2023.04.10.00.40.34 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 10 Apr 2023 00:40:34 -0700 (PDT) Received-SPF: pass (google.com: domain of bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org designates 185.67.36.66 as permitted sender) client-ip=185.67.36.66; Original-Received: from submission (posteo.de [185.67.36.169]) by mout02.posteo.de (Postfix) with ESMTPS id A8AD624019C for ; Mon, 10 Apr 2023 09:40:33 +0200 (CEST) Original-Received: from customer (localhost [127.0.0.1]) by submission (posteo.de) with ESMTPSA id 4Pw19D0nPVz6tw3 for ; Mon, 10 Apr 2023 09:40:32 +0200 (CEST) Content-Disposition: inline In-Reply-To: <4a0eafdc-b4a2-4a6a-9488-d2a1c9ef8351n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> X-Original-Sender: bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@posteo.net header.s=2017 header.b=YJfHy+60; spf=pass (google.com: domain of bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org designates 185.67.36.66 as permitted sender) smtp.mailfrom=bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=posteo.net Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32460 Archived-At: Does marking the English text as such (with ["Hello world"]{lang=3Den}) sol= ve your problem? Le Sunday 09 April 2023 =C3=A0 04:53:00PM, Shigeru Kobayashi a =C3=A9crit : > Dear Pandoc community, >=20 > I have encountered two issues regarding Pandoc's handling of quotation ma= rks in > cases where Japanese and English texts are mixed. >=20 > I am using Pandoc version 3.1.2 on macOS 12.6.3, and I can reproduce thes= e > issues. If these are indeed bugs, I am planning to submit them as issues = on > GitHub. However, I would appreciate any guidance if these issues arise fr= om my > incorrect usage. >=20 > Issue 1: Conversion of English phrases within Japanese text >=20 > I have observed the following issue. "input.md" is the input file, and > "input.tex" is the output file. >=20 > $ pandoc input.md -o input.tex >=20 > input.md: > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF"Hello, world!"=E3=81=A8=E8=A8=80=E3= =81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 > input.tex: > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF''Hello, world!{}``=E3=81=A8=E8=A8=80= =E3=81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 > However, the conversion is correct when spaces are added before and after= the > double quotation marks. >=20 > input.md: > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF "Hello, world!" =E3=81=A8=E8=A8=80= =E3=81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 > input.tex: > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF ``Hello, world!'' =E3=81=A8=E8=A8=80= =E3=81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 >=20 > Issue 2: The quotation marks are treated as Japanese text >=20 > When converting with Pandoc, the quotation marks are treated as Japanese = text, > resulting in an unnaturally wide gap. I have confirmed this using two fil= es, > "preamble.tex" and "input.md," and specifying as follows: >=20 > $ pandoc input.md -o input.pdf --pdf-engine=3Dxelatex -H preamble.tex. >=20 > preamble.tex: > \usepackage{fontspec} >=20 > \setmainfont{Georgia} > \setjamainfont{BIZ UDMincho Medium} >=20 >=20 > input.md: > --- > documentclass: bxjsarticle > classoption: pandoc > papersize: a4 > fontsize: 10pt > --- >=20 > # =E3=81=AF=E3=81=98=E3=82=81=E3=81=AB >=20 > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF "Hello, world!" =E3=81=A8=E8=A8=80= =E3=81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 > That person said, "Hello, world!" >=20 > pandoc test 2023-04-10 8.47.43.png >=20 > In contrast, when I directly write the content in TeX and output it using= $ > xelatex test.tex, the quotation marks are treated as English text, and th= e > expected output is obtained. >=20 > test.tex: > \documentclass[a4paper,xelatex,ja=3Dstandard]{bxjsarticle} >=20 > \usepackage{fontspec} > \setmainfont{Georgia} > \setjamainfont{BIZ UDMincho Medium} >=20 > \title{=E3=83=86=E3=82=B9=E3=83=88} > \begin{document} > \maketitle >=20 > \section{=E3=81=AF=E3=81=98=E3=82=81=E3=81=AB} >=20 > =E3=81=9D=E3=81=AE=E4=BA=BA=E3=81=AF ``Hello, world!'' =E3=81=A8=E8=A8=80= =E3=81=84=E3=81=BE=E3=81=97=E3=81=9F=E3=80=82 >=20 > That person said, ``Hello, world!'' >=20 > \end{document} >=20 > xelatex test 2023-04-10 8.46.44.png >=20 > Shigeru Kobayashi >=20 >=20 > -- > You received this message because you are subscribed to the Google Groups > "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email > to [1]pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit [2]https://groups.google.com/d/m= sgid/ > pandoc-discuss/4a0eafdc-b4a2-4a6a-9488-d2a1c9ef8351n%40googlegroups.com. >=20 > References: >=20 > [1] mailto:pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org > [2] https://groups.google.com/d/msgid/pandoc-discuss/4a0eafdc-b4a2-4a6a-9= 488-d2a1c9ef8351n%40googlegroups.com?utm_medium=3Demail&utm_source=3Dfooter --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/ZDO9b9OpQyS8OJ_Z%40localhost.