From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/31179 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Lyndon Drake Newsgroups: gmane.text.pandoc Subject: Re: A little citation parsing wrinkle Date: Fri, 5 Aug 2022 13:34:45 -0700 (PDT) Message-ID: <92daa17b-ca93-41ac-9d0f-bbaa02fad022n@googlegroups.com> References: <13DD89B3-A534-4469-8256-F9690F63A725@gmail.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_46_1948376863.1659731685492" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="9127"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCKOX6WY7EKBBZX5WWLQMGQE4R5TIFY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Fri Aug 05 22:34:51 2022 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-yb1-f184.google.com ([209.85.219.184]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1oK41y-0002AP-7u for gtp-pandoc-discuss@m.gmane-mx.org; Fri, 05 Aug 2022 22:34:50 +0200 Original-Received: by mail-yb1-f184.google.com with SMTP id 137-20020a250b8f000000b0067a5a14d730sf2931756ybl.12 for ; Fri, 05 Aug 2022 13:34:50 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:x-original-sender :mime-version:subject:references:in-reply-to:message-id:to:from:date :sender:from:to:cc; bh=CmBdo1cprxeGrJE2BtEJwBdg45DMG/vmC+ppQJ3vv2A=; b=APEDW/JHUPdt9sr9fogR9KwkLzq4xwKFL4CSCkjSRiSJn+C9U6ZMDbvpdiYiYrPCIR T3lhkRiemOO8gyAFge9z8zsEx864Hrxh74bBSaL9wwMjDKWcDuu5EMvh5Ly0yqN54a88 ww8zUpg4HwRPlkdK5p+ieUIcekGMo1z07+bSKL+r5T736v2az1SGEfYsMykMGUFGpTzL vrubM1LU4jU+g2vrTycoSGANu4NUDWBTD87WdH1PWIkgpGBvd+ugXrdYc2ucb3AKm6qn FPuZOlwcJ2YfQhCzR4ZJMssN07JppvJylmmERrRTKZOYxLf/EnL4fpKC2xg2opju94QA BelQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:x-original-sender :mime-version:subject:references:in-reply-to:message-id:to:from:date :from:to:cc; bh=CmBdo1cprxeGrJE2BtEJwBdg45DMG/vmC+ppQJ3vv2A=; b=c3Xdyh59jzSFS/z5Jdi402OHpBWUalCJw3rcD4rZNh8BAm+mOEoSDDd6xDX8sRsPwT 9sFmM5m2eLZZSyzvXstV4euiQw9Wyo8i8bDoTxOlE3we/XOArGZAEhM2Cho6tCz07JSn 4CBJ1wNh7/wKj6eUXWoMhw3dJtsONI/ba0nQPMPwNIHcyTAjfp8yzuZZmGP0C4gnh8jb DYF1bhl9vNWV5WP60aZ0V1asHCAdacoHCKZ7GZ6zp9kyfgrpGdl4RpXNeVQlUWD6Wsil HCc8/wRN3l0a+rVAuqO0Joke8PPgLQVO/rGY8WZ9/Yd+4JW+l8XUBKmOS1MqIJfw0FAi W35g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-sender:mime-version:subject:references:in-reply-to :message-id:to:from:date:x-gm-message-state:sender:from:to:cc; bh=CmBdo1cprxeGrJE2BtEJwBdg45DMG/vmC+ppQJ3vv2A=; b=OkZ5LKE6TOIZW5WnvH5VDgph8MnKc5hpwX1RJUHrlj2+Ud3WBDmX/PJZa4qQOlkwJb 0ZnS8HP/ou6pqXrxLCN8FuAccvKvJ3LDju+LvV+uBChGqKIsSwH0pO4wEdcXypSINV8p Dxe3KKLspsoZZ1nPYlQ3YQhqVhmOqtfKMgzbAz/G81KqgyhEFnhwrR8Xzc4xG9AQz6DG MKvuTGcQpXPgIo2US0mTiy4NzWHugEZMFzpGIcdCllleME/NuQyqyrxHH7+pS+CDMY4/ rqPmXQpKbRpshc4hOh9hYfec+CoeDTNbsDBp6bZUdZ/0Y/28Mi23WfVj92OAw0ontc+5 W6sQ== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: ACgBeo34oYxfl7s798cvrxGwZ6vLKTNT+tRcvMs5sWhUF9+8RTYGtxGk q9U95uMuUg+JIgldNZ0C4wc= X-Google-Smtp-Source: AA6agR7qbxtwzrRSyh7UuJvf4/36Xx8sRSY3H8qgAFi9V5MiEjGq+5RoqXh1X9Z9GmjqK5/vriKIFQ== X-Received: by 2002:a05:6902:723:b0:677:115b:553c with SMTP id l3-20020a056902072300b00677115b553cmr7395575ybt.214.1659731689023; Fri, 05 Aug 2022 13:34:49 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a25:adce:0:b0:670:98c6:b898 with SMTP id d14-20020a25adce000000b0067098c6b898ls3192139ybe.4.-pod-prod-gmail; Fri, 05 Aug 2022 13:34:46 -0700 (PDT) X-Received: by 2002:a25:c6d7:0:b0:67a:694e:6666 with SMTP id k206-20020a25c6d7000000b0067a694e6666mr6906325ybf.329.1659731686066; Fri, 05 Aug 2022 13:34:46 -0700 (PDT) In-Reply-To: <13DD89B3-A534-4469-8256-F9690F63A725-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> X-Original-Sender: isenguard-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:31179 Archived-At: ------=_Part_46_1948376863.1659731685492 Content-Type: multipart/alternative; boundary="----=_Part_47_981866316.1659731685493" ------=_Part_47_981866316.1659731685493 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable The backslash does indeed fix it. Thank you! On Friday, 5 August 2022 at 19:20:56 UTC+1 fiddlosopher wrote: > Oh, this is interesting. I can reproduce with this input: > > ``` > --- > references: > - id: Eruera1997 > author: N. Eurera > title: "My article" > journal: "My journal" > type: journal-article > volume: 55 > date: 1997 > ... > > see[^fn7] > see[^fn8] > > [^fn7]: @Eruera1997 . Eruera's text switches freely between English and= =20 > M=C4=81ori. > > [^fn8]: @Eruera1997. Eruera's text switches freely between English and=20 > M=C4=81ori. > > ``` > > Output with pandoc test.md -C -t plain: > > ``` > see[1] see[2] > > [1] 1 . Eruera=E2=80=99s text switches freely between English and M=C4=81= ori. > > [2] > 1. Eruera=E2=80=99s text switches freely between English and M=C4=81ori. > ``` > > What's happening here is that the `@Eurera1997` in fn8 is being=20 > interpreted as an example list, which causes the `@Eurera1997` in fn7 to = be=20 > interpreted as a reference to that example list item. > > Obviously not what you want. You can defeat this by putting a backslash= =20 > before the period. > > > > > On Aug 5, 2022, at 4:48 AM, Lyndon Drake wrote: > >=20 > > Hi, > >=20 > > I've noticed what I think is an unexpected parsing difference for a=20 > citation which begins a footnote, and which is followed immediately by=20 > punctuation. This: > >=20 > > [^fn7]: @Eruera1997 . Eruera's text switches freely between English and= =20 > M=C4=81ori. > >=20 > > is treated differently from this: > >=20 > > [^fn7]: @Eruera1997. Eruera's text switches freely between English and= =20 > M=C4=81ori. > >=20 > > The latter, when using citeproc and outputting to Word with the SBL=20 > style, seems to generate a footnote inside the footnote. (The former,=20 > unsurprisingly, leaves a space between the generated citation and the ful= l=20 > stop.) By comparison, a similar in-text citation followed immediately by = a=20 > full stop at the very end of a footnote is treated normally. > >=20 > > Lyndon > >=20 > > --=20 > > You received this message because you are subscribed to the Google=20 > Groups "pandoc-discuss" group. > > To unsubscribe from this group and stop receiving emails from it, send= =20 > an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > > To view this discussion on the web visit=20 > https://groups.google.com/d/msgid/pandoc-discuss/d41e5b70-462e-4f7b-b6b1-= d2ead5b6c4ccn%40googlegroups.com > . > > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/92daa17b-ca93-41ac-9d0f-bbaa02fad022n%40googlegroups.com. ------=_Part_47_981866316.1659731685493 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable The backslash does indeed fix it. Thank you!

On Friday, 5 August 2022 at 19:2= 0:56 UTC+1 fiddlosopher wrote:
Oh, this is interesting. I can reproduce with this input:

```
---
references:
- id: Eruera1997
author: N. Eurera
title: "My article"
journal: "My journal"
type: journal-article
volume: 55
date: 1997
...

see[^fn7]
see[^fn8]

[^fn7]: @Eruera1997 . Eruera's text switches freely between English= and M=C4=81ori.

[^fn8]: @Eruera1997. Eruera's text switches freely between English = and M=C4=81ori.

```

Output with pandoc test.md -C -t plain:

```
see[1] see[2]

[1] 1 . Eruera=E2=80=99s text switches freely between English and M=C4= =81ori.

[2]
1. Eruera=E2=80=99s text switches freely between English and M=C4=81or= i.
```

What's happening here is that the `@Eurera1997` in fn8 is being int= erpreted as an example list, which causes the `@Eurera1997` in fn7 to be in= terpreted as a reference to that example list item.

Obviously not what you want. You can defeat this by putting a backslash= before the period.



> On Aug 5, 2022, at 4:48 AM, Lyndon Drake <isen...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
>=20
> Hi,
>=20
> I've noticed what I think is an unexpected parsing difference = for a citation which begins a footnote, and which is followed immediately b= y punctuation. This:
>=20
> [^fn7]: @Eruera1997 . Eruera's text switches freely between En= glish and M=C4=81ori.
>=20
> is treated differently from this:
>=20
> [^fn7]: @Eruera1997. Eruera's text switches freely between Eng= lish and M=C4=81ori.
>=20
> The latter, when using citeproc and outputting to Word with the SB= L style, seems to generate a footnote inside the footnote. (The former, uns= urprisingly, leaves a space between the generated citation and the full sto= p.) By comparison, a similar in-text citation followed immediately by a ful= l stop at the very end of a footnote is treated normally.
>=20
> Lyndon
>=20
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-discus..= .@googlegroups.com.
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/d41e5b7= 0-462e-4f7b-b6b1-d2ead5b6c4ccn%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d= /msgid/pandoc-discuss/92daa17b-ca93-41ac-9d0f-bbaa02fad022n%40googlegroups.= com.
------=_Part_47_981866316.1659731685493-- ------=_Part_46_1948376863.1659731685492--