From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/28095 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: BPJ Newsgroups: gmane.text.pandoc Subject: Re: Error compiling with icu support / possible workaround? Date: Wed, 7 Apr 2021 11:37:27 +0200 Message-ID: References: <5035db2e-16b9-4923-8e38-d95b81d27840n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="000000000000826ee905bf5eae1a" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="7825"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCWMVYEK54FRBZH2WWBQMGQE6IWQOXY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Apr 07 11:37:43 2021 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-ed1-f63.google.com ([209.85.208.63]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1lU4d4-0001rY-8B for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 07 Apr 2021 11:37:42 +0200 Original-Received: by mail-ed1-f63.google.com with SMTP id a2sf11798438edx.0 for ; Wed, 07 Apr 2021 02:37:42 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1617788262; cv=pass; d=google.com; s=arc-20160816; b=yqo1vuo1yGJyonaMMQD+w8hQk/VTQw2yT22Eaol4d7z9bw6S93lI0CmDaydG8Ng3FY vfH+1fnABifExf3imUOP2SBQX9SZ3ZK7CAqQxqlhNVDMvK46QZZNBmgBGpjpjUX179Nv Ld9JvnQImFFpjQnuXLiT2d7AHPYC/6B+z1jvJr73OL0P9Zp4XP+xTU+w1ZcWybz/t4+o k8PdxgG2S5RGMAXFZzTBsQBtv3HSxaM+vHOruPRfXLZk1WMz2TM3JFiVhT3TDRKHRFze 5+WBmdhbYMdUv5FzdG3KLS9ZxvLYjkOS2IBhGNG57/r2imh4+AIDNyzUG3mn5NwNgn+u kjbg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:in-reply-to:references:mime-version:sender:dkim-signature :dkim-signature; bh=gqqYe9XMMdjV9yh6pAbtVuKepqTUfsRUV3YrcbefA3o=; b=JP/tFR+rueyYHOXsi93PNmxd7szsvQ33ru+5c9gTfeZEuswdfggeEq132Pt+cjRYmw m5CZo7OzIfLugg4pJF+CVzt11epdAb2OArPc44c9YF1St0Grzi/PRKzIAtht6yH6mh2C 4y6l+Fpi2TK0FbcRlgIizheoRQsqKSBx1KPGY0G8L7IeYJA3tw0lUPz6HPY7EVFXmBnQ 9HBHjnTgrqCNL6Ux38yOW/jl2CkVhxDvy2j19zeEB7VDGMFnns5Rusi2xpzDF41JS0bP kNrrF2j+xEQcssRJjc056V4i1jUL0YNyGcAgi73VjzjIcj6ENZkJjo63GgQCULQyy9rP Q3uw== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=TrBJaFH4; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::231 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:mime-version:references:in-reply-to:from:date:message-id :subject:to:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=gqqYe9XMMdjV9yh6pAbtVuKepqTUfsRUV3YrcbefA3o=; b=O5qgMsKaTec/1cmX130U+j4/YoM1J26kKwluusp+fsOGonfO18EoKyX9J4v1jqsnxK UVPX7cCzfQlGNBxkxTP3n9M2A7cUIHXgg+fY5OdZxTMoT9b61k/eOz9/sUxAP3o3Tccs Yg9f3sB5P0Rmwv1j55L0cgiND5X+jFQXopbyftc8UEeAlqj6ZSmqtyK9TDBrS8/5zlR9 1cOkzozYsy4WJF92acny4U7+Tey1pDqb88stKvMhVML8BhGE7i/lR45kOo4fwWEoUSxP JDASCfnreNh+ZsAgAX8AajbExKgqC1m8Q4CZRfVJjq9LaiVVRgJ5NpSyoIH2kdMwNwq2 whKQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=gqqYe9XMMdjV9yh6pAbtVuKepqTUfsRUV3YrcbefA3o=; b=UP2srTS2Cv94ncQeAzuhzteA+wQjuh+d/t6SaB9EQWhsghwPZotO84SVE3SHg0+QRw QgOUUi/QCM8B1j4KJD/xj0oos/sejqhQsB9ksAp/iitoetO49zXgqVeonQ53/WcVZzQV 9XD5z2Lcg9roDX9AKAc4bWJ7etXW/ZyI1g3Zdj3iudam1FfFvbR/Td2JnEwdHgrdVvoH XPQsPjZ9qgVINTpUh/DB/7KCkjGRAYaPVU3ThxJKk12zuy5QWLNVex++QicEKFImNnTF sZdXs1k02aySrBLDPDxYE8ONJAzJiV4uIQVwSM/3uVSTgG98UfkuFGb0HgqrE3RYRTmj rB+g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:mime-version:references:in-reply-to:from :date:message-id:subject:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=gqqYe9XMMdjV9yh6pAbtVuKepqTUfsRUV3YrcbefA3o=; b=YEd4vW4pj6GmvQrHcADbnC5vOwgPyvVoEICctUWsf09HkKkKzy5b+TV7r1Gto+py2a Wv7pKL9pjlDPKNwtORlJEh3dV/M4CZLKB827HVRLD+PPwNRMAHywmHzc1jlICOy1Zjqv 1QNrB5P9nZFsscX7G6RIQjb3iNKZCIdBtbC/S+Cw03udaw5suMoU/5JtG3XC1/TgIFGH VmfzuxnTi66Nyxrqa5eTG0U0J0gBS5V5KluXG/kYSDI09/k7IYRap/OZkqifxpLbHTws k/BlBT83EEScC5ea7ByojBOygMKvKPDdcnoOEC5zzaheT0gXs1/fGdks3Q2MIwQBozN2 dXNA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM531TbFrinp9d+ElmNMfo/8HBMOWd4uMSKAhTrY+LrhNqN3E3KvGh C0HU1HjBiOouAJ/cIFeDX5c= X-Google-Smtp-Source: ABdhPJwEhCzbQEXrxwDkCzVAlcwP285DEsNdbjR7uVbJ7kwQSlrSi7J81hhBU/nKv459s42JKcSpaA== X-Received: by 2002:a17:906:82d1:: with SMTP id a17mr981012ejy.237.1617788261940; Wed, 07 Apr 2021 02:37:41 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6402:50cd:: with SMTP id h13ls2119892edb.3.gmail; Wed, 07 Apr 2021 02:37:39 -0700 (PDT) X-Received: by 2002:a50:ef08:: with SMTP id m8mr1342403eds.232.1617788259608; Wed, 07 Apr 2021 02:37:39 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617788259; cv=none; d=google.com; s=arc-20160816; b=Q0alNmmGgQWT3184KEb2NTxcHkE4sFj7RYyM7VF5sYbLys2m7Io8W0QzAR3OTkjMDm cQq3nt7c0Up7CyxvSsRc95mkNAEFimXs0eVIOnEp+H6uFKOjfyUwQizuYhTPEf2uBeKT 946GHNwO3cr8dASHc2D96VDSiYpo6KBhvu/thMnmj5XTXUPQ6ERuFWztzoL6ryvxaqBH ogAiHwnFGtuDxsbjOjULW3zyYAPzrHPKPPjlUj/GVRkImz9bYVs5cc+SQHrX9WwKErAT NhIinjpLVPKW4M1557yU3zGCyldZcBC+uJ/+n7I3ddy/Tgro+hd9yy/e8X1qTZFUACwI 75bw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=XmTD1OPCwTnVzLBZQBMBPslICU6gMU7W5dU/uXTzweU=; b=INO1UpWzBglGSOOzbg47wIcLWJBvFGMaqe3YAm99Z0CHDxwHsW+PcaDztn4LBX3wZB +cYyzNqthzbPB4mXBmtg7RzZssfdxnHUN3/pv3il0qmCyfITjHaDCPjLPryxpeyAYifs jCrulRZemDt137XVRMg2r5QqqhzKiXqOY5/kKoMfbb3NbblcC7GrJiiP8kh/w1AKguIE vflFg0f1XvI35n/7614xXmXzUAwSh2M29BYsGXvaR6Amr9n9NfzqUcDWF+zVt6A9f1xg EMeMmR/WVSWKCES4bIwn4X2suLaTk5RsEv98WcE5by2cLIm6K30gctNwaEMhgoBcfuEm FyKg== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=TrBJaFH4; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::231 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Original-Received: from mail-lj1-x231.google.com (mail-lj1-x231.google.com. [2a00:1450:4864:20::231]) by gmr-mx.google.com with ESMTPS id r21si2256272ejo.0.2021.04.07.02.37.39 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 07 Apr 2021 02:37:39 -0700 (PDT) Received-SPF: pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::231 as permitted sender) client-ip=2a00:1450:4864:20::231; Original-Received: by mail-lj1-x231.google.com with SMTP id a1so19895208ljp.2 for ; Wed, 07 Apr 2021 02:37:39 -0700 (PDT) X-Received: by 2002:a2e:8084:: with SMTP id i4mr1664843ljg.122.1617788259162; Wed, 07 Apr 2021 02:37:39 -0700 (PDT) In-Reply-To: X-Original-Sender: melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=TrBJaFH4; spf=pass (google.com: domain of melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::231 as permitted sender) smtp.mailfrom=melroch-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:28095 Archived-At: --000000000000826ee905bf5eae1a Content-Type: multipart/alternative; boundary="000000000000826ee705bf5eae18" --000000000000826ee705bf5eae18 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable I updated my script to be configurable so that you can try various locales, normalization forms and lists of words with perl/Unicode::Collate::Locale/Unicode::Normalize. Info on required CPAN modules/perl version are in a comment at the top of the file. After installing the requirements use the --help option for usage instructions. Den ons 7 apr. 2021 09:52BPJ skrev: > I tried this out with the latest Unicode::Collate::Locale > > < > https://metacpan.org/pod/release/SADAHIRO/Unicode-Collate-1.29/Collate/Lo= cale.pm > > > > With all of fr_FR fr_CA fr_BE fr_Ch and both Normalization Form C and > Normalization Form D and it turns out that fr_CA actually is different! > > Locale: fr_FR; getlocale: default > Normalization: NFC > Sorted: cote cot=C3=A9 c=C3=B4te c=C3=B4t=C3=A9 > Normalization: NFD > Sorted: cote cote=CC=81 co=CC=82te co=CC=82te=CC=81 > Locale: fr_CA; getlocale: fr_CA > Normalization: NFC > Sorted: cote c=C3=B4te cot=C3=A9 c=C3=B4t=C3=A9 > Normalization: NFD > Sorted: cote co=CC=82te cote=CC=81 co=CC=82te=CC=81 > Locale: fr_BE; getlocale: default > Normalization: NFC > Sorted: cote cot=C3=A9 c=C3=B4te c=C3=B4t=C3=A9 > Normalization: NFD > Sorted: cote cote=CC=81 co=CC=82te co=CC=82te=CC=81 > Locale: fr_CH; getlocale: default > Normalization: NFC > Sorted: cote cot=C3=A9 c=C3=B4te c=C3=B4t=C3=A9 > Normalization: NFD > Sorted: cote cote=CC=81 co=CC=82te co=CC=82te=CC=81 > > If you want to try the script you will need to install the > Unicode::Collate CPAN distribution first, and perl if you are not on a > Unixy system. See: > > > > > > I recommend Strawberry Perl on Windows. > > Den ons 7 apr. 2021 01:39John MacFarlane skrev: > >> >> I just checked my 2006 Le Robert Micro: it has >> >> cote < c=C3=B4te < c=C3=B4t=C3=A9 >> >> cot=C3=A9 appears as a subheading of cote, so I'm not sure it's >> clear from this how it is to be ordered. Not inconsistent >> with the French Academy anyway. >> >> Bastien DUMONT writes: >> >> > Hi, >> > >> > Honestly, these are such subtleties that, as a native French speaker, = I >> have no precise ideas about it. I would say that accents are only a >> secondary criterium for sorting (cote < c=C3=B4te < coteau). Actually th= e >> Wikipedia page about the French alphabet agrees with that: "diacritics a= nd >> ligatures are taken into account only at a third level, after the second >> level (case). [...] In Quebec French diacritics are considered more >> important than case." (I hope my translation is not too bad.) Unfortunat= ely >> they give no reference. As for the "last syllable" rule, I have never he= ard >> of it, but the French Academy's dictionary online has cote < c=C3=B4te <= cot=C3=A9 < >> c=C3=B4t=C3=A9 (https://www.dictionnaire-academie.fr/article/A9C4445?his= tory=3D2). >> Anyway I guess that it rarely applies. I will check a recent Robert >> whenever possible (maybe tomorrow): they introduced a lot of changes in >> 2010. >> > >> > The French Association for Normalization produced a norm in 1969 about >> proper names' sorting, but it is behind a paywall and I am not sure that= it >> is really in use. >> > >> > Cheers, >> > >> > Bastien >> > >> > Le Tuesday 06 April 2021 =C3=A0 04:42:40PM, 'Nick Bart' via pandoc-dis= cuss a >> =C3=A9crit : >> >> Concerning French, I checked a few more sources, and some of them see= m >> to hold different views on French collation: >> https://fr.wikipedia.org/wiki/Alphabet_fran%C3%A7ais states that >> diacritics should be disregarded when sorting, except in Quebec French, >> where accented characters are to appear after their unaccented >> counterparts. No "last syllable" rule is mentioned at all. In addition, = in >> a printed French dictionary, Le Nouveau Petit Robert (1994), I couldn=E2= =80=99t >> find any explicit rules on sorting, but entries are ordered "cote < cot= =C3=A9 < >> c=C3=B4te < c=C3=B4t=C3=A9". Hopefully some native speakers of French wi= ll chime in here. >> >> >> >> As to supporting multiple collations, I tend to think that the defaul= t >> collation (which usually seems to follow the most recent rules for a giv= en >> language) would usually be sufficient. >> >> >> >> -- >> >> You received this message because you are subscribed to the Google >> Groups "pandoc-discuss" group. >> >> To unsubscribe from this group and stop receiving emails from it, sen= d >> an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >> >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/pandoc-discuss/lIJvVkf_iXceir6oyQVnvHD= TXlTIgech_5Trj2TRBY6uBZ_AnU8ghvMV6not9E_QSwG0BhZJUnHprUcIN8UlAKrUw7DzQF5-Zp= Iki3TC74Q%3D%40protonmail.com >> . >> > >> > -- >> > You received this message because you are subscribed to the Google >> Groups "pandoc-discuss" group. >> > To unsubscribe from this group and stop receiving emails from it, send >> an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >> > To view this discussion on the web visit >> https://groups.google.com/d/msgid/pandoc-discuss/YGylIXTe6M3FSBIl%40loca= lhost >> . >> >> -- >> You received this message because you are subscribed to the Google Group= s >> "pandoc-discuss" group. >> To unsubscribe from this group and stop receiving emails from it, send a= n >> email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/pandoc-discuss/m2h7kjoueo.fsf%40MacBoo= k-Pro.hsd1.ca.comcast.net >> . >> > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/CADAJKhDZHQYcZQog7i3DiwFG%3D2T3WeefE_w3hUbfrq0o1FEiYQ%40mail= .gmail.com. --000000000000826ee705bf5eae18 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
I updated my script to be configurable so that you can tr= y various locales, normalization forms and lists of words with perl/Unicode= ::Collate::Locale/Unicode::Normalize.=C2=A0

Info on required CPAN modules/perl version are in a comment a= t the top of the file.

A= fter installing the requirements use the --help option for usage instructio= ns.


<= div dir=3D"ltr" class=3D"gmail_attr">Den ons 7 apr. 2021 09:52BPJ <bpj-J3H7GcXPSITLoDKTGw+V6w@public.gmane.org> skrev:
I tried this out with the latest Unicode::Collate::Locale


With all of fr_FR fr_CA fr_BE fr_Ch and both Normalization Form= C and Normalization Form D and it turns out that fr_CA actually is differe= nt!

Locale: fr_FR; getlo= cale: default
Normalization: NFC
Sorted: cote cot=C3=A9 c=C3=B4te c=C3=B4t=C3=A9
= Normalization: NFD
Sorted: cote cote=CC=81 co=CC=82t= e co=CC=82te=CC=81
Locale: fr_CA; getlocale: fr_CA
Normalization: NFC
Sorted: cot= e c=C3=B4te cot=C3=A9 c=C3=B4t=C3=A9
Normalization: = NFD
Sorted: cote co=CC=82te cote=CC=81 co=CC=82te=CC= =81
Locale: fr_BE; getlocale: default
Normalization: NFC
Sorted: cote cot=C3=A9 = c=C3=B4te c=C3=B4t=C3=A9
Normalization: NFD
Sorted: cote cote=CC=81 co=CC=82te co=CC=82te=CC=81
Locale: fr_CH; getlocale: default
Nor= malization: NFC
Sorted: cote cot=C3=A9 c=C3=B4te c= =C3=B4t=C3=A9
Normalization: NFD
Sorted: cote cote=CC=81 co=CC=82te co=CC=82te=CC=81

If you want to try the script you will need= to install the Unicode::Collate CPAN distribution first, and perl if you a= re not on a Unixy system. See:

<http://www.cpan.org/modules/INSTALL.html><= /div>


I = recommend Strawberry Perl on Windows.

Den ons 7 apr. 2021 01:39J= ohn MacFarlane <jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org> skrev:

I just checked my 2006 Le Robert Micro: it has

cote < c=C3=B4te < c=C3=B4t=C3=A9

cot=C3=A9 appears as a subheading of cote, so I'm not sure it's
clear from this how it is to be ordered.=C2=A0 Not inconsistent
with the French Academy anyway.

Bastien DUMONT <bastien.dumont-VwIFZPTo/vqsTnJN9+BGXg@public.gmane.org> writ= es:

> Hi,
>
> Honestly, these are such subtleties that, as a native French speaker, = I have no precise ideas about it. I would say that accents are only a secon= dary criterium for sorting (cote < c=C3=B4te < coteau). Actually the = Wikipedia page about the French alphabet agrees with that: "diacritics= and ligatures are taken into account only at a third level, after the seco= nd level (case). [...] In Quebec French diacritics are considered more impo= rtant than case." (I hope my translation is not too bad.) Unfortunatel= y they give no reference. As for the "last syllable" rule, I have= never heard of it, but the French Academy's dictionary online has cote= < c=C3=B4te < cot=C3=A9 < c=C3=B4t=C3=A9 (https://www.dictionnaire-academie.fr/= article/A9C4445?history=3D2). Anyway I guess that it rarely applies. I = will check a recent Robert whenever possible (maybe tomorrow): they introdu= ced a lot of changes in 2010.
>
> The French Association for Normalization produced a norm in 1969 about= proper names' sorting, but it is behind a paywall and I am not sure th= at it is really in use.
>
> Cheers,
>
> Bastien
>
> Le Tuesday 06 April 2021 =C3=A0 04:42:40PM, 'Nick Bart' via pa= ndoc-discuss a =C3=A9crit :
>> Concerning French, I checked a few more sources, and some of them = seem to hold different views on French collation: https://fr.wikipedia.org/wiki/Alphabet_fran%C3%A7ai= s states that diacritics should be disregarded when sorting, except in = Quebec French, where accented characters are to appear after their unaccent= ed counterparts. No "last syllable" rule is mentioned at all. In = addition, in a printed French dictionary, Le Nouveau Petit Robert (1994), I= couldn=E2=80=99t find any explicit rules on sorting, but entries are order= ed "cote < cot=C3=A9 < c=C3=B4te < c=C3=B4t=C3=A9". Hope= fully some native speakers of French will chime in here.
>>
>> As to supporting multiple collations, I tend to think that the def= ault collation (which usually seems to follow the most recent rules for a g= iven language) would usually be sufficient.
>>
>> --
>> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
>> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-discuss+unsub= scribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
>> To view this discussion on the web visit ht= tps://groups.google.com/d/msgid/pandoc-discuss/lIJvVkf_iXceir6oyQVnvHDTXlTI= gech_5Trj2TRBY6uBZ_AnU8ghvMV6not9E_QSwG0BhZJUnHprUcIN8UlAKrUw7DzQF5-ZpIki3T= C74Q%3D%40protonmail.com.
>
> --
> You received this message because you are subscribed to the Google Gro= ups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send= an email to pandoc-discuss+unsubscri= be-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
> To view this discussion on the web visit https://groups.google.com/d/ms= gid/pandoc-discuss/YGylIXTe6M3FSBIl%40localhost.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe@go= oglegroups.com.
To view this discussion on the web visit https://groups.= google.com/d/msgid/pandoc-discuss/m2h7kjoueo.fsf%40MacBook-Pro.hsd1.ca.comc= ast.net.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://group= s.google.com/d/msgid/pandoc-discuss/CADAJKhDZHQYcZQog7i3DiwFG%3D2T3WeefE_w3= hUbfrq0o1FEiYQ%40mail.gmail.com.
--000000000000826ee705bf5eae18-- --000000000000826ee905bf5eae1a Content-Type: text/x-perl; charset="US-ASCII"; name="try-locale-sorting.pl" Content-Disposition: attachment; filename="try-locale-sorting.pl" Content-Transfer-Encoding: base64 Content-ID: <178abb01bf59897b0671> X-Attachment-Id: 178abb01bf59897b0671 IyEvdXNyL2Jpbi9lbnYgcGVybAoKIyBUcnkgb3V0IHNvcnRpbmcgYWNjb3JkaW5nIHRvIHZhcmlv dXMgbG9jYWxlcyB3aXRoIFVuaWNvZGU6OkNvbGxhdGU6OkxvY2FsZSBhbmQgbm9ybWFsaXphdGlv biBmb3JtcyB3aXRoIFVuaWNvZGU6Ok5vcm1hbGl6ZS4KIwojIFJlcXVpcmVzIHRoZSBmb2xsb3dp bmcgQ1BBTiBtb2R1bGVzIHRvIGJlIGluc3RhbGxlZDoKIwojIHV0Zjg6OmFsbAojCiMgVW5pY29k ZTo6Q29sbGF0ZTo6TG9jYWxlCiMKIyBVbmljb2RlOjpOb3JtYWxpemUKIwojIFBhdGg6OlRpbnkK IwojIEdldG9wdDo6TG9uZzo6RGVzY3JpcHRpdmUKIwojIFNlZTogCiMgPGh0dHA6Ly93d3cuY3Bh bi5vcmcvbW9kdWxlcy9JTlNUQUxMLmh0bWw+CiMKIyBBbHNvIHJlcXVpcmVzIHBlcmwgNS4xMC4x IG9yIGxhdGVyLgojCiMgSWYgeW91IGFyZSBvbiBhIFVuaXh5IHN5c3RlbSB5b3UgcHJvYmFibHkg aGF2ZSBhIG5ldyBlbm91Z2ggcGVybCBpbnN0YWxsZWQuCiMgT3RoZXJ3aXNlIHNlZToKIyA8aHR0 cHM6Ly93d3cucGVybC5vcmcvZ2V0Lmh0bWw+CiMKIyBPbiBXaW5kb3dzIEkgd291bGQgcmVjb21t ZW5kIFN0cmF3YmVycnkgUGVybC4KIwojIFRoaXMgc29mdHdhcmUgaXMgY29weXJpZ2h0IChjKSAy MDIxIGJ5IEJlbmN0IFBoaWxpcCBKb25zc29uLgojCiMgVGhpcyBpcyBmcmVlIHNvZnR3YXJlOyB5 b3UgY2FuIHJlZGlzdHJpYnV0ZSBpdCBhbmQvb3IgbW9kaWZ5IGl0IHVuZGVyCiMgdGhlIHNhbWUg dGVybXMgYXMgdGhlIFBlcmwgNSBwcm9ncmFtbWluZyBsYW5ndWFnZSBzeXN0ZW0gaXRzZWxmLgoj CiMgaHR0cDovL2Rldi5wZXJsLm9yZy9saWNlbnNlcy8KIwoKdXNlIDUuMDEwMDAxOwojIHVzZSB1 dGY4Owp1c2UgdXRmODo6YWxsOwp1c2Ugc3RyaWN0Owp1c2Ugd2FybmluZ3M7CnVzZSB3YXJuaW5n cyBGQVRBTCA9PiAndXRmOCc7CnVzZSBhdXRvZGllOwoKIyB1c2Ugb3BlbiBxd1sgOnV0ZjggOnN0 ZCBdOwoKdXNlIFVuaWNvZGU6OkNvbGxhdGU6OkxvY2FsZTsKdXNlIFVuaWNvZGU6Ok5vcm1hbGl6 ZSBxd1tub3JtYWxpemVdOwp1c2UgUGF0aDo6VGlueSBxd1twYXRoXTsKdXNlIEdldG9wdDo6TG9u Zzo6RGVzY3JpcHRpdmU7CgpteSgkb3B0LCR1c2FnZSkgPSBkZXNjcmliZV9vcHRpb25zKAogICcl YyAlbycsCiAgWyAnbG9jYWxlfGw9c0AnLCAnQSBsb2NhbGUgdG8gdHJ5IGxpa2UgImZyIiBvciAi ZnItQ0EiLiBSZXBlYXRhYmxlLicsCiAgICAreyByZXF1aXJlZCA9PiAxIH0sCiAgXSwKICBbICdu b3JtYWxpemV8bj1zQCcsCiAgICAnQSBVbmljb2RlIE5vcm1hbGl6YXRpb24gRm9ybSBhY2NvcmRp bmcgdG8gVW5pY29kZTo6Tm9ybWFsaXplIHRvIGFwcGx5IGxpa2UgTkZDIG9yIE5GRC4gRm9yIHVu bm9ybWFsaXplZCBzYXkgLW4gMCAoemVybykuIFJlcGVhdGFibGUuIERlZmF1bHQ6IE5GQy4nLAog ICAgK3sgZGVmYXVsdCA9PiBbJ05GQyddIH0sCiAgXSwKICBbICdpbnB1dHxpPXMnLCAnTmFtZSBv ZiB0ZXh0IGZpbGUgd2l0aCBsaW5lcyB0byBzb3J0LiBBc3N1bWVkIHRvIGJlIFVURi04IGVuY29k ZWQuJywKICAgICsgeyByZXF1aXJlZCA9PiAxIH0sCiAgXSwKICBbICdvdXRwdXR8bz1zJywgJ05h bWUgb2Ygb3V0cHV0IGZpbGUgdG8gcHJpbnQgdG8uIE9wdGlvbmFsLiBEZWZhdWx0OiBzdGRvdXQu JywKICBdLAogIFsgJ2hlbHB8aCcsICdQcmludCBoZWxwIHRleHQgYW5kIGV4aXQuJywKICAgICt7 IHNob3J0Y2lyY3VpdCA9PiAxIH0sCiAgXSwKICArewogICAgc2hvd19kZWZhdWx0cyA9PiAwLAog ICAgZ2V0b3B0X2NvbmYgPT4gW3F3KG5vX2F1dG9fYWJicmV2IG5vX2J1bmRsaW5nIG5vX2lnbm9y ZV9jYXNlKV0sCiAgfSwKKTsKCmlmICggJG9wdC0+aGVscCApIHsKICBzYXkgIiQwOiB0cnkgb3V0 IHNvcnRpbmcgYWNjb3JkaW5nIHRvIHZhcmlvdXMgbG9jYWxlcyB3aXRoIFVuaWNvZGU6OkNvbGxh dGU6OkxvY2FsZSBhbmQgbm9ybWFsaXphdGlvbiBmb3JtcyB3aXRoIFVuaWNvZGU6Ok5vcm1hbGl6 ZS4iOwogIHByaW50ICR1c2FnZS0+dGV4dDsKICBleGl0Owp9CgpteSAkbG9jYWxlcyA9ICRvcHQt PmxvY2FsZTsKbXkgJG5vcm1zICAgPSAkb3B0LT5ub3JtYWxpemU7Cm15ICRpbiAgICAgID0gJG9w dC0+aW5wdXQ7Cm15ICRvdXQgICAgID0gJG9wdC0+b3V0cHV0OwoKbXkgJGZoID0gJG91dCA/IHBh dGgoJG91dCktPm9wZW53X3V0ZjggOiBcKlNURE9VVDsKCnNlbGVjdCAkZmg7CgpteSBAbGluZXMg PSBwYXRoKCRpbiktPmxpbmVzX3V0Zjg7Cgpmb3IgbXkgJGxvY2FsZSAoIEAkbG9jYWxlcyApIHsK ICBteSAkY29sbCA9IFVuaWNvZGU6OkNvbGxhdGU6OkxvY2FsZS0+bmV3KGxvY2FsZSA9PiAkbG9j YWxlKTsKICBwcmludGYgIkxvY2FsZTogJGxvY2FsZTsgZ2V0bG9jYWxlOiAlc1xuXG4iLCAkY29s bC0+Z2V0bG9jYWxlOwogIGZvciBteSAkbm9ybSAoIEAkbm9ybXMgKSB7CiAgICBwcmludCAiTm9y bWFsaXphdGlvbjogJG5vcm1cblxuIjsKICAgIG15IEBub3JtZWQgPSAkbm9ybSA/IChtYXAgeyBu b3JtYWxpemUgJG5vcm0sICRfIH0gQGxpbmVzKSA6IEBsaW5lczsKICAgIG15IEBzb3J0ZWQgPSAk Y29sbC0+c29ydChAbm9ybWVkKTsKICAgIHByaW50ICJTb3J0ZWQ6XG5cbkBzb3J0ZWRcblxuIjsK ICB9Cn0KCnNlbGVjdCBTVERPVVQ7CgpjbG9zZSAkZmg7CgpleGl0OwoK --000000000000826ee905bf5eae1a--