ntg-context - mailing list for ConTeXt users
From: Henning Hraban Ramm <texml@fiee.net>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Subject: [NTG-context] index sort order (de)
Date: Tue, 14 May 2024 18:57:19 +0200
Message-ID: <3c34c54e-5c9a-4146-9f87-4034d3ef7af3@fiee.net>

Hi,
I checked the German index sorting rules, and as far as I can tell, all 
of them are somewhat wrong.

- de, default, or no setting (just mainlanguage de)
- Duden
- DIN 5007-1(:2005)
- DIN 5007-2(:1996)
- de-AT

IMO, "Duden" is obsolete and should be the same as default; it makes no 
sense to have a default setting that doesn’t adhere to the official 
rules. If I’m not mistaken, it’s also the same as DIN 5007-1; maybe there 
was a difference before 2005. (DIN 5007-2 and de-AT have their uses.)

The order of umlauts etc. should be ok.

[ß]
In tex/context/base/mkiv/sort-lan.lua, ß is replaced by s, while it 
should be sorted like ss in default and both DIN versions. (de-AT sorts 
ss after ß.)
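
To illustrate why the replacement matters, here is a minimal Python sketch. It is independent of ConTeXt and does not reproduce the code in sort-lan.lua; the helper name `sort_key` and the sample words are made up for this illustration. It compares mapping ß to s with mapping it to ss:

```python
def sort_key(word, eszett):
    """Lowercase the word and replace ß by the given string (illustrative helper)."""
    return word.lower().replace("ß", eszett)

words = ["Massage", "Maße", "Masern"]

# ß treated as a single s: "Maße" compares as "mase".
as_s = sorted(words, key=lambda w: sort_key(w, "s"))
# ß treated as ss: "Maße" compares as "masse".
as_ss = sorted(words, key=lambda w: sort_key(w, "ss"))

print(as_s)   # ['Maße', 'Masern', 'Massage']
print(as_ss)  # ['Masern', 'Massage', 'Maße']
```

With ß as s, "Maße" moves ahead of "Masern"; with ß as ss it lands after "Massage".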

[upper-/lowercase]
While casing is not mentioned in Wikipedia 
(https://de.wikipedia.org/wiki/Alphabetische_Sortierung), Duden „Satz 
und Korrektur“ says:

Uppercase characters get sorted the same as lowercase (i.e. mixed), but 
if both appear, lowercase comes first.

At the moment, all German settings sort lowercase after uppercase.
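
The casing rule quoted from Duden can be sketched as a two-level sort key. This is a plain Python illustration, not ConTeXt code, and `mixed_case_key` is a hypothetical name: the first level compares case-insensitively, the second level only breaks ties and puts lowercase first.

```python
def mixed_case_key(word):
    """Compare case-insensitively first, then put lowercase before uppercase."""
    # False sorts before True, so a lowercase letter wins a tie.
    return (word.lower(), tuple(ch.isupper() for ch in word))

words = ["Apfel", "aqua", "apfel", "Aqua"]
print(sorted(words, key=mixed_case_key))
# ['apfel', 'Apfel', 'aqua', 'Aqua']
```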



In general, the default method seems to be not "default" but "before". Does 
this make sense?

If we look at the method presets, there’s none for {zm,pm,zc,zm,uc} 
(upper like lower, ignore diacritics – basic default for German).

"""
\mainlanguage[de]
\setupregister[index][language=de]

\starttext

\index{Apfel}
\index{Äpfel}
\index{apfel}
\index{äpfel}
\index{aepfel}
\index{Aepfel}
\index{Ápfel}
\index{ápfel}
\index{aqua}

\page
\completeindex

\stoptext
"""

default order should be:

aepfel
Aepfel
apfel
Apfel
äpfel
Äpfel
ápfel
Ápfel
aqua

DIN 5007-2 sorts like:

aepfel
Aepfel
äpfel
Äpfel
apfel
Apfel
ápfel
Ápfel
aqua

de-AT sorts like:

aepfel
Aepfel
apfel
Apfel
aqua
äpfel
Äpfel
ápfel
Ápfel
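
The three orders above can be reproduced with a small Python sketch. It only models the test words from this mail and is not how ConTeXt implements sorting. The function names and the ranking of diacritics (plain letter, then umlaut, then any other mark) are assumptions inferred from the lists above; ß is not handled here.

```python
import unicodedata

UMLAUT = "\u0308"  # combining diaeresis

def split_char(ch):
    """Return (lowercase base letter, rank): 0 plain, 1 umlaut, 2 other mark."""
    decomposed = unicodedata.normalize("NFD", ch)
    base, marks = decomposed[0], decomposed[1:]
    rank = 0 if not marks else (1 if marks == UMLAUT else 2)
    return base.lower(), rank

def analyse(word):
    """Split a word into per-character (base, rank) pairs and a case tie-breaker."""
    word = unicodedata.normalize("NFC", word)
    parts = [split_char(ch) for ch in word]
    case = tuple(ch.isupper() for ch in word)  # lowercase (False) first
    return parts, case

def key_default(word):
    # Diacritics are ignored first and only break ties afterwards.
    parts, case = analyse(word)
    return "".join(b for b, _ in parts), tuple(r for _, r in parts), case

def key_din_5007_2(word):
    # Umlauts expand to base letter + e (ä -> ae); other marks are ignored first.
    parts, case = analyse(word)
    primary = "".join(b + "e" if r == 1 else b for b, r in parts)
    return primary, tuple(r for _, r in parts), case

def key_de_at(word):
    # A letter with a mark sorts after its plain base letter (ä after az).
    parts, case = analyse(word)
    return tuple(parts), case

words = ["Apfel", "Äpfel", "apfel", "äpfel", "aepfel",
         "Aepfel", "Ápfel", "ápfel", "aqua"]

for name, key in [("default", key_default),
                  ("DIN 5007-2", key_din_5007_2),
                  ("de-AT", key_de_at)]:
    print(name, sorted(words, key=key))
```

Each key is a tuple, so Python compares the levels in order: the later levels only matter when the earlier ones are equal.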




Hraban