ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
From: Henning Hraban Ramm <texml@fiee.net>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Subject: Re: sort order in registers (indexes)
Date: Sun, 10 Jun 2018 18:16:50 +0200	[thread overview]
Message-ID: <E0300F8F-B8E6-4E62-89B4-17AA10410FFC@fiee.net> (raw)
In-Reply-To: <25A29D16-9128-4488-82B7-BB797543B2F6@fiee.net>

Am 2018-06-10 um 14:11 schrieb Henning Hraban Ramm <texml@fiee.net>:

> Am 2018-06-10 um 13:07 schrieb Henning Hraban Ramm <texml@fiee.net>:
> 
>>> I think this may achieve what you want:
>>> \setupregister[index][method={zm,zc}]
>>> I hope it helps,
>> 
>> Thank you very much!
>> 
>> I overlooked that there is indeed some documentation:
>> http://wiki.contextgarden.net/Command/keyword:method
> 
> But reading through that and the source I’m still confused.
> 
> I’d like to write a proper explanation for the wiki (and my book).
> 
> Is there anywhere documentation about the meaning/goal of the presets or algorithms?
> 
> Are these codes translatable into something like "ignore diacritics", "ignore upper/lowercase" etc.?

Ok, I think I got it...
For a proper sorting, you first apply a "mapping", then a "casing" and finally "unicode".

Presets:
default = upper like lowercase, diacritics separate
before  = upper before lower, diacritics ignored
after   = lower before upper, diacritics ignored
first   = lower before upper, diacritics separate
last    = upper before lower, diacritics separate

* If you don’t set the sorting method, the preset "first" ist used (and not "default").

* There’s no preset for the (in my eyes most meaningful) combination "upper like lowercase, diacritics ignored" (zm,zc,uc).

* Aren’t language specific sorting rules possible at the current state? Or does "unicode" handle that?
E.g.
-- DIN 5007-1 (German default sorting) is like zm,zc,uc, but ß should be sorted like ss.
-- DIN 5007-2 (German phonebook sorting) would additionally require umlauts to be sorted as ä = ae etc.
-- Austrian phonebook sorting sorts umlauts after base vocals, i.e. a, ä, o, ö, u, ü, s, ß.
-- Danish and Norwegian: x, y, z, æ, ø, å
-- Finnish and Swedish: x, y = ü, z, æ, ä, ö, ø, å (until 2006 v = w)
-- etc.
(according to https://de.wikipedia.org/wiki/Alphabetische_Sortierung)

If nobody objects I’ll add this to the wiki.

Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD

___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://context.aanhet.net
archive  : https://bitbucket.org/phg/context-mirror/commits/
wiki     : http://contextgarden.net
___________________________________________________________________________________

  reply	other threads:[~2018-06-10 16:16 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-06-10  9:49 Henning Hraban Ramm
2018-06-10 10:42 ` Pablo Rodriguez
2018-06-10 11:07   ` Henning Hraban Ramm
2018-06-10 12:11     ` Henning Hraban Ramm
2018-06-10 16:16       ` Henning Hraban Ramm [this message]
2018-06-10 18:09         ` Pablo Rodriguez
2018-06-10 18:50           ` Henning Hraban Ramm
2018-06-10 19:07             ` Hans Hagen
2018-06-10 20:20               ` Wolfgang Schuster
2018-06-10 19:26             ` Henning Hraban Ramm

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=E0300F8F-B8E6-4E62-89B4-17AA10410FFC@fiee.net \
    --to=texml@fiee.net \
    --cc=ntg-context@ntg.nl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).