* sort order in registers (indexes)
@ 2018-06-10 9:49 Henning Hraban Ramm
2018-06-10 10:42 ` Pablo Rodriguez
0 siblings, 1 reply; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 9:49 UTC (permalink / raw)
To: mailing list for ConTeXt users
Ahoi,
usually, uppercase index entries are sorted before all lowercase entries.
Is there a simple setup to change that?
i.e. I need the sorting
sum - Sun - sunny
instead of
Sun - sum - sunny
I know I can influence sort order for single entries, but I’d like a general setting.
(Diacritics are handled as wanted.)
\starttext
\strut
\index{Sun}\index{sun}\index{Suomi}\index{suave}
\index{sunny}\index{sum}\index{Sumatra}\index{summon}
\index{sample}\index{super}
\index{şample}\index{südlich}\index{súper}
\index{şun}\index{sün}\index{şüñ}
\completeindex
\stoptext
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 9:49 sort order in registers (indexes) Henning Hraban Ramm
@ 2018-06-10 10:42 ` Pablo Rodriguez
2018-06-10 11:07 ` Henning Hraban Ramm
0 siblings, 1 reply; 10+ messages in thread
From: Pablo Rodriguez @ 2018-06-10 10:42 UTC (permalink / raw)
To: ntg-context
On 06/10/2018 11:49 AM, Henning Hraban Ramm wrote:
> Ahoi,
>
> usually, uppercase index entries are sorted before all lowercase entries.
> Is there a simple setup to change that?
>
> i.e. I need the sorting
> sum - Sun - sunny
> instead of
> Sun - sum - sunny
>
> I know I can influence sort order for single entries, but I’d like a general setting.
Hi Hraban,
I think this may achieve what you want:
\setupregister[index][method={zm,zc}]
I hope it helps,
Pablo
> (Diacritics are handled as wanted.)
>
>
> \starttext
> \strut
> \index{Sun}\index{sun}\index{Suomi}\index{suave}
> \index{sunny}\index{sum}\index{Sumatra}\index{summon}
> \index{sample}\index{super}
> \index{şample}\index{südlich}\index{súper}
> \index{şun}\index{sün}\index{şüñ}
>
> \completeindex
>
> \stoptext
>
>
>
> Greetlings, Hraban
> ---
> https://www.fiee.net
> http://wiki.contextgarden.net
> https://www.dreiviertelhaus.de
> GPG Key ID 1C9B22FD
>
> ___________________________________________________________________________________
> If your question is of interest to others as well, please add an entry to the Wiki!
>
> maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
> webpage : http://www.pragma-ade.nl / http://context.aanhet.net
> archive : https://bitbucket.org/phg/context-mirror/commits/
> wiki : http://contextgarden.net
> ___________________________________________________________________________________
>
--
http://www.ousia.tk
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 10:42 ` Pablo Rodriguez
@ 2018-06-10 11:07 ` Henning Hraban Ramm
2018-06-10 12:11 ` Henning Hraban Ramm
0 siblings, 1 reply; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 11:07 UTC (permalink / raw)
To: mailing list for ConTeXt users
Am 2018-06-10 um 12:42 schrieb Pablo Rodriguez <oinos@gmx.es>:
> On 06/10/2018 11:49 AM, Henning Hraban Ramm wrote:
>> Ahoi,
>>
>> usually, uppercase index entries are sorted before all lowercase entries.
>> Is there a simple setup to change that?
>>
>> i.e. I need the sorting
>> sum - Sun - sunny
>> instead of
>> Sun - sum - sunny
>>
>> I know I can influence sort order for single entries, but I’d like a general setting.
>
> Hi Hraban,
>
> I think this may achieve what you want:
>
> \setupregister[index][method={zm,zc}]
>
> I hope it helps,
Thank you very much!
I overlooked that there is indeed some documentation:
http://wiki.contextgarden.net/Command/keyword:method
(Source: http://repo.or.cz/w/context.git/blob/HEAD:/tex/context/base/sort-ini.lua)
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 11:07 ` Henning Hraban Ramm
@ 2018-06-10 12:11 ` Henning Hraban Ramm
2018-06-10 16:16 ` Henning Hraban Ramm
0 siblings, 1 reply; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 12:11 UTC (permalink / raw)
To: mailing list for ConTeXt users
Am 2018-06-10 um 13:07 schrieb Henning Hraban Ramm <texml@fiee.net>:
>> I think this may achieve what you want:
>> \setupregister[index][method={zm,zc}]
>> I hope it helps,
>
> Thank you very much!
>
> I overlooked that there is indeed some documentation:
> http://wiki.contextgarden.net/Command/keyword:method
But reading through that and the source I’m still confused.
Source snippet:
local validmethods = tohash {
"ch", -- raw character (for tracing)
"mm", -- minus mapping
"zm", -- zero mapping
"pm", -- plus mapping
"mc", -- lower case - 1
"zc", -- lower case
"pc", -- lower case + 1
"uc", -- unicode
}
local predefinedmethods = {
[v_default] = "zc,pc,zm,pm,uc",
[v_before] = "mm,mc,uc",
[v_after] = "pm,mc,uc",
[v_first] = "pc,mm,uc",
[v_last] = "pc,mm,uc",
}
I’d like to write a proper explanation for the wiki (and my book).
Is there anywhere documentation about the meaning/goal of the presets or algorithms?
Are these codes translatable into something like "ignore diacritics", "ignore upper/lowercase" etc.?
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 12:11 ` Henning Hraban Ramm
@ 2018-06-10 16:16 ` Henning Hraban Ramm
2018-06-10 18:09 ` Pablo Rodriguez
0 siblings, 1 reply; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 16:16 UTC (permalink / raw)
To: mailing list for ConTeXt users
Am 2018-06-10 um 14:11 schrieb Henning Hraban Ramm <texml@fiee.net>:
> Am 2018-06-10 um 13:07 schrieb Henning Hraban Ramm <texml@fiee.net>:
>
>>> I think this may achieve what you want:
>>> \setupregister[index][method={zm,zc}]
>>> I hope it helps,
>>
>> Thank you very much!
>>
>> I overlooked that there is indeed some documentation:
>> http://wiki.contextgarden.net/Command/keyword:method
>
> But reading through that and the source I’m still confused.
>
> I’d like to write a proper explanation for the wiki (and my book).
>
> Is there anywhere documentation about the meaning/goal of the presets or algorithms?
>
> Are these codes translatable into something like "ignore diacritics", "ignore upper/lowercase" etc.?
Ok, I think I got it...
For a proper sorting, you first apply a "mapping", then a "casing" and finally "unicode".
Presets:
default = upper like lowercase, diacritics separate
before = upper before lower, diacritics ignored
after = lower before upper, diacritics ignored
first = lower before upper, diacritics separate
last = upper before lower, diacritics separate
* If you don’t set the sorting method, the preset "first" ist used (and not "default").
* There’s no preset for the (in my eyes most meaningful) combination "upper like lowercase, diacritics ignored" (zm,zc,uc).
* Aren’t language specific sorting rules possible at the current state? Or does "unicode" handle that?
E.g.
-- DIN 5007-1 (German default sorting) is like zm,zc,uc, but ß should be sorted like ss.
-- DIN 5007-2 (German phonebook sorting) would additionally require umlauts to be sorted as ä = ae etc.
-- Austrian phonebook sorting sorts umlauts after base vocals, i.e. a, ä, o, ö, u, ü, s, ß.
-- Danish and Norwegian: x, y, z, æ, ø, å
-- Finnish and Swedish: x, y = ü, z, æ, ä, ö, ø, å (until 2006 v = w)
-- etc.
(according to https://de.wikipedia.org/wiki/Alphabetische_Sortierung)
If nobody objects I’ll add this to the wiki.
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 16:16 ` Henning Hraban Ramm
@ 2018-06-10 18:09 ` Pablo Rodriguez
2018-06-10 18:50 ` Henning Hraban Ramm
0 siblings, 1 reply; 10+ messages in thread
From: Pablo Rodriguez @ 2018-06-10 18:09 UTC (permalink / raw)
To: ntg-context
On 06/10/2018 06:16 PM, Henning Hraban Ramm wrote:
> [...]
> * Aren’t language specific sorting rules possible at the current state? Or does "unicode" handle that?
> E.g.
> -- DIN 5007-1 (German default sorting) is like zm,zc,uc, but ß should be sorted like ss.
> -- DIN 5007-2 (German phonebook sorting) would additionally require umlauts to be sorted as ä = ae etc.
> -- Austrian phonebook sorting sorts umlauts after base vocals, i.e. a, ä, o, ö, u, ü, s, ß.
> -- Danish and Norwegian: x, y, z, æ, ø, å
> -- Finnish and Swedish: x, y = ü, z, æ, ä, ö, ø, å (until 2006 v = w)
> -- etc.
> (according to https://de.wikipedia.org/wiki/Alphabetische_Sortierung)
sort-lan.lua contains different language definitions. Among others: DIN
5007-1, DIN 5007-2, Duden.
Languages "de-AT", "no", "da" and "sv" are ordered as you explain.
Swedish doesn’t contain ø (according to https://sv.wikipedia.org/wiki/Ø,
it is replaced with ö) or ü (it is a foreign letter to their alphabet).
and "fi" seems to order the way you describe. And v is different from w.
I would say, Finnish isn’t included in sort-lan.lua.
> If nobody objects I’ll add this to the wiki.
Please, it would be extremely helpful (I remember thinking that
registers didn’t make any sense in ConTeXt, before someone helped me).
Pablo
--
http://www.ousia.tk
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 18:09 ` Pablo Rodriguez
@ 2018-06-10 18:50 ` Henning Hraban Ramm
2018-06-10 19:07 ` Hans Hagen
2018-06-10 19:26 ` Henning Hraban Ramm
0 siblings, 2 replies; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 18:50 UTC (permalink / raw)
To: mailing list for ConTeXt users
Am 2018-06-10 um 20:09 schrieb Pablo Rodriguez <oinos@gmx.es>:
> On 06/10/2018 06:16 PM, Henning Hraban Ramm wrote:
>> [...]
>> * Aren’t language specific sorting rules possible at the current state? Or does "unicode" handle that?
>> E.g.
>> -- DIN 5007-1 (German default sorting) is like zm,zc,uc, but ß should be sorted like ss.
>> -- DIN 5007-2 (German phonebook sorting) would additionally require umlauts to be sorted as ä = ae etc.
>> -- Austrian phonebook sorting sorts umlauts after base vocals, i.e. a, ä, o, ö, u, ü, s, ß.
>> -- Danish and Norwegian: x, y, z, æ, ø, å
>> -- Finnish and Swedish: x, y = ü, z, æ, ä, ö, ø, å (until 2006 v = w)
>> -- etc.
>> (according to https://de.wikipedia.org/wiki/Alphabetische_Sortierung)
>
> sort-lan.lua contains different language definitions.
Ah, thanks for the hint. But how can I employ these definitions with index/list ordering?
Setting mainlanguage and method "*,uc" doesn’t seem to do the trick.
> Among others: DIN
> 5007-1, DIN 5007-2, Duden.
Hans, please add the replacement
{ "ß", "ss" }
to definitions['DIN 5007-1'] and definitions['DIN 5007-2']
Thank you!
> Swedish doesn’t contain ø (according to https://sv.wikipedia.org/wiki/Ø,
> it is replaced with ö) or ü (it is a foreign letter to their alphabet).
> and "fi" seems to order the way you describe. And v is different from w.
I wouldn’t touch it then, who knows how accurate German wikipedia is...
> I would say, Finnish isn’t included in sort-lan.lua.
Yes it is.
(http://source.contextgarden.net/tex/context/base/mkiv/sort-lan.lua)
> Please, it would be extremely helpful (I remember thinking that
> registers didn’t make any sense in ConTeXt, before someone helped me).
As soon as I understand how the language dependent sorting works...
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 18:50 ` Henning Hraban Ramm
@ 2018-06-10 19:07 ` Hans Hagen
2018-06-10 20:20 ` Wolfgang Schuster
2018-06-10 19:26 ` Henning Hraban Ramm
1 sibling, 1 reply; 10+ messages in thread
From: Hans Hagen @ 2018-06-10 19:07 UTC (permalink / raw)
To: mailing list for ConTeXt users, wolfgang Schuster
On 6/10/2018 8:50 PM, Henning Hraban Ramm wrote:
> Hans, please add the replacement
> { "ß", "ss" }
> to definitions['DIN 5007-1'] and definitions['DIN 5007-2']
wolfgang provided these so he has to give his blessing
Hans
-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | www.pragma-ade.nl | www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 18:50 ` Henning Hraban Ramm
2018-06-10 19:07 ` Hans Hagen
@ 2018-06-10 19:26 ` Henning Hraban Ramm
1 sibling, 0 replies; 10+ messages in thread
From: Henning Hraban Ramm @ 2018-06-10 19:26 UTC (permalink / raw)
To: mailing list for ConTeXt users
Am 2018-06-10 um 20:50 schrieb Henning Hraban Ramm <texml@fiee.net>:
> Ah, thanks for the hint. But how can I employ these definitions with index/list ordering?
> Setting mainlanguage and method "*,uc" doesn’t seem to do the trick.
Sorry, found it. In my test file there was still "language=cz" in the setup, and the language key is not documented.
> Hans, please add the replacement
> { "ß", "ss" }
> to definitions['DIN 5007-1'] and definitions['DIN 5007-2']
> Thank you!
Wikified:
http://wiki.contextgarden.net/Command/setupregister
http://wiki.contextgarden.net/Command/keyword:method
Greetlings, Hraban
---
https://www.fiee.net
http://wiki.contextgarden.net
https://www.dreiviertelhaus.de
GPG Key ID 1C9B22FD
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: sort order in registers (indexes)
2018-06-10 19:07 ` Hans Hagen
@ 2018-06-10 20:20 ` Wolfgang Schuster
0 siblings, 0 replies; 10+ messages in thread
From: Wolfgang Schuster @ 2018-06-10 20:20 UTC (permalink / raw)
To: Hans Hagen, Henning Hraban Ramm; +Cc: mailing list for ConTeXt users
[-- Attachment #1.1: Type: text/plain, Size: 331 bytes --]
> Hans Hagen <mailto:j.hagen@xs4all.nl>
> 10. Juni 2018 um 21:07
> On 6/10/2018 8:50 PM, Henning Hraban Ramm wrote:
>
> wolfgang provided these so he has to give his blessing
You can add them.
AFAIK the ß replacement was added later (no idea who sent it) because it
wasn’t in the patch I found in my mail archive.
Wolfgang
[-- Attachment #1.2: Type: text/html, Size: 1327 bytes --]
[-- Attachment #2: Type: text/plain, Size: 492 bytes --]
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!
maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage : http://www.pragma-ade.nl / http://context.aanhet.net
archive : https://bitbucket.org/phg/context-mirror/commits/
wiki : http://contextgarden.net
___________________________________________________________________________________
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2018-06-10 20:20 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-06-10 9:49 sort order in registers (indexes) Henning Hraban Ramm
2018-06-10 10:42 ` Pablo Rodriguez
2018-06-10 11:07 ` Henning Hraban Ramm
2018-06-10 12:11 ` Henning Hraban Ramm
2018-06-10 16:16 ` Henning Hraban Ramm
2018-06-10 18:09 ` Pablo Rodriguez
2018-06-10 18:50 ` Henning Hraban Ramm
2018-06-10 19:07 ` Hans Hagen
2018-06-10 20:20 ` Wolfgang Schuster
2018-06-10 19:26 ` Henning Hraban Ramm
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).