From: Vit Zyka <vit.zyka@seznam.cz>
Subject: Re: Sorting question
Date: Tue, 24 Jan 2006 00:26:41 +0100 [thread overview]
Message-ID: <43D56631.50104@seznam.cz> (raw)
In-Reply-To: <E91BA656-FBE3-47F8-941C-053FE9583FAA@fiee.net>
Henning Hraban Ramm wrote:
> Am 2006-01-23 um 01:08 schrieb Vit Zyka:
>
>>> ä (adiaeresis) is identical to a, ö (odiaeresis) identical to o, ü
>>> (udiaeresis) identical to u, the same for uppercase. ß (ssharp) is
>>> edentical to "ss" (same for uppercase, but in uppercase it's written
>>> as SS anyway).
>>
>>
>> Hmmm, that is not complete: I understand that every ü, Ü, u, U comes
>> to single group, but is u<ü<U<Ü? Let say yes. Then try
>
>
> I didn't test your code, but u, ü, U and Ü should be handled as same
> (in "normal German order")
> and u=U, ü=ue=Ü=Ue=UE in "German phone book order".
Hmmm, I feel that the situation is more complicated (same as in Czech).
Proper sorting needs several (3 or 4, perhaps some languages more?) passes:
1. pass: division - define which letters comes to the same group (it can
be also group of letters) - defined for newtexutil.rb
2. pass: sorting with the simplified rules e.g. ü=ue=Ü=Ue=UE
3. pass: if all letter are the same according the 2. pass, then apply
e.g. ü<ue<Ü<Ue<UE
4. ??? (perhaps problems with Czech 'Ch').
After that:
'Üb' < 'üz' < 'Üz'
> Greetlings from Lake Constance!
Enjoy it.
Vit
next prev parent reply other threads:[~2006-01-23 23:26 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-01-21 21:26 AutoDroppedCaps and indenting Joshua Ellis
2006-01-22 10:17 ` Sorting question Bernd Militzer
2006-01-22 17:02 ` Vit Zyka
2006-01-22 19:36 ` Mojca Miklavec
2006-01-23 0:06 ` Henning Hraban Ramm
2006-01-23 0:08 ` Vit Zyka
2006-01-23 0:19 ` Henning Hraban Ramm
2006-01-23 23:26 ` Vit Zyka [this message]
2006-01-23 18:55 ` AutoDroppedCaps and indenting Joshua Ellis
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=43D56631.50104@seznam.cz \
--to=vit.zyka@seznam.cz \
--cc=ntg-context@ntg.nl \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).