From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/25523 Path: news.gmane.org!not-for-mail From: Vit Zyka Newsgroups: gmane.comp.tex.context Subject: Re: Sorting question Date: Tue, 24 Jan 2006 00:26:41 +0100 Message-ID: <43D56631.50104@seznam.cz> References: <43D2A70D.1000703@apostate.com> <1137925029.4142.3.camel@eagle.intern> <43D3BAA8.8030602@seznam.cz> <6faad9f00601221136s6a487603g1849b570f45cd2b2@mail.gmail.com> <43D41E8C.6080508@seznam.cz> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1138058821 18051 80.91.229.2 (23 Jan 2006 23:27:01 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Mon, 23 Jan 2006 23:27:01 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Tue Jan 24 00:26:57 2006 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1F1B5L-0005G8-QV for gctc-ntg-context-518@m.gmane.org; Tue, 24 Jan 2006 00:26:55 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 9DC6D127E6; Tue, 24 Jan 2006 00:26:54 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 23744-07; Tue, 24 Jan 2006 00:26:52 +0100 (CET) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 61447127B6; Tue, 24 Jan 2006 00:26:51 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 7A24F127B6 for ; Tue, 24 Jan 2006 00:26:48 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 23719-06-2 for ; Tue, 24 Jan 2006 00:26:47 +0100 (CET) Original-Received: from smtp.seznam.cz (smtp.seznam.cz [212.80.76.43]) by ronja.ntg.nl (Postfix) with SMTP id E0E22127A2 for ; Tue, 24 Jan 2006 00:26:46 +0100 (CET) Original-Received: (qmail 18938 invoked from network); 23 Jan 2006 23:26:46 -0000 Original-Received: from unknown (HELO ?127.0.0.1?) (vit.zyka@62.240.166.134) by cetus.go.seznam.cz with ESMTPA; 23 Jan 2006 23:26:46 -0000 User-Agent: Mozilla Thunderbird 1.0.6 (Windows/20050716) X-Accept-Language: en-us, en Original-To: mailing list for ConTeXt users In-Reply-To: X-Antivirus: avast! (VPS 0603-4, 20.01.2006), Outbound message X-Antivirus-Status: Clean X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.7 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:25523 Archived-At: Henning Hraban Ramm wrote: > Am 2006-01-23 um 01:08 schrieb Vit Zyka: > = >>> =E4 (adiaeresis) is identical to a, =F6 (odiaeresis) identical to o, = =FC >>> (udiaeresis) identical to u, the same for uppercase. =DF (ssharp) is >>> edentical to "ss" (same for uppercase, but in uppercase it's written >>> as SS anyway). >> >> >> Hmmm, that is not complete: I understand that every =FC, =DC, u, U come= s = >> to single group, but is u<=FC = > = > I didn't test your code, but u, =FC, U and =DC should be handled as same = > (in "normal German order") > and u=3DU, =FC=3Due=3D=DC=3DUe=3DUE in "German phone book order". Hmmm, I feel that the situation is more complicated (same as in Czech). = Proper sorting needs several (3 or 4, perhaps some languages more?) passes: 1. pass: division - define which letters comes to the same group (it can = be also group of letters) - defined for newtexutil.rb 2. pass: sorting with the simplified rules e.g. =FC=3Due=3D=DC=3DUe=3DUE 3. pass: if all letter are the same according the 2. pass, then apply = e.g. =FC Greetlings from Lake Constance! Enjoy it. Vit