From: Vit Zyka
Newsgroups: gmane.comp.tex.context
Subject: Re: cz support
Date: Wed, 26 Jan 2005 11:17:51 +0100
Message-ID: <41F76E4F.5060000@seznam.cz>
References: <41F580A3.90203@seznam.cz> <41F6A3F9.1040704@wxs.nl>
In-Reply-To: <41F6A3F9.1040704@wxs.nl>
Reply-To: mailing list for ConTeXt users

h h extern wrote:
> Vit Zyka wrote:
>
>> Hallo Hans and other ConTeXters,
>>
>> I have started to collect and clean up the Czech ConTeXt localization.
>> I have discovered that Tomas Hudec included Czech (non-trivial)
>> sorting rules in texutils.pl (calling czsort.pl from Jan Pazdziora).
>> Unfortunately, this was not incorporated into the official ConTeXt
>> distribution.
>>
>> Hans, what is your preferred solution for local sorting rules?
>> I know there is \definesortkey, but this mechanism is insufficient
>> for Czech. I think that calling an external Perl sorting routine is a
>> suitable solution. If so, can it be incorporated into the current
>> version? Files are attached.

Hallo Hans,

thank you for your positive response. Since I am still a bit confused by
ConTeXt's low- and high-level encoding macros (though a bit less so after
solving last week's UTF-8 -> IL2 problem, even though the cause turned
out to be my old ConTeXt distribution), I need a few more details about
how to contribute.

> in order to do that:
>
> - the pm code should move into texutil
> - we should find a way to hook the switch into texexec
>
> now, apart from such an effort, i think we should rethink sorting;
>
> - no funny encodings which demand the sorter to be encoding aware
> - no locale dependencies
> - readable sorting code -)

OK. So each set of sorting rules will be represented by a cmp($a,$b)
function? And $a and $b should be glyphs, shouldn't they?

> steps:
>
> - write named glyphs to the output:
> - no longer use direct code in definecommand and defineaccent, i.e.
> - define the fallbacks as 'definecharacter ones'

Sorry, output of what? Where are the commands? ???

> the last two steps actually come first (since adam l. is quite fluent in
> the character names and such, i guess that the both of us need to look
> into that and do that in 'one move'); after that i can rewrite the
> sort/index code to become aware of named glyphs and languages, and we
> can then plug in language specific sorting rules [taking care of xml at
> the same time]

Great.

> [it's worth considering to drop the non etex code]

All right. etex is available everywhere now.

> named glyphs are not so much a problem, but we need to clean up the
> fallback encoding defs a bit in order to prevent funny expansion

I do not understand at all what the code below illustrates. So, what
should I do?

Vit Zyka

> \def\literateencodedtokens
>   {% \let\dohandleaccent \keephandleaccent
>    % \let\dohandlecommand \keephandlecommand
>    \let\dohandlecharacter\keephandlecharacter}
>
> \def\convertmeaning#1\to % watch the double expansion !
>   {\bgroup
>    \honorunexpanded
>    %\dontexpandencoding % new
>    \literateencodedtokens % newer
>    \xdef\@@globalexpanded{#1}%
>    \xdef\@@globalexpanded{\@@globalexpanded}%
>    \egroup
>    \convertcommand\@@globalexpanded\to}
>
> \protect
>
> \enableencoding[texnansi]
> \setupregister[index][keyexpansion=yes]
>
> \starttext
>
> \defineactivecharacter * {\eacute}
> test \index{test \eacute * \'e test}
>
> \stoptext
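
To make the cmp($a,$b) question above concrete, here is a minimal sketch of
what a language-specific comparison over named glyphs might look like. It is
written in Python purely for illustration (the thread itself discusses Perl),
and everything in it is an assumption: the names czech_key and czech_cmp, the
representation of an index entry as a list of glyph names, and the rank
tables, which cover only a simplified subset of Czech ordering (lowercase
letters, "ch" as one letter after "h", ccaron/rcaron/scaron/zcaron as letters
of their own, other accents as tie-breakers only). It is not the czsort.pl
algorithm and not ConTeXt's actual sorting code.

```python
from functools import cmp_to_key

# Hypothetical primary alphabet, given as glyph names. "ch" is a single
# letter that follows "h"; ccaron, rcaron, scaron, zcaron are separate letters.
PRIMARY = [
    "a", "b", "c", "ccaron", "d", "e", "f", "g", "h", "ch", "i", "j", "k",
    "l", "m", "n", "o", "p", "q", "r", "rcaron", "s", "scaron", "t", "u",
    "v", "w", "x", "y", "z", "zcaron",
]
PRIMARY_RANK = {name: rank for rank, name in enumerate(PRIMARY)}

# Accented glyphs that sort with their base letter and only break ties.
SECONDARY = {
    "aacute": ("a", 1), "dcaron": ("d", 1), "eacute": ("e", 1),
    "ecaron": ("e", 2), "iacute": ("i", 1), "ncaron": ("n", 1),
    "oacute": ("o", 1), "tcaron": ("t", 1), "uacute": ("u", 1),
    "uring": ("u", 2), "yacute": ("y", 1),
}


def czech_key(glyphs):
    """Return (primary ranks, secondary weights) for a list of glyph names."""
    primary, secondary = [], []
    i = 0
    while i < len(glyphs):
        name = glyphs[i]
        # Merge the digraph c + h into the single letter "ch".
        if name == "c" and i + 1 < len(glyphs) and glyphs[i + 1] == "h":
            name = "ch"
            i += 1
        base, weight = SECONDARY.get(name, (name, 0))
        primary.append(PRIMARY_RANK[base])  # KeyError for unknown glyph names
        secondary.append(weight)
        i += 1
    return primary, secondary


def czech_cmp(a, b):
    """Three-way comparison in the style of Perl's cmp: returns -1, 0 or 1."""
    key_a, key_b = czech_key(a), czech_key(b)
    return (key_a > key_b) - (key_a < key_b)


entries = [
    ["h", "r", "a", "d"],        # hrad
    ["c", "h", "a", "t", "a"],   # chata
    ["c", "u", "k", "r"],        # cukr
    ["ccaron", "a", "j"],        # čaj
    ["i", "b", "i", "s"],        # ibis
]
sorted_entries = sorted(entries, key=cmp_to_key(czech_cmp))
# Resulting order: cukr, čaj, hrad, chata, ibis
```

Because the key is built from glyph names rather than byte values, the
comparison needs neither an input encoding nor a system locale, which is in
line with the wish list quoted above.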