From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/18469 Path: main.gmane.org!not-for-mail From: Mojca Miklavec Newsgroups: gmane.comp.tex.context Subject: Re: languages Date: Fri, 25 Feb 2005 04:09:45 +0100 Message-ID: <421E96F9.6020007@guest.arnes.si> References: <421CBBFD.3050908@wxs.nl> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1109300766 4398 80.91.229.2 (25 Feb 2005 03:06:06 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Fri, 25 Feb 2005 03:06:06 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Fri Feb 25 04:06:06 2005 Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1D4Vnj-0003Is-Ed for gctc-ntg-context-518@m.gmane.org; Fri, 25 Feb 2005 04:05:59 +0100 Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 12196127DD; Fri, 25 Feb 2005 04:09:58 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (ronja.vet.uu.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 20806-04; Fri, 25 Feb 2005 04:09:55 +0100 (CET) Original-Received: from ronja.vet.uu.nl (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 3419912795; Fri, 25 Feb 2005 04:09:55 +0100 (CET) Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 61DB812795 for ; Fri, 25 Feb 2005 04:09:53 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (ronja.vet.uu.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 20668-05 for ; Fri, 25 Feb 2005 04:09:51 +0100 (CET) Original-Received: from acheron.informatik.uni-muenchen.de (acheron.informatik.uni-muenchen.de [129.187.214.135]) by ronja.ntg.nl (Postfix) with ESMTP id 4ABF512792 for ; Fri, 25 Feb 2005 04:09:51 +0100 (CET) Original-Received: from internaldeliver.acheron.informatik.uni-muenchen.de (localhost [127.0.0.1]) by acheron.informatik.uni-muenchen.de (Postfix) with ESMTP id 3713143647; Fri, 25 Feb 2005 04:09:51 +0100 (CET) Original-Received: from [141.84.30.1] (b1.lmu.vpn.lrz-muenchen.de [141.84.30.1]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by acheron.informatik.uni-muenchen.de (Postfix) with ESMTP id 11657435CA; Fri, 25 Feb 2005 04:09:51 +0100 (CET) User-Agent: Mozilla/4.5-4.75 (Windows; U; Windows NT 5.1; sl-SI; rv:1.4) Gecko/20030624 Netscape/7.1 X-Accept-Language: sl, en, en-us, de Original-To: mailing list for ConTeXt users In-Reply-To: <421CBBFD.3050908@wxs.nl> X-Virus-Scanned: by amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: by amavisd-new at ntg.nl X-MailScanner-From: ntg-context-bounces@ntg.nl X-MailScanner-To: gctc-ntg-context-518@m.gmane.org Xref: main.gmane.org gmane.comp.tex.context:18469 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:18469 Hans Hagen wrote: > Hi, >=20 > Attached is an xml file that describes the hyphenation pattern files.=20 > I'd appreciate checking (some records are incomplete). I'd also like to= =20 > add (for each language) a couple of tricky hyphenatable words [for=20 > testing]. Preferable in utf-8 encoding. There is room for more comments= =20 > as well, like: prefered input and font encodings etc. >=20 > Hans Leon "Zlajpah should probably be changed to Leon \v{Z}lajpah (or =C5=BDla= jpah if in Unicode) (I suppose the information by itself is correct.) (There's a comment "Use of code page 852 in patterns", which probably=20 remained from the "old good DOS times".) Default encoding? 98% use cp1250, some Linux people use latin2. (well,=20 maybe there are still freaks somewhere on this planet using cp852 :) UTF 8 should be(come) standard, but it is coming pretty slowly. But writing UTF8 as the default encoding in ConTeXt should be OK. How can I try it (hyphenation)? I did \enableregime[utf] \mainlanguage[sl] \starttext =C5=BDelezni=C4=8Dar \showhyphens{=C5=BDelezni=C4=8Dar} \showhyphens{zeleznicar} \showhyphens{mojca pokrajculja} \stoptext It should be =C5=BEe-le-zni-=C4=8Dar (ze-le-zni-car) and moj-ca po-kraj-c= u-lja (in=20 latex "mojca" is hyphenated wrong anyway), but in ConTeXt the first two=20 don't get hyphenated at all and the second one becomes mo-j-ca=20 pokra-jcul-ja. I guess the Slovenian patterns are not loaded at all. One more question about language specific issues: we always write 1. first section 2. second section 2.1. subsection 2.2. some other subsection 2.2.1. some subsubsection (with a dot before the space after every (sub)section). What is the best=20 place to store the default settings to? (For all the other Slovenian=20 users as well.) Thank you, Mojca