From: Hans Hagen
Newsgroups: gmane.comp.tex.context
Subject: Re: (Con)TeX(t), Unicode and accented characters
Date: Mon, 20 Dec 2004 21:52:17 +0100
Message-ID: <41C73B81.1000707@wxs.nl>
References: <41C72FD6.3030807@email.si>
Reply-To: mailing list for ConTeXt users

Mojca Miklavec wrote:

> But when I switched to ConTeXt I came against that problem again.
>
> In LaTeX I used
> \v{c}\v{s}\v{z}

this also works in context

> at first, later
> \usepackage{csz} ...
> "c"s"z

in this case, i assume that csz makes " active and such; if you really
want that, we should make an enco-fcz, with definitions like:

\startlanguagespecifics[cz]
  \appendtoks \makecharacteractive " \to \everynormalcatcodes
  \installcompoundcharacter "c {\v{c}}
  \installcompoundcharacter "s {\v{s}}
  \installcompoundcharacter "z {\v{z}}
\stoplanguagespecifics

and alike; if you want utf, you should say (at the top of the file)

\enableregime[utf]

> As I didn't know how to use any other font, I always used CMR, the
> default, so I didn't have problems with exotic fonts either.

this should work with all fonts, since there are fallback definitions

> % output=pdf -translate-file=cp1250cs
> \setupbodyfont
> [csr,ams,rm]

try to avoid code pages

> What I don't really understand: why did the Czech TUG have to design
> *their own font*, csr, (or made changes to cmr) if accented characters
> worked perfectly already in plain TeX?

in cmr \v{s} is actually two characters, while in csr it's one
(composed) character (built of two characters but seen as one);
therefore when you use csr fonts, you can get proper hyphenation (which
is not the case in cmr, where the usage of the \accent primitive spoils
the game); next year, when i can assume that the new latin modern fonts
are available everywhere, i will drop cmr as default cum suis in favor
of lsr (which has cmr, plr, csr, vnr, aer etc included)

> The second problem: This works under Windows when typesetting in code
> page 1250. How can I use accented characters if text is typeset in
> Unicode (or latin2) in Linux?

you probably need to configure your editor to use utf

> The third problem: How do I typeset '\v{c}' in some other font? I do
> understand that it may not function in just any font since someone has
> to tell the computer how the accented characters are built, but as long
> as \v{c} works, there's no reason for
> \useencoding[utf8]
> and then continuing with unicode encoded characters not to produce the
> desired result.

don't worry, other fonts work ok; if an encoding does not support the
chars you need, a composed char is constructed; [font encodings have
nothing to do with input encoding but they do influence hyphenation]

if i'm right, ec, texnansi, and qx encoding all serve your purpose

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com
| www.pragma-pod.nl
-----------------------------------------------------------------
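
For reference, the advice in this reply can be put together as a minimal test file. This is only a sketch under stated assumptions: the file is saved as UTF-8 by the editor, and the \starttext ... \stoptext wrapper is the usual ConTeXt document body rather than something quoted from the message. Only \enableregime[utf] and the \v{...} accent commands come from the thread itself.

```tex
% Assumption: this file is saved as UTF-8 by the editor.
\enableregime[utf]   % from the reply: declare utf input at the top of the file

\starttext

% accent commands, as in plain TeX and LaTeX
\v{c}\v{s}\v{z}

% the same letters typed directly as UTF-8 characters
čšž

\stoptext
```

If the input regime and the editor encoding agree, both lines should give the same three letters; whether each one comes out as a single precomposed glyph or is built with an accent depends on the font encoding in use, as described in the reply.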