From: Hans Hagen
Newsgroups: gmane.comp.tex.context
Subject: Re: hyphenation of words with accented characters
Date: Thu, 29 Aug 2002 23:28:32 +0200
Message-ID: <5.1.0.14.1.20020829232155.02003050@server-1>
References: <20020828.134230.26700639.MarkoSchuetz@web.de>
In-Reply-To: <3D6DD3D6.4010801@voiceinterconnect.de>
Original-To: Uwe Koloska
Cc: ntg-context@ntg.nl

At 09:57 AM 8/29/2002 +0200, Uwe Koloska wrote:
>Marko Schütz wrote:
>>\hyphenation{Äqui-va-lenz-re-la-tion}
>>as well as
>>\hyphenation{"Aqui-va-lenz-re-la-tion}
>>with both I get
>>! Improper \hyphenation will be flushed.
>>\handlecompoundcharacter
>>#1->\def \xhandlecompoundcharacter
>>{\dohandlecompou...
>>l.3 \hyphenation{"
>> Aqui-va-lenz-re-la-tion}
>
>\hyphenation can only handle single characters and not compound ones. So
>you have to switch to an input encoding that doesn't touch umlauts or
>other non-ASCII chars. For it to work, TeX has to handle these characters
>as follows:
> 1. leave 'Ä' untouched
> 2. use the latin1 to TeX (or type1 or ...) output encoding to
>    print the char to output.
>
>Unfortunately I tried this myself and didn't have any success. It seems
>that all input encodings in context change non-ASCII chars to compound
>characters.

let's be of help then: given that you have 8 bit characters (i.e.
ec or texnansi encoded fonts)

\hyphenation {f\oacute-r\egrave ign}
\hyphenatedword{f\oacute-r\egrave ign}

btw 1, when loading patterns, context goes through a lot of trouble to make
sure that 8 bit patterns are handled ok; in some languages, multiple
patterns are loaded (like il2 and ec in czech), which means that they even
switch properly with different coding regimes

btw 2, if you think that fonts are a mess, take a look at pattern files: all
kinds of methods, naming, etc

Hans

-------------------------------------------------------------------------
Hans Hagen | PRAGMA ADE | pragma@wxs.nl
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com
-------------------------------------------------------------------------
information: http://www.pragma-ade.com/roadmap.pdf
documentation: http://www.pragma-ade.com/showcase.pdf
-------------------------------------------------------------------------