From mboxrd@z Thu Jan  1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/9099
Path: main.gmane.org!not-for-mail
From: Hans Hagen <pragma@wxs.nl>
Newsgroups: gmane.comp.tex.context
Subject: Re: hyphenation of words with accented characters
Date: Thu, 29 Aug 2002 23:28:32 +0200
Sender: owner-ntg-context@let.uu.nl
Message-ID: <5.1.0.14.1.20020829232155.02003050@server-1>
References: <20020828.134230.26700639.MarkoSchuetz@web.de>
NNTP-Posting-Host: coloc-standby.netfonds.no
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"; format=flowed
Content-Transfer-Encoding: 8bit
X-Trace: main.gmane.org 1035399454 673 80.91.224.250 (23 Oct 2002 18:57:34 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Wed, 23 Oct 2002 18:57:34 +0000 (UTC)
Cc: ntg-context@ntg.nl
Original-To: Uwe Koloska <koloska@voiceinterconnect.de>
In-Reply-To: <3D6DD3D6.4010801@voiceinterconnect.de>
Xref: main.gmane.org gmane.comp.tex.context:9099

At 09:57 AM 8/29/2002 +0200, Uwe Koloska wrote:
>Marko Schütz wrote:
>>\hyphenation{Äqui-va-lenz-re-la-tion}
>>as well as
>>\hyphenation{"Aqui-va-lenz-re-la-tion}
>>with both I get
>>! Improper \hyphenation will be flushed.
>>\handlecompoundcharacter 
>>#1->\def                                   \xhandlecompoundcharacter 
>>{\dohandlecompou...
>>l.3 \hyphenation{"
>>                   Aqui-va-lenz-re-la-tion}
>
>\hyphenation can only handle single characters, not compound ones.  So 
>you have to switch to an input encoding that doesn't touch umlauts or 
>other non-ASCII chars.  For this to work, TeX has to handle these 
>characters as follows:
>   1. leave 'Ä' untouched
>   2. use the latin1 to TeX (or type1 or ...) output encoding to
>      print the char to output.
>
>Unfortunately I tried this myself and didn't have any success. It seems 
>that all input encodings in context change non-ASCII chars to compound 
>characters.

let's be of help then:

given that you have 8 bit characters (i.e. ec or texnansi encoded fonts), 
you can use the named glyphs inside the exception:

\hyphenation   {f\oacute-r\egrave ign}
\hyphenatedword{f\oacute-r\egrave ign}
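
applied to the word from the question, this would presumably come down to 
something like the lines below; this is only a sketch that mirrors the two 
lines above, it is not tested here, and it assumes that the named glyph 
\Adiaeresis (a name not mentioned earlier in this thread) is defined for 
the font encoding in use

% assumption: \Adiaeresis is the named glyph for the 8 bit character 'Ä'
% first line: register the exception; second line: show the break points
\hyphenation   {\Adiaeresis qui-va-lenz-re-la-tion}
\hyphenatedword{\Adiaeresis qui-va-lenz-re-la-tion}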

btw 1, when loading patterns, context goes through a lot of trouble to make 
sure that 8 bit patterns are handled ok; for some languages, multiple 
pattern sets are loaded (like il2 and ec for czech), which means that the 
patterns even switch properly between different coding regimes

btw 2, if you think that fonts are a mess, take a look at pattern files: all 
kinds of methods, naming, etc

Hans

-------------------------------------------------------------------------
                                   Hans Hagen | PRAGMA ADE | pragma@wxs.nl
                       Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
  tel: +31 (0)38 477 53 69 | fax: +31 (0)38 477 53 74 | www.pragma-ade.com
-------------------------------------------------------------------------
                        information: http://www.pragma-ade.com/roadmap.pdf
                     documentation: http://www.pragma-ade.com/showcase.pdf
-------------------------------------------------------------------------