From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/28558 Path: news.gmane.org!not-for-mail From: Peter Heslin Newsgroups: gmane.comp.tex.xetex,gmane.comp.tex.context Subject: Re: XeTeX, ConTeXt, and utf-8 hyphenation patterns. Date: Tue, 13 Jun 2006 12:41:38 +0100 Message-ID: <87ejxtmgel.fsf@heslin.eclipse.co.uk> References: <8764j6rpms.fsf@heslin.eclipse.co.uk> <448E685C.1020007@wxs.nl> Reply-To: Unicode-based TeX for Mac OS X and Linux NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1150199011 2533 80.91.229.2 (13 Jun 2006 11:43:31 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 13 Jun 2006 11:43:31 +0000 (UTC) Cc: ntg-context-wvrSQK3plZs@public.gmane.org Original-X-From: xetex-bounces-WUdSmCIlby8@public.gmane.org Tue Jun 13 13:43:26 2006 Return-path: Envelope-to: gctx-xetex-Uylq5CNFT+jYtjvyW6yDsg@public.gmane.org Original-Received: from dmz-169.daimi.au.dk ([130.225.2.169] helo=tug.org) by ciao.gmane.org with esmtp (Exim 4.43) id 1Fq7Ir-00073h-Sd for gctx-xetex-Uylq5CNFT+jYtjvyW6yDsg@public.gmane.org; Tue, 13 Jun 2006 13:43:25 +0200 Original-Received: from tug.org (localhost [127.0.0.1]) by tug.org (8.11.7-20030920/8.11.6) with ESMTP id k5DBhEC20487; Tue, 13 Jun 2006 13:43:14 +0200 Original-Received: from ciao.gmane.org (main.gmane.org [80.91.229.2]) by tug.org (8.11.7-20030920/8.11.6) with ESMTP id k5DBh4C20466 for ; Tue, 13 Jun 2006 13:43:04 +0200 Original-Received: from list by ciao.gmane.org with local (Exim 4.43) id 1Fq7IA-0006v4-K0 for xetex-WUdSmCIlby8@public.gmane.org; Tue, 13 Jun 2006 13:42:45 +0200 Original-Received: from 213-152-32-235.dsl.eclipse.net.uk ([213.152.32.235]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 13 Jun 2006 13:42:42 +0200 Original-Received: from pj by 213-152-32-235.dsl.eclipse.net.uk with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 13 Jun 2006 13:42:42 +0200 X-Injected-Via-Gmane: http://gmane.org/ Original-To: xetex-WUdSmCIlby8@public.gmane.org Original-Lines: 24 Original-X-Complaints-To: usenet@sea.gmane.org X-Gmane-NNTP-Posting-Host: 213-152-32-235.dsl.eclipse.net.uk User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (gnu/linux) Cancel-Lock: sha1:CIuX1Izs7OGnHxAOwf4jJXqAiFc= X-DAIMI-Spam-Score: -2.599 () BAYES_00 X-Scanned-By: MIMEDefang 2.56 on 130.225.16.26 X-Scanned-By: MIMEDefang 2.56 on 130.225.2.178 X-BeenThere: xetex-WUdSmCIlby8@public.gmane.org X-Mailman-Version: 2.1.8 Precedence: list List-Id: Unicode-based TeX for Mac OS X and Linux List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: xetex-bounces-WUdSmCIlby8@public.gmane.org Errors-To: xetex-bounces-WUdSmCIlby8@public.gmane.org Xref: news.gmane.org gmane.comp.tex.xetex:1964 gmane.comp.tex.context:28558 Archived-At: Hans Hagen writes: > ctxtools --pat [en nl agr ...] > ctxtools --pat --utf [en nl agr ...] > > the greek conversions were done with the help of a greek language users > on the context list, so in case of troubles, so i cc there; bugs need to > be fixed indeed Thanks for the tips. I have taken a closer look at the Greek patterns, and it seems as though they have not only small problems, but also major problems. (They will fail to find most hyphenation points before accented vowels.) I will try to come up with a patch, but I don't know any Ruby, so it will be an interesting challenge -- the changes required go beyond tweaking the existing code. The characters in the file lang-agr.pat are precomposed, Unicode normalization form D. But I'd like to support both normalization forms C and D, if possible, in the same pattern file. Is that goal compatible with Context? -- Peter Heslin (http://www.dur.ac.uk/p.j.heslin)