From mboxrd@z Thu Jan 1 00:00:00 1970
From: Hans Hagen
Newsgroups: gmane.comp.tex.context
Subject: Re: Some Ethiopic examples (hyphenation/breaking) in ConTeXt
Date: Sat, 07 May 2011 13:20:27 +0200
Message-ID: <4DC52AFB.5050604@wxs.nl>
Reply-To: mailing list for ConTeXt users
To: mailing list for ConTeXt users
Cc: Adam McCollum, Mojca Miklavec

On 6-5-2011 10:00, Mojca Miklavec wrote:

> We were originally preparing the example for XeTeX (which behaves very
> weird anyway) and I would like to know how to typeset Ethiopic text in
> ConTeXt.

Let's forget about xetex then. It's not that complex to add to mkiv as
we have mechanisms in place for it. What is the otf language / script
code?

> The basic requirements are:
>
> - Words may be split after any character (character = syllable; it's
> in the range "1200-"139F), but not before word/sentence dividers. (We
> have hyphenation patterns, but one could just as well use some other
> mechanism to break.)
>
> - "1361 and "1362 are word dividers and sentence dividers.
>
> - One doesn't use spaces when writing.

Like in cjk.

> - In output one should get something like space (approximately the
> same width) before and something like space after word/sentence
> divider, except that the "space" before divider should not be
> breakable; I highly suspect that the amount of space before/after
> dividers depends on the font being used, but I may be wrong.

so let's visualize that:

[1200][1200][1200][1361][1200][1200][1200][1362][1200][1200][1200]

valid breakpoints:

[1200] [1200] [1200][nbsp][1200] [1200] [1200][nbsp][1200] [1200] [1200]

Is that okay? How about spaces in the input (end of lines introduce
them)?

> - Text should be nicely justified (I wonder if microtypography would also help).

That is independent of the logic.

> I'm attaching a sample text that does approximately what I expect it
> to do, but I would like to avoid active characters, make the space
> before and after divider of equal size and I'm not sure what is the
> most appropriate approach in ConTeXt. The example also leaves a bit
> too much whitespace after dividers that end the line.

Nothing attached.

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com
| www.pragma-pod.nl
-----------------------------------------------------------------
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________
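
As a footnote to the thread, the breaking requirements quoted in the message above can be sketched outside TeX. This is only an illustration in Python, not the MkIV mechanism and not anything ConTeXt provides: the function names `break_positions` and `add_divider_spacing` are hypothetical. As a simplification, it treats every code point in the quoted range "1200-"139F other than the two dividers as a syllable, and it uses U+00A0 and an ordinary space as stand-ins for the font-dependent spacing around dividers.

```python
# Sketch of the quoted requirements (assumptions are marked in the comments).
SYLLABLE_FIRST, SYLLABLE_LAST = 0x1200, 0x139F   # range given in the thread
DIVIDERS = {0x1361, 0x1362}                      # word and sentence dividers


def is_divider(ch):
    return ord(ch) in DIVIDERS


def is_syllable(ch):
    # Simplification: anything in the range that is not a divider counts
    # as a syllable, including other punctuation in that range.
    return SYLLABLE_FIRST <= ord(ch) <= SYLLABLE_LAST and not is_divider(ch)


def break_positions(text):
    """Return every index i where a break between text[i] and text[i+1] is allowed."""
    positions = []
    for i in range(len(text) - 1):
        cur, nxt = text[i], text[i + 1]
        if is_divider(nxt):
            continue                  # never break before a divider
        if is_syllable(cur) or is_divider(cur):
            positions.append(i)       # break after a syllable or after a divider
    return positions


def add_divider_spacing(text):
    """Put a non-breaking space before and a breakable space after each divider."""
    pieces = []
    for ch in text:
        if is_divider(ch):
            pieces.append("\u00A0" + ch + " ")   # assumed stand-ins for the real glue
        else:
            pieces.append(ch)
    return "".join(pieces)


if __name__ == "__main__":
    # Same shape as the visualized input: three syllables, divider, and so on.
    sample = "\u1200" * 3 + "\u1361" + "\u1200" * 3 + "\u1362" + "\u1200" * 3
    print(break_positions(sample))
    print(add_divider_spacing(sample))
```

In a real implementation the spacing would more likely be stretchable glue sized from the font than literal space characters, and spaces already present in the input would need a decision of their own, as asked in the message.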