From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/17867 Path: main.gmane.org!not-for-mail From: Rob Ermers Newsgroups: gmane.comp.tex.context Subject: Re: (resolved) m-letter backslash codes in xml database Date: Sun, 16 Jan 2005 11:12:24 +0100 Message-ID: <41EA3E08.50009@hccnet.nl> References: <41E87274.7050104@hccnet.nl> <20050115132141.19896@news.comp.lancs.ac.uk> Reply-To: r.ermers@hccnet.nl, mailing list for ConTeXt users NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: sea.gmane.org 1105869968 24218 80.91.229.6 (16 Jan 2005 10:06:08 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Sun, 16 Jan 2005 10:06:08 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Sun Jan 16 11:06:00 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1Cq7IG-0004j0-00 for ; Sun, 16 Jan 2005 11:06:00 +0100 Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id EDCBC127C5; Sun, 16 Jan 2005 11:05:58 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (ronja.vet.uu.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 30754-02; Sun, 16 Jan 2005 11:05:56 +0100 (CET) Original-Received: from ronja.vet.uu.nl (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 9B7581277C; Sun, 16 Jan 2005 11:05:56 +0100 (CET) Original-Received: from localhost (localhost.localdomain [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id BF3F11277C for ; Sun, 16 Jan 2005 11:05:54 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (ronja.vet.uu.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 30535-04 for ; Sun, 16 Jan 2005 11:05:53 +0100 (CET) Original-Received: from hnexfe08.hetnet.nl (unknown [195.121.6.174]) by ronja.ntg.nl (Postfix) with ESMTP id BD1DB1277A for ; Sun, 16 Jan 2005 11:05:53 +0100 (CET) Original-Received: from [127.0.0.1] ([81.206.233.107]) by hnexfe08.hetnet.nl with Microsoft SMTPSVC(5.0.2195.6874); Sun, 16 Jan 2005 11:05:53 +0100 User-Agent: Mozilla Thunderbird 0.8 (Windows/20040913) X-Accept-Language: en-us, en Original-To: mailing list for ConTeXt users In-Reply-To: <20050115132141.19896@news.comp.lancs.ac.uk> X-OriginalArrivalTime: 16 Jan 2005 10:05:53.0593 (UTC) FILETIME=[F6C2D290:01C4FBB2] X-Virus-Scanned: by amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: by amavisd-new at ntg.nl Xref: main.gmane.org gmane.comp.tex.context:17867 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:17867 Thanks Adam, Now Hans' suggestions work. They are a bit lengthy, but I don't really=20 care for now. They are preferred to a number. For searching purposes in=20 the database, it is possible to introduce separate labels without=20 diacritics. Introducing scedilla in the file, as you suggested, seems not necessary: Ba soka. Then it took me a while, but the Turkish Idot also works. This sign has=20 several names (Idotabove, Iabovedot), and in Context it appears to be=20 called Idotaccent. Therefore produces the desired=20 result:

brahim Gngrm

nn Soka

ARI

Thanks again! Kind regards, Robert Adam Lindsay wrote: > Rob Ermers said this at Sat, 15 Jan 2005 02:31:32 +0100: >=20 >=20 >>h h extern wrote: >> >>>if you want to move things around in an encoding neutral way: >>> >>>\defineXMLsingular >>> [c] >>> [n=3Dunknowncharacter] >>> {\executeifdefined {\XMLop{id}} \unknowncharacter} >>> >>> >=20 >=20 > Um, I'm not sure, but I think Hans mixed two different attribute names: > the code won't work as is. It should probably be: >=20 > \defineXMLsingular > [c] > [n=3Dunknowncharacter] > {\executeifdefined {\XMLop{n}} \unknowncharacter} >=20 >=20 >=20 >>Thanks, Hans, for the example. I do agree, but I'm sorry I don't quite=20 >>understand how to apply this code. Could you be more specific? >=20 >=20 > It should be in a .tex file that is loaded during processing. You can d= o > this manually in your style file, or in (for example) a cont-loc.tex fi= le > that's in your path (e.g., the project directory or your tex/context/us= er ). >=20 >=20 >>Do I still have to use the unicode number, or is this a method to use=20 >>\"e again in the database? >=20 >=20 > This is so that you can substitute elements for entities in your > database. This is an XML database, so it's a lot more useful if you don= 't > restrict it to TeX-specific character entry. Entities (or elements) are= a > much better option, long-term. >=20 >=20 >>Should I code each diacritic in this way, such as ouml (=F6) in my=20 >>example:

J. Schöttelndreher

)? >=20 >=20 >

J. Schöttelndreher

or, if you \useXMLfilter[ent] : >

J. Sch&#ouml;ttelndreher

or, with Hans's definition: >

J. Schttelndreher

>=20 >>My solution by the way does not work with, e.g., a Turkish scedil=20 >>(U0015F) in my database, while Tex excells in all kinds of diacritics. >=20 >=20 > This is a slightly different issue. It's fairly easily fixed with a > couple additions to the style file/cont-loc file: >=20 > % You can use ş with: > \defineXMLentity [Scedil] {\Scedilla} > \defineXMLentity [scedil] {\scedilla} >=20 > % You can use ş or other unicode (decimal) numbers with: > \def\executeXMLdeccharacter#1\relax %=20 > {\utfunifontglyph{#1}} >=20 > (I was unable to come up with a satisfactory way of handling the hex > version thereof. I also haven't really tested the above function > overload--I'm sure someone can improve on this.)