From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/63093 Path: news.gmane.org!not-for-mail From: Philipp Gesang Newsgroups: gmane.comp.tex.context Subject: Re: transliteration russian Date: Sat, 30 Oct 2010 00:47:50 +0200 Message-ID: <20101029224750.GB30892@aides> References: <6CB41398-8E44-433C-B4FA-1B98BC684A0E@st.estfiles.de> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0994747989==" X-Trace: dough.gmane.org 1288392497 28067 80.91.229.12 (29 Oct 2010 22:48:17 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Fri, 29 Oct 2010 22:48:17 +0000 (UTC) To: mailing list for ConTeXt users Original-X-From: ntg-context-bounces@ntg.nl Sat Oct 30 00:48:16 2010 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from balder.ntg.nl ([195.12.62.10]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PBxk3-0000Xa-Uf for gctc-ntg-context-518@m.gmane.org; Sat, 30 Oct 2010 00:48:12 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 8ABE9CA7ED; Sat, 30 Oct 2010 00:48:10 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id lr22x7pBoatV; Sat, 30 Oct 2010 00:48:07 +0200 (CEST) Original-Received: from balder.ntg.nl (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 63001CA7EA; Sat, 30 Oct 2010 00:48:07 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 01E01CA7EA for ; Sat, 30 Oct 2010 00:48:06 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 6H7uUzX39GgN for ; Sat, 30 Oct 2010 00:47:53 +0200 (CEST) Original-Received: from filter1-til.mf.surf.net (filter1-til.mf.surf.net [194.171.167.217]) by balder.ntg.nl (Postfix) with ESMTP id 90960CA7E9 for ; Sat, 30 Oct 2010 00:47:53 +0200 (CEST) Original-Received: from relay2.uni-heidelberg.de (relay2.uni-heidelberg.de [129.206.210.211]) by filter1-til.mf.surf.net (8.14.3/8.14.3/Debian-5+lenny1) with ESMTP id o9TMlqwG015850 for ; Sat, 30 Oct 2010 00:47:53 +0200 Original-Received: from ix.urz.uni-heidelberg.de (cyrus-portal.urz.uni-heidelberg.de [129.206.100.176]) by relay2.uni-heidelberg.de (8.13.8/8.13.8) with ESMTP id o9TMlHMV023403 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Sat, 30 Oct 2010 00:47:17 +0200 Original-Received: from extmail.urz.uni-heidelberg.de (extmail.urz.uni-heidelberg.de [129.206.100.140]) by ix.urz.uni-heidelberg.de (8.13.8/8.13.8) with ESMTP id o9TMlpvo015900 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Sat, 30 Oct 2010 00:47:51 +0200 Original-Received: from localhost (mnhm-4d0103a6.pool.mediaWays.net [77.1.3.166]) (authenticated bits=0) by extmail.urz.uni-heidelberg.de (8.13.4/8.13.1) with ESMTP id o9TMloIx019455 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES128-SHA bits=128 verify=NO) for ; Sat, 30 Oct 2010 00:47:51 +0200 In-Reply-To: X-Operating-System: Linux aides 2.6.35-ARCH X-Polite-Request: "Please try to be nice, don't send html mail." User-Agent: Mutt/1.5.20 (2009-06-14) X-Bayes-Prob: 0.0001 (Score 0, tokens from: @@RPTN) X-CanIt-Geo: ip=129.206.210.211; country=DE; region=01; city=Heidelberg; latitude=49.4167; longitude=8.7000; http://maps.google.com/maps?q=49.4167,8.7000&z=6 X-CanItPRO-Stream: uu:ntg-context@ntg.nl (inherits from uu:default, base:default) X-Canit-Stats-ID: 08DoKLQ41 - 70d13528e359 - 20101030 X-Scanned-By: CanIt (www . roaringpenguin . com) on 194.171.167.217 X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.12 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl Xref: news.gmane.org gmane.comp.tex.context:63093 Archived-At: --===============0994747989== Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="CUfgB8w4ZwR/yMy5" Content-Disposition: inline --CUfgB8w4ZwR/yMy5 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 2010-10-29 <23:25:20>, Mojca Miklavec wrote: >=20 > The best thing to do would be to transliterate Russian patterns into > Latin script (under one condition: transliteration needs to be > one-to-one; if one cyrillic glyph transliterates into two latin The one in question is rather a transcription (=E2=80=98romanization=E2=80= =99) than a transliteration, thus unfortunately there is no bijective mapping (e.g. =E2=80=98=D1=8F=E2=80=99->=E2=80=98ja=E2=80=99, =E2= =80=98=D1=88=E2=80=99->=E2=80=98sh=E2=80=99 etc.). It seems to be a hybrid between the standard Library of Congress-style transcription and an older ISO or =CE=93=CE=9F=D0=A1=D0=A2 transliteration.= Also, =E2=80=98j=E2=80=99 occurs in very odd positions. Whatever it is, we would need the complete transcription mapping. As others already pointed out, with a small number of strings Steffen might get acceptable results by using the patterns of a similar language. Although real transliterations work best with Czech or Slovak, this peculiar transcription might be better off with Polish or even (judging by the use of =E2=80=98sh=E2=80=99) standard English. @Steffen, if you could convince the author to supply the original Russian text and if he would agree to use a more common style, you could let the transliteration module do the job instead (http://bitbucket.org/phg/transliterator). > By far the easiest and most portable solution would be if you could > convince Taco to implement something like "latin a is equivalent to > cyrillic a as far as hyphenation is concerned" (which could also solve > many other problems that we have). +1. This would be a great feature. Good night all, Philipp >=20 > Mojca > _________________________________________________________________________= __________ > If your question is of interest to others as well, please add an entry to= the Wiki! >=20 > maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-co= ntext > webpage : http://www.pragma-ade.nl / http://tex.aanhet.net > archive : http://foundry.supelec.fr/projects/contextrev/ > wiki : http://contextgarden.net > _________________________________________________________________________= __________ --=20 () ascii ribbon campaign - against html e-mail /\ www.asciiribbon.org - against proprietary attachments --CUfgB8w4ZwR/yMy5 Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) iEYEARECAAYFAkzLTxYACgkQ02lYlJYWs9JExQCeOBBaYNE2D9/kov6uc0Dsuz/9 aPYAn0P61Apsj3m8OMzFrerDN51foi3M =wBe/ -----END PGP SIGNATURE----- --CUfgB8w4ZwR/yMy5-- --===============0994747989== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : http://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________ --===============0994747989==--