On 2010-10-29 <23:25:20>, Mojca Miklavec wrote: > > The best thing to do would be to transliterate Russian patterns into > Latin script (under one condition: transliteration needs to be > one-to-one; if one cyrillic glyph transliterates into two latin The one in question is rather a transcription (‘romanization’) than a transliteration, thus unfortunately there is no bijective mapping (e.g. ‘я’->‘ja’, ‘ш’->‘sh’ etc.). It seems to be a hybrid between the standard Library of Congress-style transcription and an older ISO or ΓΟСТ transliteration. Also, ‘j’ occurs in very odd positions. Whatever it is, we would need the complete transcription mapping. As others already pointed out, with a small number of strings Steffen might get acceptable results by using the patterns of a similar language. Although real transliterations work best with Czech or Slovak, this peculiar transcription might be better off with Polish or even (judging by the use of ‘sh’) standard English. @Steffen, if you could convince the author to supply the original Russian text and if he would agree to use a more common style, you could let the transliteration module do the job instead (http://bitbucket.org/phg/transliterator). > By far the easiest and most portable solution would be if you could > convince Taco to implement something like "latin a is equivalent to > cyrillic a as far as hyphenation is concerned" (which could also solve > many other problems that we have). +1. This would be a great feature. Good night all, Philipp > > Mojca > ___________________________________________________________________________________ > If your question is of interest to others as well, please add an entry to the Wiki! > > maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context > webpage : http://www.pragma-ade.nl / http://tex.aanhet.net > archive : http://foundry.supelec.fr/projects/contextrev/ > wiki : http://contextgarden.net > ___________________________________________________________________________________ -- () ascii ribbon campaign - against html e-mail /\ www.asciiribbon.org - against proprietary attachments