9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
From: akumar@mail.nanosouffle.net
To: 9fans@9fans.net
Subject: [9fans] Woes of New Language Support
Date: Sat, 25 Jul 2009 18:55:03 -0700	[thread overview]
Message-ID: <94eebbb5893e0c503dd154d9cbc46852@mail.nanosouffle.net> (raw)

I've been trying to add support for Sanskrit derived
languages, but just rendering the characters has halted
progress.  For the currently supported languages,
such as English, Russian, Greek, French, even Japanese, the
characters are more or less statically mapped to the unicode
(looking at my $font again, I see that Kanji bitmaps are
perhaps mapped to unspecified unicode ranges?).

However, in the class of languages for which I am trying to
provide support, certain characters are meant to be produced
by an ordered combination of other characters.  For example,
the general sequence in Devanagari script (and this extends
to the other scripts as well) is that
consonant+virama+consonant produces
half-consonant+consonant, where the half-consonant has no
other unicode specification.  As a concrete case in
Devanagari, na virama sa (viz., \u0928\u094d\u0938) should
produce the nsa character (this sequence can be seen in any
unicode representation of the word "Sanskrit" in Devanagari
script).

It seems to me that TTF font specifications (i.e., those I
converted to subfonts using Federico's ttf2subf) include
these sequence definitions, which are then processed by each
application providing support for the fonts.  Plan 9
subfonts are much too simple for this.

So, in this case, what are some ideas towards representing
the above?  I've thought about a ktrans-alike that perhaps
filters the data rio gets and processes it for these sort of
things, but it doesn't seem to be the best possible way to
proceed.  If we can even get past this hurdle, I'd be more
than happy to patch ktrans for input support for this class
of languages.


Thanks,
ak




             reply	other threads:[~2009-07-26  1:55 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-07-26  1:55 akumar [this message]
2009-07-26  5:08 ` erik quanstrom
2009-07-26  7:41   ` andrey mirtchovski
2009-07-26 14:32     ` erik quanstrom
2009-07-28 10:39       ` Charles Forsyth
2009-07-28 14:11         ` Ethan Grammatikidis
2009-07-28 14:52           ` John Floren
2009-07-28 17:46             ` Ethan Grammatikidis
2009-07-26  9:04   ` Salman Aljammaz
2009-07-26 13:48     ` erik quanstrom
2009-07-26 14:12       ` tlaronde
2009-07-26 14:24         ` erik quanstrom
2009-07-26 17:56       ` Nathaniel W Filardo
2009-07-26 18:39       ` Jack Johnson
2009-07-27  0:28         ` erik quanstrom
2009-07-26 11:43 Akshat Kumar
2009-07-26 12:01 Akshat Kumar

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=94eebbb5893e0c503dd154d9cbc46852@mail.nanosouffle.net \
    --to=akumar@mail.nanosouffle.net \
    --cc=9fans@9fans.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).