From: Roman Shaposhnick <rvs@sun.com>
To: Fans of the OS Plan 9 from Bell Labs <9fans@cse.psu.edu>
Subject: Re: [9fans] combining characters
Date: Fri, 19 May 2006 17:43:44 -0700
Message-ID: <20060520004344.GI14448@submarine>
In-Reply-To: <f1fcda236000fd7d3278cdc9b33a275c@quanstro.net>
On Fri, May 19, 2006 at 07:13:33PM -0500, quanstro@quanstro.net wrote:
> On Fri May 19 19:13:39 CDT 2006, rvs@sun.com wrote:
> > Since I'm no expert
> > in UNICODE I'm quite curious to know how one is supposed to
> > tell between a real character and a combination of a diacritic
> > and some other character when they are visually indistinguishable ?
>
> say i have a random accented letter.
There's no such thing as an accented letter in the Russian language.
That was the exact point of my initial remark.
Now, if you allow me to educate myself in Unicode a little bit,
I'm about to follow through with your example. Be patient with me ;-)
> suppose that U+x is the cp for the letter.
> suppose U+y is the cp for the accent.
Ok.
> suppose that we're lucky and there exists U+w ≡ U+xU+y.
Just to make sure I still follow: U+w is supposed to *visually*
look like U+x followed by U+y, right ?
> then U+w should be the same glyph as U+xU+y.
The same glyph from a visual standpoint, right ?
> canonical composition would yield
> compose(U+xU+y) -> U+w
> compose(U+w) -> U+w
> while canonical decomposition would yield
> decompose(U+xU+y) -> U+xU+y
> decompose(U+w) -> U+xU+y
And that's exactly the place where I think Unicode goes against common
sense and language rules. I would expect it to mandate that a *decomposable*
(precomposed) character be used in preference to its decomposition, which
was the case in your original example.
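For reference, the compose/decompose behaviour quoted above can be
sketched with Python's standard unicodedata module. This is only an
illustration under assumptions: nothing in this thread uses Python, the
helper names compose, decompose and codepoints are made up here, and
the letter chosen (U+00E9, which is canonically equivalent to U+0065
followed by U+0301) is just one convenient stand-in for U+w, U+x and U+y.

```python
import unicodedata


def compose(s: str) -> str:
    """Canonical composition (NFC); hypothetical helper name."""
    return unicodedata.normalize("NFC", s)


def decompose(s: str) -> str:
    """Canonical decomposition (NFD); hypothetical helper name."""
    return unicodedata.normalize("NFD", s)


def codepoints(s: str) -> str:
    """Render a string as a space-separated list of U+XXXX codepoints."""
    return " ".join(f"U+{ord(c):04X}" for c in s)


# Assumed stand-ins for the quoted example:
w = "\u00e9"         # U+w:    LATIN SMALL LETTER E WITH ACUTE
xy = "\u0065\u0301"  # U+xU+y: "e" followed by COMBINING ACUTE ACCENT

for label, s in (("U+w", w), ("U+xU+y", xy)):
    print(f"compose({label})   -> {codepoints(compose(s))}")
    print(f"decompose({label}) -> {codepoints(decompose(s))}")

# The two spellings are different codepoint sequences, so a plain
# comparison tells them apart even though they render identically.
print("raw equal:       ", w == xy)
print("equal after NFC: ", compose(w) == compose(xy))
```

Comparing the raw strings distinguishes the precomposed character from
the combining sequence; comparing after normalizing both sides to the
same form treats them as the same text.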
> > I would expect unicode to always favor single glyphs from a particular
> > page over anything else.
>
> it's always a single glyph. don't confuse letters, codepoints, and glyphs.
It is still a bit hard to not confuse letters and glyphs :-(
> i'll send you a png of the character. i don't have the books.
>
> what language rule are you trying to get at?
"There are no accents in the Russian language" (*)
Thanks,
Roman.
(*) well, except for a Ukrainian one ;-)