The Unix Heritage Society mailing list
From: Rob Pike <robpike@gmail.com>
To: Ralph Corderoy <ralph@inputplus.co.uk>
Cc: TUHS main list <tuhs@minnie.tuhs.org>
Subject: Re: [TUHS] Aleph Null in Software Practice & Experience.
Date: Mon, 2 May 2022 20:03:08 +1000
Message-ID: <CAKzdPgycWORPb40EiSN6JxkQD2Z-ywH=1AbVWNE_=r=W_Mzn6g@mail.gmail.com>
In-Reply-To: <20220502095514.7C2BC21547@orac.inputplus.co.uk>

Under option, maybe. I'm not a fan of putting invisible characters
into a program designed to translate numbers into cut-and-pasteable
text. Plus, as you said, it just makes other things break, although
perhaps they should be encouraged not to.

-rob

On Mon, May 2, 2022 at 7:56 PM Ralph Corderoy <ralph@inputplus.co.uk> wrote:
>
> Hi Rob,
>
> > The output of  "unicode 5d0-5e7" (robpike.io/cmd/unicode has the
> > command) is fun.
> >
> > 05d0 א 05d1 ב 05d2 ג 05d3 ד
> > 05d4 ה 05d5 ו 05d6 ז 05d7 ח
> > 05d8 ט 05d9 י 05da ך 05db כ
> > 05dc ל 05dd ם 05de מ 05df ן
> > 05e0 נ 05e1 ס 05e2 ע 05e3 ף
> > 05e4 פ 05e5 ץ 05e6 צ 05e7 ק
> >
> > For comparison, here is "unicode 3d0-3e7". It will be fun to watch how
> > it's rendered.
> >
> > 03d0 ϐ 03d1 ϑ 03d2 ϒ 03d3 ϓ
> > 03d4 ϔ 03d5 ϕ 03d6 ϖ 03d7 ϗ
> > 03d8 Ϙ 03d9 ϙ 03da Ϛ 03db ϛ
> > 03dc Ϝ 03dd ϝ 03de Ϟ 03df ϟ
> > 03e0 Ϡ 03e1 ϡ 03e2 Ϣ 03e3 ϣ
> > 03e4 Ϥ 03e5 ϥ 03e6 Ϧ 03e7 ϧ
>
> In the terminal where I read and write email, they're all as if ‘0041 A’.
> But save the email's text/plain to foo.txt and foo.html, add a little HTML
> to foo.html, and the browser, here Firefox, presents the Hebrew in both as
>
>     05d0 05 אd1 05 בd2 05 גd3 ד
>     05d4 05 הd5 05 וd6 05 זd7 ח
>     05d8 05 טd9 05 יda 05 ךdb כ
>     05dc 05 לdd 05 םde 05 מdf ן
>     05e0 05 נe1 05 סe2 05 עe3 ף
>     05e4 05 פe5 05 ץe6 05 צe7 ק
>
> due to the mix of Unicode's strong, weak, and neutral bi-directional
> character types.
>
> To see what I intend above needs a ‘broken’ renderer, like a terminal.
> For those with more intelligent renderers, it's as if runes normally
> drawn as
>
>     00c0 À 00c1 Á 00c2 Â 00c3 Ã
>
> became
>
>     00c0 00 Àc1 00 Ác2 00 Âc3 Ã
>
> Wrapping each of the Hebrew characters in the text and HTML files in
> LRI...PDI,
>
>     LRI  U+2066  Left-to-right isolate
>     PDI  U+2069  Pop directional isolate
>
> so the first row becomes
>
>     0030 0035 0064 0030  0020  2066 05d0 2069  0020
>     0030 0035 0064 0031  0020  2066 05d1 2069  0020
>     0030 0035 0064 0032  0020  2066 05d2 2069  0020
>     0030 0035 0064 0033  0020  2066 05d3 2069  000a
>
> has Firefox display the tables as intended.  Perhaps the unicode command
> should do this to ensure correct display, especially if some terminals
> ever start to improve?
>
> I note that vim(1) here doesn't realise LRI and PDI are zero width
> so the cursor position drifts past the end of the visible line.
> ed(1) copes without a murmur.
>
> --
> Cheers, Ralph.
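
A minimal sketch of the wrapping Ralph describes, with the isolates off by default as Rob suggests. This is not the code of robpike.io/cmd/unicode; the names format_row, code_points, and bidi_classes are hypothetical, and Python is used only for illustration. It formats a range of code points as "hex glyph" cells, optionally wraps each glyph in LRI...PDI, and can dump the result as code points for comparison with the first row quoted above.

```python
import unicodedata

LRI = "\u2066"  # LEFT-TO-RIGHT ISOLATE
PDI = "\u2069"  # POP DIRECTIONAL ISOLATE


def format_row(first, last, isolate=False):
    """Return 'hex glyph' cells for first..last inclusive, joined by spaces.

    With isolate=True each glyph is wrapped in LRI...PDI so that a
    bidi-aware renderer lays it out without reordering its neighbours.
    """
    cells = []
    for cp in range(first, last + 1):
        glyph = chr(cp)
        if isolate:
            glyph = LRI + glyph + PDI
        cells.append(f"{cp:04x} {glyph}")
    return " ".join(cells)


def code_points(text):
    """Return the code points of text as space-separated 4-digit hex."""
    return " ".join(f"{ord(c):04x}" for c in text)


def bidi_classes(text):
    """Return (hex code point, bidi class) pairs, e.g. EN, WS, R, LRI."""
    return [(f"{ord(c):04x}", unicodedata.bidirectional(c)) for c in text]


if __name__ == "__main__":
    row = format_row(0x5D0, 0x5D3, isolate=True)
    print(row)
    print(code_points(row + "\n"))
    print(bidi_classes("05d0 \u05d0"))
```

Keeping isolate off by default leaves the plain output free of invisible characters for cut and paste, while the opt-in form carries the isolates for renderers that apply the bidirectional algorithm.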
