The Unix Heritage Society mailing list
From: Steffen Nurpmeso <steffen@sdaoden.eu>
To: arnold@skeeve.com
Cc: tuhs@tuhs.org
Subject: [TUHS] Re: Bell Foreign-Language UNIX Efforts
Date: Mon, 20 Mar 2023 16:44:30 +0100
Message-ID: <20230320154430.DW_SS%steffen@sdaoden.eu>
In-Reply-To: <202303200755.32K7tIeW023352@freefriends.org>

arnold@skeeve.com wrote in
 <202303200755.32K7tIeW023352@freefriends.org>:
 |Rob Pike <robpike@gmail.com> wrote:
 |> (Speaking of design by committee, the multibyte stuff in C89 was
 |> atrocious, and I heard was done in committee to get someone, perhaps
 |> the Japanese, to sign off.)
 |
 |It's not lovely, but I wouldn't call it atrocious. It gets the job
 |done; code using it can handle multibyte encodings while being totally

No, it does not.

 |character-set agnostic.  I speak from experience, gawk does this.

However, note that even something like "uppercase this string"
cannot be done the right way, because a truly Unicode-aware
operation needs to look at the entire string (sentence): there
may be interdependencies that modify the result.  Therefore
the entire isw*() and tow*() series is simply wrong.  And
therefore gawk does this wrong, too.  (But the GNU environment
does have a solution, i think.)
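
To make the point concrete, here is a minimal sketch in C, assuming
nothing beyond ISO C <wchar.h> and <wctype.h>.  The helper name
upper_per_char() is hypothetical and not taken from gawk or any other
program.  It shows the only shape the tow*() interface allows: one
wide character in, one wide character out, with no view of the
neighbours.

```c
#include <stddef.h>
#include <wchar.h>
#include <wctype.h>

/*
 * Uppercase src into dst one wide character at a time.
 * dst must have room for wcslen(src) + 1 elements.
 * Returns the number of characters written, which by construction
 * always equals wcslen(src).
 */
size_t
upper_per_char(wchar_t *dst, const wchar_t *src)
{
	size_t i;

	for (i = 0; src[i] != L'\0'; ++i)
		dst[i] = (wchar_t)towupper((wint_t)src[i]);
	dst[i] = L'\0';
	return i;
}
```

Because the output length is fixed to the input length, this loop can
never turn German "ß" into "SS", which is what Unicode full case
mapping asks for, and it has no way to apply context-sensitive rules
such as the Greek final sigma.  Getting those right needs a routine
that sees the whole string.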

 |(I use the "restartable" routines - mbrlen() and so on.)

Yes.
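
For readers who have not used the restartable routines, the following
is a minimal sketch of the usual mbrlen() loop, assuming only ISO C.
The helper name mb_count() is hypothetical.  It counts characters
under whatever LC_CTYPE locale is in effect, so the caller is assumed
to have called setlocale(LC_ALL, "") beforehand if anything other than
the "C" locale is wanted.

```c
#include <stddef.h>
#include <string.h>
#include <wchar.h>

/*
 * Count the characters in the multibyte string s under the current
 * LC_CTYPE locale.  Returns (size_t)-1 if s contains an invalid or
 * truncated multibyte sequence.
 */
size_t
mb_count(const char *s)
{
	mbstate_t st;
	size_t n = 0, left = strlen(s);

	memset(&st, 0, sizeof st);	/* initial conversion state */
	while (left > 0) {
		size_t r = mbrlen(s, left, &st);

		if (r == (size_t)-1 || r == (size_t)-2)
			return (size_t)-1;
		if (r == 0)		/* NUL; not reachable within strlen(s) bytes */
			break;
		s += r;
		left -= r;
		++n;
	}
	return n;
}
```

The explicit mbstate_t is what makes the routine restartable: the
conversion state lives in the caller's object rather than in hidden
static storage, as it does with plain mblen().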

 |I understand that Unicode + UTF-8 solve the issue completely. But I'd

In fact, to do it right you need something like ICU.
There are special number systems, and they do not fit ISO C.
There are special grammatical rules to obey, which hurts
especially for anything truly collation-aware.

(And then my brain simply runs away from the thinking that
invented strcoll(3) for anything beyond all-american ten inch.)

 |like to ask, in all seriousness and so that I can learn, given the world
 |as it was in 1989, how would you solve the problem? If you had designed
 |the C level routines, what would they have looked like?

P.S.: no, no, and one more no.
If you want to have a nice Monday, please have a look at NetBSD
current source code, lib/libc/gen/vis.c.  There you see how good
this interface "gets the job done".  And i saw it evolve as the
commits of Christos Zoulas flew by, ten years or so ago.
No.

 |Thanks,

Then again, none of it matters, since the IETF and others simply pile
one thing on top of another, so that you need a JSON library for
a key=value list, and an HTTP, HTTP/2 and HTTP/3 library to
download it over TLS (i think the entire world now proxies all
protocols over :443, which makes it safer, and administration
easier! .. i have heard).  Why did you invent 16-bit ports back
then?  What were you thinking?  One is enough, and much safer!
That makes me wonder how OpenBSD could introduce two remote holes
for only one port, .. but that likely is a different story.

Hysterical on a Monday, and that on Equinox.

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)
