From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-1.0 required=5.0 tests=MAILING_LIST_MULTI autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 30846 invoked from network); 20 Mar 2023 07:55:38 -0000 Received: from minnie.tuhs.org (50.116.15.146) by inbox.vuxu.org with ESMTPUTF8; 20 Mar 2023 07:55:38 -0000 Received: from minnie.tuhs.org (localhost [IPv6:::1]) by minnie.tuhs.org (Postfix) with ESMTP id 10D0A41341; Mon, 20 Mar 2023 17:55:33 +1000 (AEST) Received: from freefriends.org (freefriends.org [96.88.95.60]) by minnie.tuhs.org (Postfix) with ESMTPS id DC2764133A for ; Mon, 20 Mar 2023 17:55:23 +1000 (AEST) X-Envelope-From: arnold@skeeve.com Received: from freefriends.org (freefriends.org [96.88.95.60]) by freefriends.org (8.14.7/8.14.7) with ESMTP id 32K7tJqj023354 (version=TLSv1/SSLv3 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Mon, 20 Mar 2023 01:55:20 -0600 Received: (from arnold@localhost) by freefriends.org (8.14.7/8.14.7/Submit) id 32K7tIeW023352; Mon, 20 Mar 2023 01:55:18 -0600 From: arnold@skeeve.com Message-Id: <202303200755.32K7tIeW023352@freefriends.org> X-Authentication-Warning: frenzy.freefriends.org: arnold set sender to arnold@skeeve.com using -f Date: Mon, 20 Mar 2023 01:55:18 -0600 To: robpike@gmail.com, ralph@inputplus.co.uk References: <20230319134701.3A262220F7@orac.inputplus.co.uk> In-Reply-To: User-Agent: Heirloom mailx 12.5 7/5/10 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID-Hash: UTOUAQWMBKYUSSIWBBBN5MEU37GMU6NC X-Message-ID-Hash: UTOUAQWMBKYUSSIWBBBN5MEU37GMU6NC X-MailFrom: arnold@skeeve.com X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header CC: tuhs@tuhs.org X-Mailman-Version: 3.3.6b1 Precedence: list Subject: [TUHS] Re: Bell Foreign-Language UNIX Efforts List-Id: The Unix Heritage Society mailing list Archived-At: List-Archive: List-Help: List-Owner: List-Post: List-Subscribe: List-Unsubscribe: Hi Rob. Rob Pike wrote: > (Speaking of design by committee, the multibyte stuff in C89 was atrocious, > and I heard was done in committee to get someone, perhaps the Japanese, to > sign off.) It's not lovely, but I wouldn't call it atrocious. It gets the job done; code using it can handle multibyte encodings while being totally character-set agnostic. I speak from experience, gawk does this. (I use the "restartable" routins - mbrlen() and so on.) I understand that Unicode + UTF-8 solve the issue completely. But I'd like to ask, in all seriousness and so that I can learn, given the world as it was in 1989, how would you solve the problem? If you had designed the C level routines, what would they have looked like? Thanks, Arnold