From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/778 Path: news.gmane.org!not-for-mail From: Rich Felker Newsgroups: gmane.linux.lib.musl.general Subject: Re: Re: commit 1a63a9fc30e7a1f1239e3cedcb5041e5ec1c5351 Date: Sun, 29 Apr 2012 15:50:36 -0400 Message-ID: <20120429195036.GI14673@brightrain.aerifal.cx> References: Reply-To: musl@lists.openwall.com NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Trace: dough.gmane.org 1335728846 3841 80.91.229.3 (29 Apr 2012 19:47:26 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Sun, 29 Apr 2012 19:47:26 +0000 (UTC) To: musl@lists.openwall.com Original-X-From: musl-return-779-gllmg-musl=m.gmane.org@lists.openwall.com Sun Apr 29 21:47:26 2012 Return-path: Envelope-to: gllmg-musl@plane.gmane.org Original-Received: from mother.openwall.net ([195.42.179.200]) by plane.gmane.org with smtp (Exim 4.69) (envelope-from ) id 1SOa5d-0003GB-Rw for gllmg-musl@plane.gmane.org; Sun, 29 Apr 2012 21:47:25 +0200 Original-Received: (qmail 9645 invoked by uid 550); 29 Apr 2012 19:47:25 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: Original-Received: (qmail 9637 invoked from network); 29 Apr 2012 19:47:24 -0000 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Xref: news.gmane.org gmane.linux.lib.musl.general:778 Archived-At: On Sun, Apr 29, 2012 at 03:02:22PM -0400, James Cloos wrote: > > also special-case ß (U+00DF) ... unicode added an uppercase version of > > this character but does not map it, presumably because the uppercase > > version is not actually used except for some obscure purpose ... > > For the record: > > ẞ (U+1E9E) is not mapped as ß’s majuscule because Unicode’s (really MS’s) > idea of stable is »no bug fixes« and a preference for SS by some, not > because of lack of use. > > The legacy of using SS as ß’s majuscule continues in most contexts, but > from what I hear, ẞ is gaining ground. Especially in text which is set > without miniscules, such as street and commercial signage. > > Even without a global ß→ẞ mapping, it might be true that such a mapping > already might be preferred for de_AT text. (The earliest sightings of > ẞ which were mentioned on unicode@ were, IIRC, in Austria.) Thank you for the information. I'm also aware of (and generally disagree with) Unicode's policy of not fixing bugs. While I'd like to see some more opinions (from both "sides", if there are sides to the issue) what you're saying makes a lot of sense, and it would definitely clean up naive case mapping quite a bit (i.e. de-uglify the result of applications performing naive uppercasing). One potential argument I can think of against the case mapping is that uppercasing would map Latin-1 text outside of Latin-1. But this is really a problem of the folks still using Latin-1, and not terribly relevant to a 100% UTF-8 system like one built on musl. Rich P.S. Thanks for writing ẞ out in the email and thereby bringing to my attention the fact that it's missing in uuterm's ytty font. This is probably sufficient to get me to move the uuterm repository to git on git.etalabs.net and make the first commit in years.