From mboxrd@z Thu Jan 1 00:00:00 1970 From: michael@kjorling.se (Michael =?utf-8?B?S2rDtnJsaW5n?=) Date: Sun, 7 May 2017 16:33:08 +0000 Subject: [TUHS] Discuss of style and design of computer programs from a user stand point In-Reply-To: <201705071458.v47EwOtp014162@freefriends.org> References: <201705060202.v4622L1J013430@coolidge.cs.Dartmouth.EDU> <5a2d6cc957c2efcd968f35aa5557c7a0e309dd27@webmail.yaccman.com> <20170506091857.GE12539@yeono.kjorling.se> <20170506144011.GF28787@mcvoy.com> <20170506195049.8FBEA124AEA4@mail.bitblocks.com> <20170507135407.GS12539@yeono.kjorling.se> <201705071458.v47EwOtp014162@freefriends.org> Message-ID: <20170507163308.GT12539@yeono.kjorling.se> On 7 May 2017 08:58 -0600, from arnold at skeeve.com: >>> I don't imagine it would be hard to re-write [uniq] to handle utf-8. >> >> It does look like at least GNU coreutils 8.13 uniq is broken in that >> regard, which frankly surprised me. That version isn't _that_ old. > > Are your LC_* env variables set correctly? I believe so (I don't recall seeing any other UTF-8-related weirdness for a very long time), but I would want to verify the behavior on a system that doesn't have a gazillion customizations accumulated over years _and_ has the most recent version of coreutils before I file an actual bug report. (It's not like Debian is ever bleeding edge, to begin with.) I'll see if I spin up a VM or three one of these days to check. -- Michael Kjörling • https://michael.kjorling.se • michael at kjorling.se “People who think they know everything really annoy those of us who know we don’t.” (Bjarne Stroustrup)