From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Tue, 28 Apr 2009 09:48:32 +0100
From: Peter Stephenson
To: zsh-workers@sunsite.dk
Subject: Re: D07multibyte.ztst failure on HP-UX 11.11
Message-ID: <20090428094832.443012a2@news01>
In-Reply-To: <20090427192643.GD28369@otaku>
References: <20090427031703.GC28369@otaku> <200904270842.55723.arvidjaar@gmail.com> <20090427192643.GD28369@otaku>
Organization: CSR
X-Seq: 26892

On Mon, 27 Apr 2009 19:26:43 +0000
Paul Ackersviller wrote:
> On Mon, Apr 27, 2009 at 08:42:46AM +0400, Andrey Borzenkov wrote:
> > Could you verify exact byte sequence with od, xxd or like? It is quite
> > possible to have combined vs. non-combined characters here (which look
> > alike in printable form but have different internal representation).
>
> Of course, should've thought of that the first time.  I've attempted
> to annotate the mismatches, but could've missed something.
>
>
> 0000000: 2a2a 2a20 312c 3220 2a2a 2a2a 0a21 2048  *** 1,2 ****.! H
> 0000010: ce91 4820 48ce 9248 2048 ce93 4820 48ce  ..H H..H H..H H.
>            ^            ^
> 0000020: 9448 2048 ce95 480a 2020 4841 4820 4845  .H H..H.  HAH HE
>          ^           ^
> 0000030: 4820 4855 4820 48c3 8848 2048 c389 480a  H HUH H..H H..H.
>
>
> 0000040: 2d2d 2d20 312c 3220 2d2d 2d2d 0a21 2048  --- 1,2 ----.! H
> 0000050: ce95 4820 48ce 9448 2048 ce93 4820 48ce  ..H H..H H..H H.
>            ^            ^
> 0000060: 9248 2048 ce91 480a 2020 4841 4820 4845  .H H..H.  HAH HE
>          ^           ^
> 0000070: 4820 4855 4820 48c3 8848 2048 c389 480a  H HUH H..H H..H.

You missed a 94 and a 92 which I've marked: the problem is again that the
sort order isn't quite as deterministic as one might hope.  It looks like
something funny happened to the characters in your original post; this may
or may not be related.  It's possible the problem is in case modification.

The desired answer is that (in the selected UTF-8 locale)

print -oi HΕH HΔH HΓH HΒH HΑH

outputs

HΑH HΒH HΓH HΔH HΕH

(the middle letters are all upper case Greek).  Does it work without the -i?

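For anyone wanting to check the dump by hand rather than by eye, here is a minimal Python sketch. It is only an illustration: the variable and function names are made up for the example, the hex strings are transcribed from the two halves of the dump above (without the "! " diff marker and the trailing newline), and Python's code-point ordering and `str.casefold` are stand-ins for the locale's collation and for -i, not a description of what zsh does internally.

```python
# Bytes transcribed from the quoted dump. The first half ("***") matches
# the desired order given above; the second half ("---") is the other side.
expected_hex = "48ce9148 20 48ce9248 20 48ce9348 20 48ce9448 20 48ce9548"
actual_hex = "48ce9548 20 48ce9448 20 48ce9348 20 48ce9248 20 48ce9148"


def words(hex_string):
    """Decode a hex string as UTF-8 and split it on whitespace."""
    return bytes.fromhex(hex_string).decode("utf-8").split()


expected = words(expected_hex)
actual = words(actual_hex)

# Collect every word position where the two sides disagree,
# together with the raw bytes of each side.
mismatches = [
    (i, e.encode("utf-8").hex(), a.encode("utf-8").hex())
    for i, (e, a) in enumerate(zip(expected, actual))
    if e != a
]
for i, e_hex, a_hex in mismatches:
    print(f"word {i}: expected {e_hex}, got {a_hex}")

# Assumption: plain code-point order approximates the locale's collation
# for these five strings; casefold approximates a case-insensitive sort.
print("plain sort gives desired order:", sorted(actual) == expected)
print("casefolded sort gives desired order:",
      sorted(actual, key=str.casefold) == expected)
```

The two sides differ in four of the five words (the ones containing bytes 91, 92, 94 and 95), which agrees with the marked dump; only the middle word with ce93 is the same on both sides.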
The sort tests have always been arguably more trouble than they're worth,
though I suppose it probably is worth spotlighting where the problems are.

-- 
Peter Stephenson                              Software Engineer
CSR PLC, Churchill House, Cambridge Business Park, Cowley Road
Cambridge, CB4 0WZ, UK                          Tel: +44 (0)1223 692070