From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 23308 invoked from network); 11 Sep 2005 17:05:24 -0000 Received: from news.dotsrc.org (HELO a.mx.sunsite.dk) (130.225.247.88) by ns1.primenet.com.au with SMTP; 11 Sep 2005 17:05:24 -0000 Received: (qmail 12855 invoked from network); 11 Sep 2005 17:05:18 -0000 Received: from sunsite.dk (130.225.247.90) by a.mx.sunsite.dk with SMTP; 11 Sep 2005 17:05:18 -0000 Received: (qmail 9145 invoked by alias); 11 Sep 2005 17:05:15 -0000 Mailing-List: contact zsh-workers-help@sunsite.dk; run by ezmlm Precedence: bulk X-No-Archive: yes X-Seq: 21728 Received: (qmail 9135 invoked from network); 11 Sep 2005 17:05:15 -0000 Received: from news.dotsrc.org (HELO a.mx.sunsite.dk) (130.225.247.88) by sunsite.dk with SMTP; 11 Sep 2005 17:05:15 -0000 Received: (qmail 12572 invoked from network); 11 Sep 2005 17:05:15 -0000 Received: from mailgw4.technion.ac.il (132.68.238.37) by a.mx.sunsite.dk with SMTP; 11 Sep 2005 17:05:05 -0000 Received: from localhost (localhost.localdomain [127.0.0.1]) by mailgw4.technion.ac.il (Postfix) with ESMTP id 07753F793C for ; Sun, 11 Sep 2005 19:49:36 +0300 (IDT) Received: from mailgw4.technion.ac.il ([127.0.0.1]) by localhost (mailgw4.technion.ac.il [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 11114-01-22 for ; Sun, 11 Sep 2005 19:49:35 +0300 (IDT) Received: from fermat.math.technion.ac.il (fermat.math.technion.ac.il [132.68.115.6]) by mailgw4.technion.ac.il (Postfix) with ESMTP id CE597F7931 for ; Sun, 11 Sep 2005 19:49:35 +0300 (IDT) Received: from fermat.math.technion.ac.il (localhost [127.0.0.1]) by fermat.math.technion.ac.il (8.12.10/8.12.10) with ESMTP id j8BH53D6023464; Sun, 11 Sep 2005 20:05:03 +0300 (IDT) Received: (from rl@localhost) by fermat.math.technion.ac.il (8.12.10/8.12.10/Submit) id j8BH53IZ023463; Sun, 11 Sep 2005 20:05:03 +0300 (IDT) X-Authentication-Warning: fermat.math.technion.ac.il: rl set sender to rl@math.technion.ac.il using -f Date: Sun, 11 Sep 2005 20:05:03 +0300 From: "Zvi Har'El" To: Zsh hackers list Cc: "Nadav Har'El" Subject: Re: problem in prompt in utf-8 Message-ID: <20050911170503.GA23381@fermat.math.technion.ac.il> References: <20050911121345.GA14384@fermat.math.technion.ac.il> <20050911165516.GA23234@fermat.math.technion.ac.il> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20050911165516.GA23234@fermat.math.technion.ac.il> User-Agent: Mutt/1.4.2i Organization: Technion--Israel Institute of Technology Hebrew-Date: 8 Elul 5765 X-PGP-Public-Key: http://www.math.technion.ac.il/~rl/etc/pubkey.html X-Virus-Scanned: by amavisd-new at technion.ac.il X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on f.primenet.com.au X-Spam-Level: X-Spam-Status: No, score=-2.3 required=5.0 tests=AWL,BAYES_00 autolearn=ham version=3.0.4 Final piece of information: using "screen" and taking a screenlog I have been able to check that the illegal sequences in the prompt all result from a solitary 0xD7 (M-W) byte (i.e, the second byte of the sequence disappears. On Sun, 11 Sep 2005 19:55:16 +0300, Zvi Har'El wrote about "Re: problem in prompt in utf-8": > I found out that the problem of the prompt (rather then the print -P $PS1), is > TERM dependent. When I set TERM=dumb, the prompt is printed correctly on the > screen, but afterwords the cursor moves to the right several positions which > is seems to be identical in length to the size of the prompt. Since each > character is 2 bytes, I suppose the length is incorrectly calculated. > > On Sun, 11 Sep 2005 15:13:45 +0300, Zvi Har'El wrote about "problem in prompt in utf-8": > > Hello, > > > > I have started using zsh-4.3.0 from the CVS, in a uft-8 locale. I enjoy it > > very much. However, I have a problem with the prompting. This is not new, but > > since the completion now works nicely, I thought I'll mention it, since it is > > not solved yet. > > > > I have the setting > > > > PS1=%/$\ > > > > I expect that > > > > print -P $PS1 > > > > > > and > > > > pwd > > > > will give the same output, which will also be the zsh prompt (except the final > > $ and space, of course). However, if the current directory name contains > > hebrew letter, which are in the range U+05D0 to U+05EA, i.e., the utf-8 > > sequences have two bytes, with the first one is always 0xD7 (M-W) and the > > second in the range 0x90 (M-^P) to 0xAA (M-*). I mkdir'ed a directory which > > has all the letters in this range: > > > > /home/rl$ mkdir אבגדהוזחטיךכלםמןנסעףפץצקרשת > > > > cd'ed to that directory: > > > > /home/rl$ cd אבגדהוזחטיךכלםמןנסעףפץצקרשת > > > > I got as an echo a correct result: > > > > ~/אבגדהוזחטיךכלםמןנסעףפץצקרשת > > > > The next prompt had invalid utf-8 sequences: > > > > > > /home/rl/������������לםמןנסעףפץצקרשת$ > > > > > > To make it more specific, all the range U+05D0 to U+05DB, (second byte 0x90 > > to 0x9ba) got invalid. I don't know exactly what is wrong. Notice that 'pwd' > > produces > > > > /home/rl/אבגדהוזחטיךכלםמןנסעףפץצקרשת > > > > I.e, all the letters are correct, while 'print -P $PS1' produces > > > > /home/rl/אבגדהוזחטיך�לםמןנסעףפץצקרשת$ > > > > With exactly one invalid utf-8 seqience, more specifically, U+05DB (second > > byte 0x9ba) - the last one in the previous range, is bad. > > > > print -P $PS1 | cat -v produces > > > > /home/rl/M-WM-^PM-WM-^QM-WM-^RM-WM-^SM-WM-^TM-WM-^UM-WM-^VM-WM-^WM-WM-^XM-WM-^YM-WM-^ZM-WM-WM-^\M-WM-^]M-WM-^^M-WM-^_M-WM- M-WM-!M-WM-"M-WM-#M-WM-$M-WM-%M-WM-&M-WM-'M-WM-(M-WM-)M-WM-*$ > > > > while pwd | cat -v produces > > > > /home/rl/M-WM-^PM-WM-^QM-WM-^RM-WM-^SM-WM-^TM-WM-^UM-WM-^VM-WM-^WM-WM-^XM-WM-^YM-WM-^ZM-WM-^[M-WM-^\M-WM-^]M-WM-^^M-WM-^_M-WM- M-WM-!M-WM-"M-WM-#M-WM-$M-WM-%M-WM-&M-WM-'M-WM-(M-WM-)M-WM-* > > > > It is perhaps hard to see the difference, but a close inspection shows that > > the first string contains a solitary M-W between the M-WM-^Z and the the > > M-WM-^\ sequences, while the second one contains there the sequence M-WM-^[ , > > i.e., a M-^[, or Meta-Esacpe, was dropped from the string. > > > > Unfortunately, I didn't find an easy way to put the real prompt on a file, so > > I can't tell what is the exact sequences in it. > > > > I hope this make some sense. > > > > -- > > Dr. Zvi Har'El mailto:rl@math.technion.ac.il Department of Mathematics > > tel:+972-54-4227607 icq:179294841 Technion - Israel Institute of Technology > > fax:+972-4-8293388 http://www.math.technion.ac.il/~rl/ Haifa 32000, ISRAEL > > "If you can't say somethin' nice, don't say nothin' at all." -- Thumper (1942) > > Sunday, 7 Elul 5765, 11 September 2005, 1:54PM > > -- > Dr. Zvi Har'El mailto:rl@math.technion.ac.il Department of Mathematics > tel:+972-54-4227607 icq:179294841 Technion - Israel Institute of Technology > fax:+972-4-8293388 http://www.math.technion.ac.il/~rl/ Haifa 32000, ISRAEL > "If you can't say somethin' nice, don't say nothin' at all." -- Thumper (1942) > Sunday, 8 Elul 5765, 11 September 2005, 7:51PM -- Dr. Zvi Har'El mailto:rl@math.technion.ac.il Department of Mathematics tel:+972-54-4227607 icq:179294841 Technion - Israel Institute of Technology fax:+972-4-8293388 http://www.math.technion.ac.il/~rl/ Haifa 32000, ISRAEL "If you can't say somethin' nice, don't say nothin' at all." -- Thumper (1942) Sunday, 8 Elul 5765, 11 September 2005, 8:00PM