From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 10160 invoked by alias); 14 May 2013 13:37:27 -0000 Mailing-List: contact zsh-workers-help@zsh.org; run by ezmlm Precedence: bulk X-No-Archive: yes List-Id: Zsh Workers List List-Post: List-Help: X-Seq: 31403 Received: (qmail 26868 invoked from network); 14 May 2013 13:37:22 -0000 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on f.primenet.com.au X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham version=3.3.2 Received-SPF: none (ns1.primenet.com.au: domain at vinc17.net does not designate permitted sender hosts) Date: Tue, 14 May 2013 15:28:40 +0200 From: Vincent Lefevre To: zsh-workers@zsh.org Subject: Re: Subversion completion don't work with UTF8 (and other) file names Message-ID: <20130514132840.GA3544@ypig.lip.ens-lyon.fr> Mail-Followup-To: zsh-workers@zsh.org References: <20130426123921.GT16210@xvii.vinc17.org> <20130426154427.06972873@pwslap01u.europe.root.pri> <20130426153546.GV16210@xvii.vinc17.org> <20130429095741.7867e358@pwslap01u.europe.root.pri> <19868.1367633701@thecus.kiddle.eu> <20130504142355.0fdaf612@pws-pc.ntlworld.com> <14778.1367912315@thecus.kiddle.eu> <20130507102115.59d18d55@pwslap01u.europe.root.pri> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20130507102115.59d18d55@pwslap01u.europe.root.pri> X-Mailer-Info: http://www.vinc17.net/mutt/ User-Agent: Mutt/1.5.21-6305-vl-r59709 (2013-04-16) On 2013-05-07 10:21:15 +0100, Peter Stephenson wrote: > I suppose I was asking the more general question: given a utility we > need output from, is it OK to keep LC_CTYPE and set everything else > to C or is there something else going on? It depends on the utility. With some utilities, you may want to set LC_CTYPE to C in order to avoid non-ASCII characters and/or get charset-independent output. For instance, with the US-ASCII charmap, $ LANG=C LC_CTYPE=C cp foo cp: missing destination file operand after 'foo' Try 'cp --help' for more information. but with the UTF-8 charmap, $ LANG=C LC_CTYPE=C.UTF-8 cp foo cp: missing destination file operand after ‘foo’ Try 'cp --help' for more information. The quotes of foo are non-ASCII ones in the latter case. But if the filename can have non-ASCII characters... $ LANG=C LC_CTYPE=C cp aéb cp: missing destination file operand after 'a\303\251b' Try 'cp --help' for more information. while one gets the accented character in UTF-8: $ LANG=C LC_CTYPE=C.UTF-8 cp aéb cp: missing destination file operand after ‘aéb’ Try 'cp --help' for more information. but things get complex if the filename has special characters (even ASCII ones)... -- Vincent Lefèvre - Web: 100% accessible validated (X)HTML - Blog: Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)