From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 21043 invoked by alias); 18 Dec 2014 09:25:27 -0000 Mailing-List: contact zsh-users-help@zsh.org; run by ezmlm Precedence: bulk X-No-Archive: yes List-Id: Zsh Users List List-Post: List-Help: X-Seq: 19559 Received: (qmail 7366 invoked from network); 18 Dec 2014 09:25:24 -0000 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on f.primenet.com.au X-Spam-Level: X-Spam-Status: No, score=-2.7 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,FREEMAIL_FROM,RCVD_IN_DNSWL_LOW autolearn=ham version=3.3.2 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; bh=giB4xNfe7xz7w1iJC+vAPnbuoim1mOGK5cw14uVmt9M=; b=FrRFcpXBIdL9pH3r/qrNLfwpNxlkOvtUwZgzk2o39QVh1Xaf1E5nZZnTI+PTO2eU6t A/GfoCuJvlA25vSNzpIx0x0YyYXc6s7nKPsmhj29+peIqrceWhYI80MhFdUaMQeKEZ9c UUxcFCd20bw9BXmNNB2cp7eiFST/LEYmUCPqUyG8o2xaLFidaKgfevxy1KyzDRdKLe76 77tgKkAUbTQyvtDz3Ei5+Wwdu6+8FFptFTbD2WXGqiCmnQsiwHTXqR9w+/t3+7AF14Ry s5et50NYYbXQLZIVjf1eL3LnOhmPREt4pQaZiBTla3XTMgHsZOuZS2Hsf4faxrIZMTGR Rt1w== MIME-Version: 1.0 X-Received: by 10.50.80.36 with SMTP id o4mr12453031igx.37.1418894718327; Thu, 18 Dec 2014 01:25:18 -0800 (PST) In-Reply-To: <48944F64-7308-4D4A-B056-6B467CE232C1@yandex.ru> References: <5491C5E7.1070207@eastlink.ca> <577101418848314@web25o.yandex.ru> <5492225C.3070403@eastlink.ca> <48944F64-7308-4D4A-B056-6B467CE232C1@yandex.ru> Date: Thu, 18 Dec 2014 10:25:18 +0100 Message-ID: Subject: Re: utf-8 From: Mikael Magnusson To: =?UTF-8?B?0J/QsNCy0LvQvtCyINCd0LjQutC+0LvQsNC5INCQ0LvQtdC60YHQsNC90LTRgNC+0LLQuNGH?= Cc: Ray Andrews , Zsh Users Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On Thu, Dec 18, 2014 at 7:48 AM, =D0=9F=D0=B0=D0=B2=D0=BB=D0=BE=D0=B2 =D0= =9D=D0=B8=D0=BA=D0=BE=D0=BB=D0=B0=D0=B9 =D0=90=D0=BB=D0=B5=D0=BA=D1=81=D0= =B0=D0=BD=D0=B4=D1=80=D0=BE=D0=B2=D0=B8=D1=87 wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA512 > > On December 18, 2014 3:39:56 AM EAT, Ray Andrews = wrote: >>On 12/17/2014 12:31 PM, ZyX wrote: >> >> >>ZyX, >>> It looks like it is the following: - Explicit support in RE patterns. >> >>> - COMBINING_CHARS option that tells zsh that terminal is able to >>display >>... I did some reading, but it's too 'zoomed in' for me, it presumes >>one >>already more or less knows what's going on. I don't. > > Your question is too broad to give more detailed answer and the intent is= not clear. You are also posting to zsh users and developers mainly live in= zsh workers, reading users with lower priority. I know some internals of z= sh (not the part you are requesting though) and know some "dark corners" of= unicode processing in general, but I cannot give more detailed explanation= without knowing what you are after. All mails to zsh-users are automatically sent to subscribers of zsh-workers as well. The main issue with non-singlebyte encodings is that almost all the code used to assume that one byte equals one character equals one on-screen character cell. This took a couple of years to fix, but is more or less done now. There is nothing specific to UTF-8 in the code as far as I know, except in getkeystring, but that looks more like an optimization to avoid calling iconv(). Eg, zsh works fine if you run under EUC-JP too, but then you can of course only type japanese characters (and the ascii set). Most of what Pavlov(if my cyrillic isn't too rusty) said applies to unicode, not utf-8, which is a character set, not a character encoding. All the unicode things should work fine in any encoding/character set, assuming the character you want exists in it. --=20 Mikael Magnusson