From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 1827 invoked from network); 10 May 2007 07:56:51 -0000 X-Spam-Checker-Version: SpamAssassin 3.2.0 (2007-05-01) on f.primenet.com.au X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=AWL,BAYES_00 autolearn=no version=3.2.0 Received: from news.dotsrc.org (HELO a.mx.sunsite.dk) (130.225.247.88) by ns1.primenet.com.au with SMTP; 10 May 2007 07:56:51 -0000 Received-SPF: none (ns1.primenet.com.au: domain at sunsite.dk does not designate permitted sender hosts) Received: (qmail 22844 invoked from network); 10 May 2007 07:56:44 -0000 Received: from sunsite.dk (130.225.247.90) by a.mx.sunsite.dk with SMTP; 10 May 2007 07:56:44 -0000 Received: (qmail 13669 invoked by alias); 10 May 2007 07:56:42 -0000 Mailing-List: contact zsh-workers-help@sunsite.dk; run by ezmlm Precedence: bulk X-No-Archive: yes X-Seq: 23410 Received: (qmail 13659 invoked from network); 10 May 2007 07:56:42 -0000 Received: from news.dotsrc.org (HELO a.mx.sunsite.dk) (130.225.247.88) by sunsite.dk with SMTP; 10 May 2007 07:56:42 -0000 Received: (qmail 22549 invoked from network); 10 May 2007 07:56:42 -0000 Received: from flock1.newmail.ru (82.204.219.207) by a.mx.sunsite.dk with SMTP; 10 May 2007 07:56:38 -0000 Received: (qmail 32277 invoked from network); 10 May 2007 07:56:21 -0000 Received: from unknown (HELO cooker.local) (arvidjaar@newmail.ru@85.141.134.196) by smtpd.newmail.ru with SMTP; 10 May 2007 07:56:21 -0000 From: Andrey Borzenkov To: zsh-workers@sunsite.dk Subject: Silent UTF-8 assumption? Date: Thu, 10 May 2007 11:56:18 +0400 User-Agent: KMail/1.9.6 MIME-Version: 1.0 Content-Type: multipart/signed; boundary="nextPart1795203.6vxPbZfGLe"; protocol="application/pgp-signature"; micalg=pgp-sha1 Content-Transfer-Encoding: 7bit Message-Id: <200705101156.19776.arvidjaar@newmail.ru> --nextPart1795203.6vxPbZfGLe Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline This caught my attention: static wchar_t charref(char *x, char *y) { wchar_t wc; size_t ret; if (!(patglobflags & GF_MULTIBYTE) || !(STOUC(*x) & 0x80)) return (wchar_t) STOUC(*x); well, this is definitely not valid for arbitrary multibyte character set. I= am=20 just curious if it is possible to consistently assume that UTF-8 is in use?= =20 That can definitely simplify things. --nextPart1795203.6vxPbZfGLe Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (GNU/Linux) iD8DBQBGQtAjR6LMutpd94wRAuAMAJ0WH83bFuTC+fvtLoAZL8Xr0AQZ1QCfTVW2 J5wZJSSeL7YH3dbHMWJdJrM= =H1xI -----END PGP SIGNATURE----- --nextPart1795203.6vxPbZfGLe--