zsh-workers
 help / color / mirror / code / Atom feed
* PATCH: Don't treat NUL as a combining character
@ 2015-05-09  7:46 Mikael Magnusson
  2015-05-10  0:02 ` Mikael Magnusson
  0 siblings, 1 reply; 4+ messages in thread
From: Mikael Magnusson @ 2015-05-09  7:46 UTC (permalink / raw)
  To: zsh workers

As title, if you input
% something<ctrl-v ctrl-space>
while combining_chars is set, it visually doesn't appear, but toggling
the option makes it appear again (just like for actual combining
characters).

(I was momentarily confused why a command looking like *.mp4(e'fattr
notes test') worked, then i remembered my ^_e bind to enter an e::
glob uses ^@ as the separator. Normally I don't have combining_chars
set, but this directory contains some NFD files.)

I considered using L'\0' to clarify the intent, but I have no idea if
all compilers support wchar literals?

index 6b3cb97..13c053c 100644
--- i/Src/zsh.h
+++ w/Src/zsh.h
@@ -2982,9 +2982,9 @@ typedef wint_t convchar_t;
  * We can't be quite sure the wcwidth we've provided is entirely
  * in agreement with the system's, so be extra safe.
  */
-#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0 && !iswcntrl(wc))
+#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0 && !iswcntrl(wc))
 #else
-#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0)
+#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0)
 #endif
 /*
  * Test for the base of a combining character.


-- 
Mikael Magnusson


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: PATCH: Don't treat NUL as a combining character
  2015-05-09  7:46 PATCH: Don't treat NUL as a combining character Mikael Magnusson
@ 2015-05-10  0:02 ` Mikael Magnusson
  2015-05-10  0:14   ` Bart Schaefer
  0 siblings, 1 reply; 4+ messages in thread
From: Mikael Magnusson @ 2015-05-10  0:02 UTC (permalink / raw)
  To: zsh workers

On Sat, May 9, 2015 at 9:46 AM, Mikael Magnusson <mikachu@gmail.com> wrote:
> As title, if you input
> % something<ctrl-v ctrl-space>
> while combining_chars is set, it visually doesn't appear, but toggling
> the option makes it appear again (just like for actual combining
> characters).
>
> (I was momentarily confused why a command looking like *.mp4(e'fattr
> notes test') worked, then i remembered my ^_e bind to enter an e::
> glob uses ^@ as the separator. Normally I don't have combining_chars
> set, but this directory contains some NFD files.)
>
> I considered using L'\0' to clarify the intent, but I have no idea if
> all compilers support wchar literals?
>
> index 6b3cb97..13c053c 100644
> --- i/Src/zsh.h
> +++ w/Src/zsh.h
> @@ -2982,9 +2982,9 @@ typedef wint_t convchar_t;
>   * We can't be quite sure the wcwidth we've provided is entirely
>   * in agreement with the system's, so be extra safe.
>   */
> -#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0 && !iswcntrl(wc))
> +#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0 && !iswcntrl(wc))
>  #else
> -#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0)
> +#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0)
>  #endif
>  /*
>   * Test for the base of a combining character.

Instead of making this path ever so slightly slower by adding an extra
test that only matters a fraction of the time, could we actually make
it wc > 127 instead? I'm assuming unicode doesn't define anything in
ascii to be a combining character anyway, and then we would avoid the
wcwidth call in Most™ cases.

-- 
Mikael Magnusson


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: PATCH: Don't treat NUL as a combining character
  2015-05-10  0:02 ` Mikael Magnusson
@ 2015-05-10  0:14   ` Bart Schaefer
  2015-05-10  0:28     ` Mikael Magnusson
  0 siblings, 1 reply; 4+ messages in thread
From: Bart Schaefer @ 2015-05-10  0:14 UTC (permalink / raw)
  To: zsh workers

On May 10,  2:02am, Mikael Magnusson wrote:
} Subject: Re: PATCH: Don't treat NUL as a combining character
}
} > -#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0 && !iswcntrl(wc))
} > +#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0 && !iswcntrl(wc))
} 
} Instead of making this path ever so slightly slower by adding an extra
} test that only matters a fraction of the time

I think the test would be no slower (or at least faster than what you have
there) if you just put the wc != 0 as the last branch of the && chain?

} could we actually make it wc > 127 instead?

I don't know the definitive answer to that, but it seems it should also go
at the end.  We're really micro-optimizing here, though.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: PATCH: Don't treat NUL as a combining character
  2015-05-10  0:14   ` Bart Schaefer
@ 2015-05-10  0:28     ` Mikael Magnusson
  0 siblings, 0 replies; 4+ messages in thread
From: Mikael Magnusson @ 2015-05-10  0:28 UTC (permalink / raw)
  To: Bart Schaefer; +Cc: zsh workers

On Sun, May 10, 2015 at 2:14 AM, Bart Schaefer
<schaefer@brasslantern.com> wrote:
> On May 10,  2:02am, Mikael Magnusson wrote:
> } Subject: Re: PATCH: Don't treat NUL as a combining character
> }
> } > -#define IS_COMBINING(wc)       (WCWIDTH(wc) == 0 && !iswcntrl(wc))
> } > +#define IS_COMBINING(wc)       (wc != 0 && WCWIDTH(wc) == 0 && !iswcntrl(wc))
> }
> } Instead of making this path ever so slightly slower by adding an extra
> } test that only matters a fraction of the time
>
> I think the test would be no slower (or at least faster than what you have
> there) if you just put the wc != 0 as the last branch of the && chain?
>
> } could we actually make it wc > 127 instead?
>
> I don't know the definitive answer to that, but it seems it should also go
> at the end.  We're really micro-optimizing here, though.

That's a good point, I put it first because checking value is much
faster than calling a function, but NUL is super rare which I didn't
consider at the time.

-- 
Mikael Magnusson


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2015-05-10  0:29 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-05-09  7:46 PATCH: Don't treat NUL as a combining character Mikael Magnusson
2015-05-10  0:02 ` Mikael Magnusson
2015-05-10  0:14   ` Bart Schaefer
2015-05-10  0:28     ` Mikael Magnusson

Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/zsh/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).