zsh-workers
 help / color / mirror / code / Atom feed
* [PATCH] ztrsub() execution time / 2
@ 2018-10-17  4:29 Sebastian Gniazdowski
  2018-10-17 14:25 ` Daniel Shahaf
  0 siblings, 1 reply; 4+ messages in thread
From: Sebastian Gniazdowski @ 2018-10-17  4:29 UTC (permalink / raw)
  To: Zsh hackers list

[-- Attachment #1: Type: text/plain, Size: 893 bytes --]

Hello,
this optimization is a drop of callgrind instruction count by 43 mln
for ztrsub() – it had 89 mln instructions normally, now it has 46 mln:

ztrsub() callgrind I-count:
1> (89873326 - 46452836) / 1000000.0
43,4205

total callgrind I-count
2> (7906279976 - 7858975498) / 1000000.0
47,3045

 89,873,326  ztrsub [/usr/local/bin/zsh-5.6.2-dev-1-2noopt]
vs
 46,452,836  ztrsub [/usr/local/bin/zsh-5.6.2-dev-1-2yesopt]

Ran this callgrind probe twice to confirm. That said, `zsh -i -c exit'
nor a performance test script show any gain. I wonder why.. But
callgrind doesn't lie, and looking at the code one can understand why
I-count dropped by factor of 2. The patch has many context lines to
see whole function.
-- 
Sebastian Gniazdowski
News: https://twitter.com/ZdharmaI
IRC: https://kiwiirc.com/client/chat.freenode.net:+6697/#zplugin
Blog: http://zdharma.org

[-- Attachment #2: ztrsub_opt.diff.txt --]
[-- Type: text/plain, Size: 633 bytes --]

diff --git a/Src/utils.c b/Src/utils.c
index 914e30c..1eb8c71 100644
--- a/Src/utils.c
+++ b/Src/utils.c
@@ -5058,23 +5058,23 @@ ztrlenend(char const *s, char const *eptr)
 #endif /* MULTIBYTE_SUPPORT */
 
 /* Subtract two pointers in a metafied string. */
 
 /**/
 mod_export int
 ztrsub(char const *t, char const *s)
 {
     int l = t - s;
 
     while (s != t) {
-	if (*s++ == Meta) {
+	if (*s++ == Meta || (s != t && *s++ == Meta)) {
 #ifdef DEBUG
 	    if (! *s || s == t)
 		fprintf(stderr, "BUG: substring ends in the middle of a metachar in ztrsub()\n");
 	    else
 #endif
 	    s++;
 	    l--;
 	}
     }
     return l;
 }

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] ztrsub() execution time / 2
  2018-10-17  4:29 [PATCH] ztrsub() execution time / 2 Sebastian Gniazdowski
@ 2018-10-17 14:25 ` Daniel Shahaf
  2018-10-17 19:02   ` Sebastian Gniazdowski
  0 siblings, 1 reply; 4+ messages in thread
From: Daniel Shahaf @ 2018-10-17 14:25 UTC (permalink / raw)
  To: Sebastian Gniazdowski; +Cc: Zsh hackers list

Sebastian Gniazdowski wrote on Wed, Oct 17, 2018 at 06:29:08 +0200:
> diff --git a/Src/utils.c b/Src/utils.c
> index 914e30c..1eb8c71 100644
> --- a/Src/utils.c
> +++ b/Src/utils.c
> @@ -5058,23 +5058,23 @@ ztrlenend(char const *s, char const *eptr)
>  #endif /* MULTIBYTE_SUPPORT */
>  
>  /* Subtract two pointers in a metafied string. */
>  
>  /**/
>  mod_export int
>  ztrsub(char const *t, char const *s)
>  {
>      int l = t - s;
>  
>      while (s != t) {
> -	if (*s++ == Meta) {
> +	if (*s++ == Meta || (s != t && *s++ == Meta)) {

Thank you for generating the diff with additional context lines.

As far as I can tell, this patch is the sort of transformation that an
optimizing compiler should do by itself.  In fact, I'm having a hard time
seeing how the sequence of "BRANCH IF EQUAL" / "BRANCH IF NOT EQUAL" /
"INCREMENT" assembly instructions would be altered by the patch.

If your profiling was performed on zsh compiled with -O3, I would suggest that
you file a bug against your compiler's optimizer.

>  #ifdef DEBUG
>  	    if (! *s || s == t)
>  		fprintf(stderr, "BUG: substring ends in the middle of a metachar in ztrsub()\n");
>  	    else
>  #endif
>  	    s++;
>  	    l--;

Cheers,

Daniel

>  	}
>      }
>      return l;
>  }


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] ztrsub() execution time / 2
  2018-10-17 14:25 ` Daniel Shahaf
@ 2018-10-17 19:02   ` Sebastian Gniazdowski
  2018-10-17 19:34     ` Daniel Shahaf
  0 siblings, 1 reply; 4+ messages in thread
From: Sebastian Gniazdowski @ 2018-10-17 19:02 UTC (permalink / raw)
  To: Daniel Shahaf; +Cc: Zsh hackers list

On Wed, 17 Oct 2018 at 16:25, Daniel Shahaf <d.s@daniel.shahaf.name> wrote:
> As far as I can tell, this patch is the sort of transformation that an
> optimizing compiler should do by itself.  In fact, I'm having a hard time
> seeing how the sequence of "BRANCH IF EQUAL" / "BRANCH IF NOT EQUAL" /
> "INCREMENT" assembly instructions would be altered by the patch.
>
> If your profiling was performed on zsh compiled with -O3, I would suggest that
> you file a bug against your compiler's optimizer.

Why not "do it right, then do it yourself"? Why ignore -O0 zsh users?
Should I subscribe to compilers mailing lists and investigate this
well defined 1-line change, i.e. possible to grasp by logic?

--
Sebastian Gniazdowski
News: https://twitter.com/ZdharmaI
IRC: https://kiwiirc.com/client/chat.freenode.net:+6697/#zplugin
Blog: http://zdharma.org

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] ztrsub() execution time / 2
  2018-10-17 19:02   ` Sebastian Gniazdowski
@ 2018-10-17 19:34     ` Daniel Shahaf
  0 siblings, 0 replies; 4+ messages in thread
From: Daniel Shahaf @ 2018-10-17 19:34 UTC (permalink / raw)
  To: zsh-workers

Sebastian Gniazdowski wrote on Wed, 17 Oct 2018 21:02 +0200:
> On Wed, 17 Oct 2018 at 16:25, Daniel Shahaf <d.s@daniel.shahaf.name> wrote:
> > As far as I can tell, this patch is the sort of transformation that an
> > optimizing compiler should do by itself.  In fact, I'm having a hard time
> > seeing how the sequence of "BRANCH IF EQUAL" / "BRANCH IF NOT EQUAL" /
> > "INCREMENT" assembly instructions would be altered by the patch.
> >
> > If your profiling was performed on zsh compiled with -O3, I would suggest that
> > you file a bug against your compiler's optimizer.
> 
> Why not "do it right, then do it yourself"? Why ignore -O0 zsh users?
> Should I subscribe to compilers mailing lists and investigate this
> well defined 1-line change, i.e. possible to grasp by logic?

I have a hard time charitably interpreting what you just wrote.  It
doesn't come across as constructive.

I think the patch should not be applied because it makes the code harder
to read and to maintain.

Daniel

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2018-10-17 19:34 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-10-17  4:29 [PATCH] ztrsub() execution time / 2 Sebastian Gniazdowski
2018-10-17 14:25 ` Daniel Shahaf
2018-10-17 19:02   ` Sebastian Gniazdowski
2018-10-17 19:34     ` Daniel Shahaf

Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/zsh/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).