* Re: Inconsistent behavior with comparisons and recursive glob patterns
[not found] ` <CAH+w=7YcdDM6enW37=j3LvOaBnPvazvE9DcHKiJ7VSz-zF7SgQ@mail.gmail.com>
@ 2024-04-30 20:08 ` Bart Schaefer
2024-05-05 6:29 ` Alan Wagner-Krankel
0 siblings, 1 reply; 7+ messages in thread
From: Bart Schaefer @ 2024-04-30 20:08 UTC (permalink / raw)
To: Zsh hackers list; +Cc: Alan Wagner-Krankel
[-- Attachment #1: Type: text/plain, Size: 569 bytes --]
(Moved from -users)
On Tue, Apr 30, 2024 at 11:15 AM Bart Schaefer
<schaefer@brasslantern.com> wrote:
>
> I suspect there's no explicit reasoning in zmv. **/ is specifically
> handled in the case of zmv -w/-W (workers/27247) but not for '$f'
> placeholders.
>
> It works if you do this:
>
> zmv -n '(**/)f?' '$f.txt'
>
> Whether it should also work without the parens and also without the -w
> option is unclear.
Attached patch makes it work without the parens. Is this within spec?
It's hard to tell from the solitary '$f' example in the doc.
[-- Attachment #2: zmv-recur.txt --]
[-- Type: text/plain, Size: 871 bytes --]
diff --git a/Functions/Misc/zmv b/Functions/Misc/zmv
index 5c03e9ea1..2002af5a6 100644
--- a/Functions/Misc/zmv
+++ b/Functions/Misc/zmv
@@ -249,13 +249,13 @@ errs=()
(( ${#files} )) || errs=( "no files matched \`$fpat'" )
for f in $files; do
- if [[ $pat = (#b)(*)\(\*\*##/\)(*) ]]; then
+ if [[ $pat = (#b)(*)(\(\*\*##/\)|\*\*##/)(*) ]]; then
# This looks like a recursive glob. This isn't good enough,
- # because we should really enforce that $match[1] and $match[2]
+ # because we should really enforce that $match[1] and $match[3]
# don't match slashes unless they were explicitly given. But
# it's a start. It's fine for the classic case where (**/) is
# at the start of the pattern.
- pat="$match[1](*/|)$match[2]"
+ pat="$match[1](*/|)$match[3]"
fi
[[ -e $f && $f = (#b)${~pat} ]] || continue
set -- "$match[@]"
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
[not found] ` <c97c8fe9-2f56-45b7-bfb2-9f2a97283859@gmx.com>
[not found] ` <CAH+w=7YcdDM6enW37=j3LvOaBnPvazvE9DcHKiJ7VSz-zF7SgQ@mail.gmail.com>
@ 2024-05-01 7:55 ` Alan Wagner-Krankel
2024-05-01 8:38 ` Peter Stephenson
1 sibling, 1 reply; 7+ messages in thread
From: Alan Wagner-Krankel @ 2024-05-01 7:55 UTC (permalink / raw)
To: Eric Cook, zsh-workers
Thanks - I did miss that **/ wasn't supported in conditional expressions.
I wonder if there might be some value in making that explicit in the
documentation, possibly by changing the paragraph you referenced to
something like this:
Pattern metacharacters are active for the pattern arguments. The patterns
are the same as those used for filename generation, see zshexpn(1), but
there is no special behaviour of `/' nor initial dots, shorthand operators
such as **/ are not converted to their expanded forms, and glob qualifiers
are only allowed for forcing filenames to be generated as described above.
There's a slight issue that "such as **/" implies there are many
shorthand operators; I have no idea if there are any others besides
**/ and ***/.
Thanks,
Awk
On Tue, Apr 30, 2024 at 9:11 AM Eric Cook <llua@gmx.com> wrote:
>
> On 4/30/24 04:14, Alan Wagner-Krankel wrote:
> > The results of these conditional expressions using recursive glob
> > operators were unexpectedly different:
> >
> >> [[ f0 = **/f? ]] && print true || print false
> > false
> >> setopt extendedglob
> >> [[ f0 = (*/)#f? ]] && print true || print false
> > true
>
> A couple things about [[:
>
> ```
> Pattern metacharacters are active for the pattern arguments; the patterns are
> the same as those used for filename generation, see zshexpn(1), but there is
> no special behaviour of ‘/' nor initial dots, and no glob qualifiers are al‐
> lowed.
> ```
>
> That last bit about glob qualifiers is out of date since you can now use them to
> force filename generation on in [[. but the point is you aren't doing filename
> generation in [[ (by default), which is where **/ is shorthand for (*/)#.
> pattern matching/globbing is tightly coupled to filename generation in unix shells
> but there are difference.
>
> Even when you use a glob qualifier to force filename generation to happen, you
> still wasn't going to get the behavior you desired since the pattern would've
> underwent filename generation before the comparison that [['s =/== does.
>
> % (set -x; [[ f0 = **/f?(#qN) ]]; touch f2; [[ f0 = **/f?(#qN) ]])
> +zsh:3> [[ f0 = ]]
> +zsh:3> touch f2
> +zsh:3> [[ f0 = f2 ]]
>
> >
> > Since **/ is a shorthand version of (*/)#, it seems like they both
> > should have returned 'true'. The comparisons behave the same when
> > there is at least one directory in the path:
> >
> >> [[ d1/f1 = **/f? ]] && print true || print false
> > true
> >> setopt extendedglob
> >> [[ d1/f1 = (*/)#f? ]] && print true || print false
> > true
> >
> > I came across this because it affects the behavior of zmv, via a test
> > that filters filenames (line 254 in zsh5.9). An example:
> >
> >> mkdir d1 d1/d2
> >> touch f0 d1/f1 d1/d2/f2
> >> print -rl -- **/f?
> > d1/d2/f2
> > d1/f1
> > f0
> >> autoload zmv
> >> zmv -n '**/f?' '$f.txt'
> > mv -- d1/d2/f2 d1/d2/f2.txt
> > mv -- d1/f1 d1/f1.txt
> >> zmv -n '(*/)#f?' '$f.txt'
> > mv -- d1/d2/f2 d1/d2/f2.txt
> > mv -- d1/f1 d1/f1.txt
> > mv -- f0 f0.txt
> >
> > I think both zmv calls should have attempted to update the 'f0' file
> > in the base directory. Am I missing something?
> >
> > Thanks,
> > Awk
> >
> I'll leave zmv's implementation for someone else to answer but i suspect the reasoning is similar.
>
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
2024-05-01 7:55 ` Alan Wagner-Krankel
@ 2024-05-01 8:38 ` Peter Stephenson
2024-05-05 6:33 ` Alan Wagner-Krankel
0 siblings, 1 reply; 7+ messages in thread
From: Peter Stephenson @ 2024-05-01 8:38 UTC (permalink / raw)
To: Alan Wagner-Krankel, zsh-workers
> On 01/05/2024 08:55 BST Alan Wagner-Krankel <awk@awkwork.com> wrote:
> Thanks - I did miss that **/ wasn't supported in conditional expressions.
>
> I wonder if there might be some value in making that explicit in the
> documentation, possibly by changing the paragraph you referenced to
> something like this:
>
> Pattern metacharacters are active for the pattern arguments. The patterns
> are the same as those used for filename generation, see zshexpn(1), but
> there is no special behaviour of `/' nor initial dots, shorthand operators
> such as **/ are not converted to their expanded forms, and glob qualifiers
> are only allowed for forcing filenames to be generated as described above.
>
> There's a slight issue that "such as **/" implies there are many
> shorthand operators; I have no idea if there are any others besides
> **/ and ***/.
That's a sensible suggestion, how about this? Please do comment further.
pws
diff --git a/Doc/Zsh/cond.yo b/Doc/Zsh/cond.yo
index 000e576..4216f89 100644
--- a/Doc/Zsh/cond.yo
+++ b/Doc/Zsh/cond.yo
@@ -241,7 +241,10 @@ ifnzman(\
noderef(Filename Generation)\
)\
, but there is no special behaviour
-of `tt(/)' nor initial dots, and no glob qualifiers are allowed.
+of `tt(/)' nor initial dot, and the patterns `tt(**/)' and `tt(***/)' behave
+the same as (*/), in which the tt(*) may match one or more `tt(/)' characters.
+Also, no bare glob qualifiers are allowed, though the form `(#q)var(...)' is
+allowed as shown above.
In each of the above expressions, if
var(file) is of the form `tt(/dev/fd/)var(n)',
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
2024-04-30 20:08 ` Inconsistent behavior with comparisons and recursive glob patterns Bart Schaefer
@ 2024-05-05 6:29 ` Alan Wagner-Krankel
0 siblings, 0 replies; 7+ messages in thread
From: Alan Wagner-Krankel @ 2024-05-05 6:29 UTC (permalink / raw)
To: Bart Schaefer; +Cc: Zsh hackers list
I dug through StackOverflow a bit to see if I could find how folks in
the wild were expecting zmv to work. Interestingly, most of the posts
that called zmv with `**/` and '$f' were by the same author. In one if
them (https://unix.stackexchange.com/a/271816), they explicitly stated
that "zmv '(**/)(*)' $1$2..." and "zmv '**/*' '$f...'" were
equivalent.
In the absence of a specific statement in the documentation about
'**/' and '$f', it seems safe to assume that the intended spec was
that '**/' would behave the same with the various zmv flavors as it
does in other situations.
Thanks,
Awk
On Tue, Apr 30, 2024 at 3:09 PM Bart Schaefer <schaefer@brasslantern.com> wrote:
>
> (Moved from -users)
>
> On Tue, Apr 30, 2024 at 11:15 AM Bart Schaefer
> <schaefer@brasslantern.com> wrote:
> >
> > I suspect there's no explicit reasoning in zmv. **/ is specifically
> > handled in the case of zmv -w/-W (workers/27247) but not for '$f'
> > placeholders.
> >
> > It works if you do this:
> >
> > zmv -n '(**/)f?' '$f.txt'
> >
> > Whether it should also work without the parens and also without the -w
> > option is unclear.
>
> Attached patch makes it work without the parens. Is this within spec?
> It's hard to tell from the solitary '$f' example in the doc.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
2024-05-01 8:38 ` Peter Stephenson
@ 2024-05-05 6:33 ` Alan Wagner-Krankel
2024-05-07 15:08 ` Peter Stephenson
0 siblings, 1 reply; 7+ messages in thread
From: Alan Wagner-Krankel @ 2024-05-05 6:33 UTC (permalink / raw)
To: Peter Stephenson; +Cc: zsh-workers
I think there's some risk that "... '*' may match one or more '/'
characters" could be misinterpreted as "... '*' may *only* match one
or more '/' characters".
Maybe:
the patterns `tt(**/)' and `tt(***/)' behave the same as `tt(*/)',
i.e. as a standard wildcard tt(*) that could match any set of zero or
more characters (including `tt(/)'), followed by a `tt(/)'.
Thanks,
Awk
On Wed, May 1, 2024 at 3:38 AM Peter Stephenson
<p.w.stephenson@ntlworld.com> wrote:
>
> > On 01/05/2024 08:55 BST Alan Wagner-Krankel <awk@awkwork.com> wrote:
> > Thanks - I did miss that **/ wasn't supported in conditional expressions.
> >
> > I wonder if there might be some value in making that explicit in the
> > documentation, possibly by changing the paragraph you referenced to
> > something like this:
> >
> > Pattern metacharacters are active for the pattern arguments. The patterns
> > are the same as those used for filename generation, see zshexpn(1), but
> > there is no special behaviour of `/' nor initial dots, shorthand operators
> > such as **/ are not converted to their expanded forms, and glob qualifiers
> > are only allowed for forcing filenames to be generated as described above.
> >
> > There's a slight issue that "such as **/" implies there are many
> > shorthand operators; I have no idea if there are any others besides
> > **/ and ***/.
>
> That's a sensible suggestion, how about this? Please do comment further.
>
> pws
>
> diff --git a/Doc/Zsh/cond.yo b/Doc/Zsh/cond.yo
> index 000e576..4216f89 100644
> --- a/Doc/Zsh/cond.yo
> +++ b/Doc/Zsh/cond.yo
> @@ -241,7 +241,10 @@ ifnzman(\
> noderef(Filename Generation)\
> )\
> , but there is no special behaviour
> -of `tt(/)' nor initial dots, and no glob qualifiers are allowed.
> +of `tt(/)' nor initial dot, and the patterns `tt(**/)' and `tt(***/)' behave
> +the same as (*/), in which the tt(*) may match one or more `tt(/)' characters.
> +Also, no bare glob qualifiers are allowed, though the form `(#q)var(...)' is
> +allowed as shown above.
>
> In each of the above expressions, if
> var(file) is of the form `tt(/dev/fd/)var(n)',
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
2024-05-05 6:33 ` Alan Wagner-Krankel
@ 2024-05-07 15:08 ` Peter Stephenson
2024-05-08 5:59 ` Alan Wagner-Krankel
0 siblings, 1 reply; 7+ messages in thread
From: Peter Stephenson @ 2024-05-07 15:08 UTC (permalink / raw)
To: Alan Wagner-Krankel, zsh-workers
> On 05/05/2024 07:33 BST Alan Wagner-Krankel <awk@awkwork.com> wrote:
> I think there's some risk that "... '*' may match one or more '/'
> characters" could be misinterpreted as "... '*' may *only* match one
> or more '/' characters".
>
> Maybe:
> the patterns `tt(**/)' and `tt(***/)' behave the same as `tt(*/)',
> i.e. as a standard wildcard tt(*) that could match any set of zero or
> more characters (including `tt(/)'), followed by a `tt(/)'.
Hmm, I'd perhaps limit that to only the information we wish to add to
keep it simple...
pws
diff --git a/Doc/Zsh/cond.yo b/Doc/Zsh/cond.yo
index 000e576..c58aea6 100644
--- a/Doc/Zsh/cond.yo
+++ b/Doc/Zsh/cond.yo
@@ -241,7 +241,11 @@ ifnzman(\
noderef(Filename Generation)\
)\
, but there is no special behaviour
-of `tt(/)' nor initial dots, and no glob qualifiers are allowed.
+of `tt(/)' nor initial dot, and the patterns `tt(**/)' and `tt(***/)' behave
+the same as `tt(*/)', in which the `tt(*)' has its standard behaviour
+but may also match further `tt(/)' characters. Also, no bare glob
+qualifiers are allowed, though the form `((#q)var(...))' is allowed as
+shown above.
In each of the above expressions, if
var(file) is of the form `tt(/dev/fd/)var(n)',
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: Inconsistent behavior with comparisons and recursive glob patterns
2024-05-07 15:08 ` Peter Stephenson
@ 2024-05-08 5:59 ` Alan Wagner-Krankel
0 siblings, 0 replies; 7+ messages in thread
From: Alan Wagner-Krankel @ 2024-05-08 5:59 UTC (permalink / raw)
To: Peter Stephenson; +Cc: zsh-workers
Excellent. It's succinct and conveys the necessary information.
Thanks,
Awk
On Tue, May 7, 2024 at 10:08 AM Peter Stephenson
<p.w.stephenson@ntlworld.com> wrote:
>
> > On 05/05/2024 07:33 BST Alan Wagner-Krankel <awk@awkwork.com> wrote:
> > I think there's some risk that "... '*' may match one or more '/'
> > characters" could be misinterpreted as "... '*' may *only* match one
> > or more '/' characters".
> >
> > Maybe:
> > the patterns `tt(**/)' and `tt(***/)' behave the same as `tt(*/)',
> > i.e. as a standard wildcard tt(*) that could match any set of zero or
> > more characters (including `tt(/)'), followed by a `tt(/)'.
>
> Hmm, I'd perhaps limit that to only the information we wish to add to
> keep it simple...
>
> pws
>
> diff --git a/Doc/Zsh/cond.yo b/Doc/Zsh/cond.yo
> index 000e576..c58aea6 100644
> --- a/Doc/Zsh/cond.yo
> +++ b/Doc/Zsh/cond.yo
> @@ -241,7 +241,11 @@ ifnzman(\
> noderef(Filename Generation)\
> )\
> , but there is no special behaviour
> -of `tt(/)' nor initial dots, and no glob qualifiers are allowed.
> +of `tt(/)' nor initial dot, and the patterns `tt(**/)' and `tt(***/)' behave
> +the same as `tt(*/)', in which the `tt(*)' has its standard behaviour
> +but may also match further `tt(/)' characters. Also, no bare glob
> +qualifiers are allowed, though the form `((#q)var(...))' is allowed as
> +shown above.
>
> In each of the above expressions, if
> var(file) is of the form `tt(/dev/fd/)var(n)',
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2024-05-08 6:01 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <CAGZNKjL2cJhLopFPAFFTewh2cHxX+_L4vMx2Nf9bAS0ud58RBw@mail.gmail.com>
[not found] ` <c97c8fe9-2f56-45b7-bfb2-9f2a97283859@gmx.com>
[not found] ` <CAH+w=7YcdDM6enW37=j3LvOaBnPvazvE9DcHKiJ7VSz-zF7SgQ@mail.gmail.com>
2024-04-30 20:08 ` Inconsistent behavior with comparisons and recursive glob patterns Bart Schaefer
2024-05-05 6:29 ` Alan Wagner-Krankel
2024-05-01 7:55 ` Alan Wagner-Krankel
2024-05-01 8:38 ` Peter Stephenson
2024-05-05 6:33 ` Alan Wagner-Krankel
2024-05-07 15:08 ` Peter Stephenson
2024-05-08 5:59 ` Alan Wagner-Krankel
Code repositories for project(s) associated with this public inbox
https://git.vuxu.org/mirror/zsh/
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).