From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-3.3 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, MAILING_LIST_MULTI,RCVD_IN_DNSWL_MED autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 8580 invoked from network); 18 Nov 2022 14:28:00 -0000 Received: from zero.zsh.org (2a02:898:31:0:48:4558:7a:7368) by inbox.vuxu.org with ESMTPUTF8; 18 Nov 2022 14:28:00 -0000 ARC-Seal: i=1; cv=none; a=rsa-sha256; d=zsh.org; s=rsa-20210803; t=1668781680; b=grzI2/AUnIrKVWjYz8b3uW/PlvvxjvLYHn8C9a68AfmiDZ4uWFKJgdjWpL3/450yJ+vLbmhHjb L0Xxr6OtmOmm+Kk/H5dl9Z2I3c8I7mgfUkFXZZ6S8cxigm6DATTcasR80rN2I8z4BA8m+2BAHG vepq5Jw6B8H1uUsQ31J2ROfXDRLu68BxeiapPpJnVCSWfuACUkcJ3VMtEbt4hqanfTckeMK216 RDcl3/qQ21KjwHj3+lUlUsUtcK4bujsV8MBeeP21qATyXrMKLjSm0Ny0DJTAlgfYSHUpjRfgvt GosyyxJpeDz87TQjHVy5qEUEVH3D0cMfgJWGOyj7561O4A==; ARC-Authentication-Results: i=1; zsh.org; iprev=pass (relay6-d.mail.gandi.net) smtp.remote-ip=217.70.183.198; dmarc=none header.from=chazelas.org; arc=none ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed; d=zsh.org; s=rsa-20210803; t=1668781680; bh=5/I35tAzSc9BgdfdnFTEF3Fvh9FVrON1No9Tg3WLQBw=; h=List-Archive:List-Owner:List-Post:List-Unsubscribe:List-Subscribe:List-Help: List-Id:Sender:Content-Transfer-Encoding:Content-Type:MIME-Version: Message-ID:Subject:To:From:Date:DKIM-Signature; b=EU2pmQEHjr+Er2qQzfmGENQ33sqWzauNUWImLULLYCZKiuNqycLr9/Vnt5CO0JjS5bbfU0Koir /0kMsED8f4mT3h2vl0KXXng+O8AdU7qgBmYiU92/b+oGpYGNZGTX/JHe8CHJJp0nuLPML0VOjE saIfPRqxXF+3jvklryrm+U8MJNSDMeS+UR2X/ju6rZNxRfoyWn79tX8C9DmJxpYgdaMJP05o0l +ehV/X8AA8Mx1jx0g4x4xf8y0z9SQ16NE2W/CiCtN7b71iwNGm0SZy8Ys0PwexoNNG2Ogug0m+ 71kUw+JQYeqf3vqf3iPd+jTEmhyA6mRnFB1CVyfcYKUamA==; DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=zsh.org; s=rsa-20210803; h=List-Archive:List-Owner:List-Post:List-Unsubscribe: List-Subscribe:List-Help:List-Id:Sender:Content-Transfer-Encoding: Content-Type:MIME-Version:Message-ID:Subject:To:From:Date:Reply-To:Cc: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:In-Reply-To:References; bh=fZ7S4hlHGZIip2sisdEJ5V3lfAFBd7qYei4cY6C+Ank=; b=g09gW8vykSEDSwGfj+VCcm6dNF jeU5ubw0UDD3MfobfD6C7K3zLkYPp3MhOd2SU8MYMHZI1sKIYeuPmsVuekICr82D++880sIdW/bZV Ni3jPZ+jx0MnenFQAeDp9fsqQOC8B+kGgwXN8/67ecpYMBmK3z28m7x5l/hF9AmGxyDA8F/NprsGM r4Kz/eCvPUEk5gFY0t1ynpbSesE1Sm6DHrZ+DyVxjO0MfZlEhBDwhd7zlvCLge+/EOR37oblYMUcw HawAMPeCNtjDmsG1/RsHfVIpoOe2uqpwxnK4ZOic5lcWkt05fLEAdz8ahteL43K2kMhfgahS0X9A1 i2LoUkkA==; Received: by zero.zsh.org with local id 1ow2LX-0009uU-DN; Fri, 18 Nov 2022 14:27:59 +0000 Authentication-Results: zsh.org; iprev=pass (relay6-d.mail.gandi.net) smtp.remote-ip=217.70.183.198; dmarc=none header.from=chazelas.org; arc=none Received: from relay6-d.mail.gandi.net ([217.70.183.198]:55991) by zero.zsh.org with esmtps (TLS1.2:ECDHE-RSA-AES256-GCM-SHA384:256) id 1ow2Kt-0009YN-7l; Fri, 18 Nov 2022 14:27:21 +0000 Received: (Authenticated sender: stephane@chazelas.org) by mail.gandi.net (Postfix) with ESMTPSA id 52165C0004 for ; Fri, 18 Nov 2022 14:27:17 +0000 (UTC) Date: Fri, 18 Nov 2022 14:27:17 +0000 From: Stephane Chazelas To: Zsh hackers list Subject: [bug] busyloop upon $=var with NULs when $IFS contains both NUL and a byte > 0x7f Message-ID: <20221118142717.t4elzrigjeizjm6w@chazelas.org> Mail-Followup-To: Zsh hackers list MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit X-Seq: 50992 Archived-At: X-Loop: zsh-workers@zsh.org Errors-To: zsh-workers-owner@zsh.org Precedence: list Precedence: bulk Sender: zsh-workers-request@zsh.org X-no-archive: yes List-Id: List-Help: , List-Subscribe: , List-Unsubscribe: , List-Post: List-Owner: List-Archive: $ LC_ALL=C zsh -c 'IFS=é$IFS; echo $=IFS' ^C (busy loop had to be interrupted with ^C). Call trace: (gdb) bt #0 0x00005555556249ad in mb_metacharlenconv (s=0x555555675185 "\203 ", wcp=0x7fffffffc760) at utils.c:5541 #1 0x0000555555622791 in itype_end (ptr=0x555555675185 "\203 ", itype=32, once=1) at utils.c:4332 #2 0x000055555562121e in wordcount (s=0x555555675185 "\203 ", sep=0x0, mul=-1) at utils.c:3835 #3 0x0000555555620b29 in spacesplit (s=0x555555675180 "\303\251 \t\n\203 ", allownull=0, heap=1, quote=0) at utils.c:3650 #4 0x0000555555621495 in sepsplit (s=0x555555675180 "\303\251 \t\n\203 ", sep=0x0, allownull=0, heap=1) at utils.c:3908 #5 0x0000555555612f55 in paramsubst (l=0x7ffff7fbf560, n=0x7ffff7fbf590, str=0x7fffffffce00, qt=0, pf_flags=0, ret_flags=0x7fffffffcfcc) at subst.c:3660 #6 0x000055555560bb37 in stringsubst (list=0x7ffff7fbf560, node=0x7ffff7fbf590, pf_flags=0, ret_flags=0x7fffffffcfcc, asssub=0) at subst.c:322 #7 0x000055555560abbc in prefork (list=0x7ffff7fbf560, flags=0, ret_flags=0x7fffffffcfcc) at subst.c:142 #8 0x0000555555595cfd in execcmd_exec (state=0x7fffffffd940, eparams=0x7fffffffd540, input=0, output=0, how=18, last1=1, close_if_forked=-1) at exec.c:3232 #9 0x0000555555592757 in execpline2 (state=0x7fffffffd940, pcode=131, how=18, input=0, output=0, last1=1) at exec.c:1966 #10 0x0000555555590f30 in execpline (state=0x7fffffffd940, slcode=4098, how=18, last1=1) at exec.c:1691 #11 0x000055555559009a in execlist (state=0x7fffffffd940, dont_change_job=0, exiting=1) at exec.c:1444 #12 0x000055555558f735 in execode (p=0x7ffff7fbf448, dont_change_job=0, exiting=1, context=0x55555562e108 "cmdarg") at exec.c:1221 #13 0x000055555558f60c in execstring (s=0x7fffffffdedc "IFS=\303\251$IFS; echo $=IFS", dont_change_job=0, exiting=1, context=0x55555562e108 "cmdarg") at exec.c:1187 #14 0x00005555555ba5f9 in init_misc (cmd=0x7fffffffdedc "IFS=\303\251$IFS; echo $=IFS", zsh_name=0x7fffffffded5 "zsh") at init.c:1389 #15 0x00005555555bbd2d in zsh_main (argc=3, argv=0x7fffffffdb58) at init.c:1780 #16 0x000055555556ad29 in main (argc=3, argv=0x7fffffffdb58) at ./main.c:93 With +o multibyte, no busy loop, but splitting doesn't work properly: $ LC_ALL=C zsh +o multibyte -c 'IFS=é$IFS; printf "<%q>\n" $=IFS' <$'\303'$'\251'> <''> That's triggered when IFS contains both NUL and a byte over 0x7f (in any order) and when the variable to split contains NUL. In UTF-8 locales, that's triggered when IFS contains NUL and bytes or byte sequences not forming parts of valid characters. "read" doesn't seem to be affected: $ print 'foo\0bar' | LC_ALL=C zsh -c 'IFS=é$IFS read -rA a; typeset a' a=( foo bar ) (that's on Debian GNU/Linux amd64 with zsh git HEAD) -- Stephane