From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 7079 invoked by alias); 17 Sep 2012 05:59:38 -0000 Mailing-List: contact zsh-workers-help@zsh.org; run by ezmlm Precedence: bulk X-No-Archive: yes List-Id: Zsh Workers List List-Post: List-Help: X-Seq: 30671 Received: (qmail 18467 invoked from network); 17 Sep 2012 05:59:36 -0000 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on f.primenet.com.au X-Spam-Level: X-Spam-Status: No, score=-3.3 required=5.0 tests=BAYES_00,DKIM_ADSP_ALL, DKIM_SIGNED,RCVD_IN_DNSWL_MED,T_DKIM_INVALID,UNPARSEABLE_RELAY autolearn=ham version=3.3.2 Received-SPF: none (ns1.primenet.com.au: domain at spodhuis.org does not designate permitted sender hosts) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=spodhuis.org; s=d201107; h=In-Reply-To:Content-Type:MIME-Version:References:Message-ID:Subject:To:From:Date; bh=27sKg/mWIFth068TIntkPrKn3Xffuamq/3yxBm4zeUM=; b=TrfqBcGOY8yuLE7bS3Eflq+9EuGXEmkY3QVlK5ahfGUIGeZhOjpWs8mY7vpHGFPTTnT93Yn+Sv4q+JyfuUz+i8vHu1E5tJB/RW36vvG0QtV7MucTWlEpbBMD4zaZt+Lfi3v88efUu7Xl2kQyqoHl7740wxwJZdx3vBoZEpcue3w=; Date: Mon, 17 Sep 2012 01:59:25 -0400 From: Phil Pennock To: zsh-workers@zsh.org Subject: Re: PATCH: PCRE support for embedded NUL characters Message-ID: <20120917055925.GA32663@redoubt.spodhuis.org> Mail-Followup-To: zsh-workers@zsh.org References: <20120916125015.GA87764@redoubt.spodhuis.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20120916125015.GA87764@redoubt.spodhuis.org> On 2012-09-16 at 08:50 -0400, Phil Pennock wrote: > Another open question: are $mbegin/$mend offsets supposed to be in > octets or in characters? Given the MB_ prefix, I'm guessing I just > broke this and will need to fix it tomorrow, before commit, after I get > some sleep. Do we have a decent way to count the number of wide > characters in an unmetafied string which can contain NUL characters? Anyone have any opinions of what ZPCRE_OP should count? It's currently documented as counting bytes, which is certainly accurate, but seems not very useful given that you can't index strings by byte offsets in zsh (or can you, with some option I've not noticed before?) -Phil