mailing list of musl libc
 help / color / mirror / code / Atom feed
From: Rich Felker <dalias@libc.org>
To: musl@lists.openwall.com
Subject: Re: [musl] [PATCH v2] Add big-endian support to ARM assembler memcpy
Date: Thu, 25 Jun 2020 17:50:42 -0400	[thread overview]
Message-ID: <20200625215041.GT6430@brightrain.aerifal.cx> (raw)
In-Reply-To: <20200121185215.5958-1-armccurdy@gmail.com>

On Tue, Jan 21, 2020 at 10:52:15AM -0800, Andre McCurdy wrote:
> Allow the existing ARM assembler memcpy implementation to be used for
> both big and little endian targets.
> ---
> 
> Exactly the same changes as before but rebased to account for
> whitespace changes in the preceding patch to add Thumb2 support.
> 
>  COPYRIGHT                                |   2 +-
>  src/string/arm/{memcpy_le.S => memcpy.S} | 101 ++++++++++++++++++++++-
>  src/string/arm/memcpy.c                  |   3 -
>  3 files changed, 98 insertions(+), 8 deletions(-)
>  rename src/string/arm/{memcpy_le.S => memcpy.S} (82%)
>  delete mode 100644 src/string/arm/memcpy.c
> 
> diff --git a/COPYRIGHT b/COPYRIGHT
> index e6472371..d3edc2a2 100644
> --- a/COPYRIGHT
> +++ b/COPYRIGHT
> @@ -127,7 +127,7 @@ Copyright © 2017-2018 Arm Limited
>  and labelled as such in comments in the individual source files. All
>  have been licensed under extremely permissive terms.
>  
> -The ARM memcpy code (src/string/arm/memcpy_el.S) is Copyright © 2008
> +The ARM memcpy code (src/string/arm/memcpy.S) is Copyright © 2008
>  The Android Open Source Project and is licensed under a two-clause BSD
>  license. It was taken from Bionic libc, used on Android.
>  
> diff --git a/src/string/arm/memcpy_le.S b/src/string/arm/memcpy.S
> similarity index 82%
> rename from src/string/arm/memcpy_le.S
> rename to src/string/arm/memcpy.S
> index 7b35d305..869e3448 100644
> --- a/src/string/arm/memcpy_le.S
> +++ b/src/string/arm/memcpy.S
> @@ -1,5 +1,3 @@
> -#if !__ARMEB__
> -
>  /*
>   * Copyright (C) 2008 The Android Open Source Project
>   * All rights reserved.
> @@ -42,7 +40,7 @@
>   * code safely callable from thumb mode, adjusting the return
>   * instructions to be compatible with pre-thumb ARM cpus, removal of
>   * prefetch code that is not compatible with older cpus and support for
> - * building as thumb 2.
> + * building as thumb 2 and big-endian.
>   */
>  
>  .syntax unified
> @@ -227,24 +225,45 @@ non_congruent:
>  	 * becomes aligned to 32 bits (r5 = nb of words to copy for alignment)
>  	 */
>  	movs    r5, r5, lsl #31
> +
> +#if __ARMEB__
> +	movmi   r3, r3, ror #24
> +	strbmi	r3, [r0], #1
> +	movcs   r3, r3, ror #24
> +	strbcs	r3, [r0], #1
> +	movcs   r3, r3, ror #24
> +	strbcs	r3, [r0], #1
> +#else
>  	strbmi r3, [r0], #1
>  	movmi   r3, r3, lsr #8
>  	strbcs r3, [r0], #1
>  	movcs   r3, r3, lsr #8
>  	strbcs r3, [r0], #1
>  	movcs   r3, r3, lsr #8
> +#endif
>  
>  	cmp     r2, #4
>  	blo     partial_word_tail
>  
> +#if __ARMEB__
> +	mov	r3, r3, lsr r12
> +	mov	r3, r3, lsl r12
> +#endif
> +
>  	/* Align destination to 32 bytes (cache line boundary) */
>  1:      tst     r0, #0x1c
>  	beq     2f
>  	ldr     r5, [r1], #4
>  	sub     r2, r2, #4
> +#if __ARMEB__
> +	mov     r4, r5,                 lsr lr
> +	orr     r4, r4, r3
> +	mov     r3, r5,                 lsl r12
> +#else
>  	mov     r4, r5,                 lsl lr
>  	orr     r4, r4, r3
>  	mov     r3, r5,                 lsr r12
> +#endif

Am I missing something or are both cases identical here? That would
either indicate this is gratuitous or there's a bug here and they were
intended not to be the same.

> [...]
> @@ -350,9 +429,15 @@ less_than_thirtytwo:
>  
>  1:      ldr     r5, [r1], #4
>  	sub     r2, r2, #4
> +#if __ARMEB__
> +	mov     r4, r5,                 lsr lr
> +	orr     r4, r4, r3
> +	mov     r3,     r5,                     lsl r12
> +#else
>  	mov     r4, r5,                 lsl lr
>  	orr     r4, r4, r3
>  	mov     r3,     r5,                     lsr r12
> +#endif

And again here.

Rich

  parent reply	other threads:[~2020-06-25 22:06 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-01-21 18:52 Andre McCurdy
2020-01-22  0:36 ` Rich Felker
2020-01-22  0:47   ` Andre McCurdy
2020-01-22  1:31     ` Rich Felker
2020-06-25 21:50 ` Rich Felker [this message]
2020-06-25 22:11   ` Andre McCurdy
2020-06-25 22:24     ` Rich Felker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20200625215041.GT6430@brightrain.aerifal.cx \
    --to=dalias@libc.org \
    --cc=musl@lists.openwall.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).