From: "Stefan Kanthak" <stefan.kanthak@nexgo.de>
To: "Alexander Monakov" <amonakov@ispras.ru>
Cc: "Szabolcs Nagy" <nsz@port70.net>, <musl@lists.openwall.com>
Subject: Re: [musl] [Patch] src/math/i386/remquo.s: remove conditional branch, shorter bit twiddling
Date: Fri, 6 Aug 2021 12:17:12 +0200
Message-ID: <6C4DCCC86B014B68877D73C798F54180@H270>
In-Reply-To: <alpine.LNX.2.20.13.2108051626420.2536@monopod.intra.ispras.ru>
[-- Attachment #1: Type: text/plain, Size: 601 bytes --]
Alexander Monakov <amonakov@ispras.ru> wrote:
> On Wed, 4 Aug 2021, Stefan Kanthak wrote:
>> The change just follows by removing 6 LOC/instructions.-)
>
> Have you considered collecting the three bits in one go via a multiplication?
No. My mind is not that twisted;-)
> You can first isolate the necessary bits with 'and $0x4300, %eax', then do
> 'imul $0x910000, %eax, %eax' to put the required bits in EAX[31:29] in the
> right order, then shift right by 29. Three instructions, 14 bytes.
Thanks, VERY NICE! How did you come up with it?
Revised patch with shorter bit twiddling attached.
Stefan
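For readers following the suggestion quoted above, here is a minimal C sketch
of the multiplication trick; it is an illustration, not part of the patch.
After fprem1 the low three quotient bits sit in the status word as C0 (bit 8,
quotient bit 2), C3 (bit 14, quotient bit 1) and C1 (bit 9, quotient bit 0),
which is the layout the removed shift-and-mask sequence decodes. The helper
names quo_bits_naive and quo_bits_mul are made up for this example.

```c
#include <stdint.h>

/* Positions of the x87 condition codes in the FPU status word. */
#define X87_C0 (1u << 8)   /* quotient bit 2 after fprem1 */
#define X87_C1 (1u << 9)   /* quotient bit 0 after fprem1 */
#define X87_C3 (1u << 14)  /* quotient bit 1 after fprem1 */

/* Reference version: test each condition code separately. */
static uint32_t quo_bits_naive(uint32_t sw)
{
    return ((sw & X87_C0) ? 4u : 0u)
         | ((sw & X87_C3) ? 2u : 0u)
         | ((sw & X87_C1) ? 1u : 0u);
}

/* Multiplication version: mirrors the and / imul / shr sequence. */
static uint32_t quo_bits_mul(uint32_t sw)
{
    uint32_t x = sw & 0x4300u;  /* keep only C3, C1 and C0 */
    x *= 0x910000u;             /* 2^23 + 2^20 + 2^16, wraps modulo 2^32 */
    return x >> 29;             /* bits 31:29 now hold C0, C3, C1 */
}
```

The multiplier adds the masked value to itself at three shifts. C0 lands on
bit 31, C3 on bit 30 and C1 on bit 29; the remaining partial products fall on
bits 24, 25 and 28 or past bit 31, so no carry reaches the top three bits.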
[-- Attachment #2: remquo.patch --]
[-- Type: application/octet-stream, Size: 1073 bytes --]
--- -remquo.s
+++ +remquo.s
@@ -2,49 +2,41 @@
.type remquof,@function
remquof:
mov 12(%esp),%ecx
+ mov 8(%esp),%eax
+ xor 4(%esp),%eax
flds 8(%esp)
flds 4(%esp)
- mov 11(%esp),%dh
- xor 7(%esp),%dh
- jmp 1f
+ jmp 0f
.global remquol
.type remquol,@function
remquol:
mov 28(%esp),%ecx
+ mov 24(%esp),%eax
+ xor 12(%esp),%eax
+ cwtl
fldt 16(%esp)
fldt 4(%esp)
- mov 25(%esp),%dh
- xor 13(%esp),%dh
- jmp 1f
+ jmp 0f
.global remquo
.type remquo,@function
remquo:
mov 20(%esp),%ecx
+ mov 16(%esp),%eax
+ xor 8(%esp),%eax
fldl 12(%esp)
fldl 4(%esp)
- mov 19(%esp),%dh
- xor 11(%esp),%dh
+0: cltd
1: fprem1
fnstsw %ax
sahf
jp 1b
fstp %st(1)
- mov %ah,%dl
- shr %dl
- and $1,%dl
- mov %ah,%al
- shr $5,%al
- and $2,%al
- or %al,%dl
- mov %ah,%al
- shl $2,%al
- and $4,%al
- or %al,%dl
- test %dh,%dh
- jns 1f
- neg %dl
-1: movsbl %dl,%edx
- mov %edx,(%ecx)
+ and $0x4300,%eax
+ imul $0x910000,%eax,%eax
+ shr $29,%eax
+ xor %edx,%eax
+ sub %edx,%eax
+ mov %eax,(%ecx)
ret
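The branchless sign handling in the patch can be sketched in C as well. This
assumes two's-complement arithmetic: cltd leaves 0 or -1 in EDX depending on
the sign bit of the XORed high words, and xor followed by sub with that mask
negates the value only when the mask is -1. The function name signed_quo and
its parameters are hypothetical.

```c
#include <stdint.h>

/*
 * bits:   the three quotient bits (0..7) extracted from the status word
 * hx, hy: the words of x and y that carry their sign in bit 31
 */
static int32_t signed_quo(uint32_t bits, uint32_t hx, uint32_t hy)
{
    /* 0 if the signs agree, -1 if they differ (what cltd leaves in EDX) */
    int32_t mask = -(int32_t)((hx ^ hy) >> 31);

    /* (v ^ 0) - 0 == v, and (v ^ -1) - (-1) == ~v + 1 == -v */
    return ((int32_t)bits ^ mask) - mask;
}
```

For remquol the sign lives in a 16-bit word, which is why the patch adds cwtl
to sign-extend AX into EAX before the shared cltd.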