From: "Stefan Kanthak" <stefan.kanthak@nexgo.de>
To: "Rich Felker" <dalias@libc.org>
Cc: <musl@lists.openwall.com>
Subject: Re: More patches for math subtree
Date: Wed, 11 Dec 2019 10:53:41 +0100 [thread overview]
Message-ID: <F2BD74F09B4748A094E99A4B8C5CBB4A@H270> (raw)
In-Reply-To: <20191210221738.GL1666@brightrain.aerifal.cx>
"Rich Felker" <dalias@libc.org> wrote:
> On Tue, Dec 10, 2019 at 10:32:26PM +0100, Stefan Kanthak wrote:
[ asm vs. C ]
>> Does any compiler emit branch-free instruction sequences like the
>> following for Intel CPUs without SSE4.1, i.e. without ROUNDSS/ROUNDSD?
>>
>> .code ; Intel syntax
>> ceil proc public
>> extern __real@8000000000000000:real8
>> movsd xmm1, __real@8000000000000000
>> extern __real@3ff0000000000000:real8
>> movsd xmm2, __real@3ff0000000000000
>> extern __real@4330000000000000:real8
>> movsd xmm3, __real@4330000000000000
>> movsd xmm4, xmm1
>> andnpd xmm1, xmm0
>> andpd xmm4, xmm0
>> cmpltsd xmm1, xmm3
>> andpd xmm1, xmm3
>> orpd xmm1, xmm4
>> movsd xmm3, xmm0
>> addsd xmm0, xmm1
>> subsd xmm0, xmm1
>> movsd xmm1, xmm0
>> cmpltsd xmm0, xmm3
>> andpd xmm0, xmm2
>> addsd xmm0, xmm1
>> orpd xmm0, xmm4
>> ret
>> ceil endp
>>
>> Or instruction sequences like
>>
>> .code ; Intel syntax
>> copysign proc public
>> movd rcx, xmm0
>> movd rdx, xmm1
>> shld rcx, rdx, 1
>> ror rcx, 1
>> movd xmm0, rcx
>> ret
>> copysign endp
>
> Not quite (but it might be possible to write the C in terms of shifts
> instead of masks such that it does), but I also don't think it's clear
> which version is better. Yours here is mildly smaller and might
> perform better, but when making changes that aren't clearly better
> there should be some evidence that it's actually an improvement --
> especially if it's not just improving existing arch optimizations but
> adding new ones where the C was formerly used.
Correct.
I expect the compiler to emit such properly optimised code instead of
calls to the library for standard functions like copysign(), fdim(),
etc. which can be written with just a few instructions ... what the
compiler but not (always) does.
JFTR: I don't know whether GCC or clang either provide intrinsics or
__builtin_* for such (or all those) small standard functions.
> Generally musl avoids asm and arch-specific files as much as possible,
> using them only for things that aren't representable in C or where
> the C is a lot larger or slower or both.
>
>> .code ; Intel syntax
>> fdim proc public
>> movsd xmm2, xmm0
>> cmpsd xmm0, xmm1, 6
>> subsd xmm2, xmm1
>> andpd xmm0, xmm2
>> ret
>> fdim endp
>
> Does this handle nans correctly?
Of course! It's equivalent to
double fdim(double a, double b)
{
uint64_t mask = (a <= b) ? 0ull : ~0ull;
union {double dbl; uint64_t ull;} u = {a - b};
u.ull &= mask;
return u.dbl;
}
[...]
> OK. I don't mind looking at these patches further as-is, and I'll try
> to continue offering constructive comments now, but it'll be after
> this release cycle (hopefully wrapping that up in the next week or so)
> before consideration for merging. musl 1.2.0 is already going to be a
> release with big changes (time64) and I don't want to risk subtle
> breakage with new changes that haven't been reviewed in detail yet or
> had time for users to test.
That's OK.
Stefan
next prev parent reply other threads:[~2019-12-11 9:53 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-12-10 16:57 Stefan Kanthak
2019-12-10 19:35 ` Rich Felker
2019-12-10 21:32 ` Stefan Kanthak
2019-12-10 22:17 ` Rich Felker
2019-12-11 1:13 ` Rosen Penev
2019-12-11 9:53 ` Stefan Kanthak [this message]
2019-12-11 10:28 ` Szabolcs Nagy
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=F2BD74F09B4748A094E99A4B8C5CBB4A@H270 \
--to=stefan.kanthak@nexgo.de \
--cc=dalias@libc.org \
--cc=musl@lists.openwall.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://git.vuxu.org/mirror/musl/
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).