mailing list of musl libc
 help / color / mirror / code / Atom feed
From: Rich Felker <dalias@libc.org>
To: Jeffrey Walton <noloader@gmail.com>
Cc: musl@lists.openwall.com, 847567161 <847567161@qq.com>
Subject: Re: Re: [musl] Question:Why musl call a_barrier in __pthread_once?
Date: Thu, 18 May 2023 10:20:12 -0400	[thread overview]
Message-ID: <20230518142011.GR4163@brightrain.aerifal.cx> (raw)
In-Reply-To: <CAH8yC8mGa7VRhSFhWsZB1EYXFSA6A29Ps-KOFwfNvGf-v6+H5g@mail.gmail.com>

On Thu, May 18, 2023 at 10:15:36AM -0400, Jeffrey Walton wrote:
> On Thu, May 18, 2023 at 9:29 AM Rich Felker <dalias@libc.org> wrote:
> >
> > On Thu, May 18, 2023 at 02:23:06PM +0200, Szabolcs Nagy wrote:
> > > * 847567161 <847567161@qq.com> [2023-05-18 10:49:44 +0800]:
> > > > &gt; There is an alternate algorithm for pthread_once that doesn't require
> > > > &gt; a barrier in the common case, which I've considered implementing.. But
> > > > &gt; it does need efficient access to thread-local storage. At one time,
> > > > &gt; this was a kinda bad assumption (especially legacy mips is horribly
> > > > &gt; slow at TLS) but nowadays it's probably the right choice to make, and
> > > > &gt; we should check that out again...
> > > >
> > > > 1、Can we move dmb after we get the value of control? like this:
> > > >
> > > > int __pthread_once(pthread_once_t *control, void (*init)(void))
> > > > {
> > > >     /* Return immediately if init finished before, but ensure that
> > > >     * effects of the init routine are visible to the caller. */
> > > >     if (*(volatile int *)control == 2) {
> > > >         // a_barrier();
> > > >         return 0;
> > > >     }
> > >
> > > writes in init may not be visible when *control==2, without
> > > the barrier. (there are many explanations on the web why
> > > double-checked locking is wrong without an acquire barrier,
> > > that's the same issue if you are interested in the details)
> > >
> > > > 2、Can we use 'ldar' to  instead of dmb here? I see musl
> > > > already use 'stlxr' in a_sc.  like this:
> > > >
> > > > static inline int load(volatile int *p)
> > > > {
> > > >     int v;
> > > >     __asm__ __volatile__ ("ldar %w0,%1" : "=r"(v) : "Q"(*p));
> > > >     return v;
> > > > }
> > > >
> > > > if (load((volatile int *)control) == 2) {
> > > >     return 0;
> > > > }
> > >
> > > i think acquire ordering is enough because posix does not
> > > require pthread_once to synchronize memory, but musl does
> > > not have an acquire barrier/load, so it uses a_barrier.
> >
> > POSIX does require this. It's specified where Memory Synchronization
> > is defined,
> > https://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap04.html#tag_04_12
> >
> >     "The pthread_once() function shall synchronize memory for the
> >     first call in each thread for a given pthread_once_t object."
> >
> > > it is probably not worth optimizing the memory order since
> > > we know there is an algorithm that does not need a barrier
> > > in the common case.
> >
> > Arguably the above might make the barrier-free algorithm invalid for
> > pthread_once, but I'm not sure if the lack of "synchronize memory"
> > property in this case would be observable. It probably is with an
> > intentional construct trying to observe it. There may be some way to
> > salvage this with a second thread-local counter to account for
> > gratuitous extra synchronization needed.
> >
> > Of course call_once is exempt from any such requirements (also exempt
> > from cancellation shenanigans) and is probably the optimal thing for
> > programs to use. If needed we can make call_once have a different,
> > more optimal implementation than pthread_once.
> 
> Be careful of call_once.
> 
> Several years ago I cut over to C++11's call_once. The problem was, it
> only worked reliably on 32-bit and 64-bit Intel platforms. It was a
> disaster on Aarch64, PowerPC and Sparc. I had to back it out.

That is about the C++ std::call_once, implemented by GNU libstdc++,
and is unrelated to the C11 call_once, which libc implements.

> The problems happened back when GCC 6 and 7 were popular. The problem
> was due to something sideways in glibc.
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66146
> 
> If you want a call_once-like initialization then rely on N2660:
> Dynamic Initialization and Destruction with Concurrency.

That's the general algorithm we've been talking about (though without
bad properties like gratuitously inlining it to lock-in implementation
details as ABI).

Rich

      reply	other threads:[~2023-05-18 14:20 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-05-18  2:49 =?gb18030?B?UmU6IFJlOiBbbXVzbF0gUXVlc3Rpb26juldoeSBtdXNsIGNhbGwgYV9iYXJyaWVyIGluIF9fcHRocmVhZF9vbmNlPw==?= =?gb18030?B?ODQ3NTY3MTYx?=
2023-05-18 12:23 ` Re: [musl] Question:Why musl call a_barrier in __pthread_once? Szabolcs Nagy
2023-05-18 13:29   ` Rich Felker
2023-05-18 14:01     ` Jₑₙₛ Gustedt
2023-05-18 14:08       ` Rich Felker
2023-05-18 14:15     ` Jeffrey Walton
2023-05-18 14:20       ` Rich Felker [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20230518142011.GR4163@brightrain.aerifal.cx \
    --to=dalias@libc.org \
    --cc=847567161@qq.com \
    --cc=musl@lists.openwall.com \
    --cc=noloader@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).