From mboxrd@z Thu Jan  1 00:00:00 1970
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org
X-Spam-Level: 
X-Spam-Status: No, score=-3.3 required=5.0 tests=MAILING_LIST_MULTI,
	RCVD_IN_DNSWL_MED,RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL autolearn=ham
	autolearn_force=no version=3.4.4
Received: (qmail 32083 invoked from network); 20 May 2020 17:49:41 -0000
Received: from mother.openwall.net (195.42.179.200)
  by inbox.vuxu.org with ESMTPUTF8; 20 May 2020 17:49:41 -0000
Received: (qmail 16043 invoked by uid 550); 20 May 2020 17:49:38 -0000
Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm
Precedence: bulk
List-Post: <mailto:musl@lists.openwall.com>
List-Help: <mailto:musl-help@lists.openwall.com>
List-Unsubscribe: <mailto:musl-unsubscribe@lists.openwall.com>
List-Subscribe: <mailto:musl-subscribe@lists.openwall.com>
List-ID: <musl.lists.openwall.com>
Reply-To: musl@lists.openwall.com
Received: (qmail 16022 invoked from network); 20 May 2020 17:49:37 -0000
Date: Wed, 20 May 2020 13:49:25 -0400
From: Rich Felker <dalias@libc.org>
To: Konstantin Khlebnikov <khlebnikov@yandex-team.ru>
Cc: musl@lists.openwall.com
Message-ID: <20200520174925.GR1079@brightrain.aerifal.cx>
References: <c6502d12-8092-3572-2827-1f7884402b8d@yandex-team.ru>
 <20200520160506.GL1079@brightrain.aerifal.cx>
 <a869a173-a8c7-0d82-41f8-0953b3db1068@yandex-team.ru>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <a869a173-a8c7-0d82-41f8-0953b3db1068@yandex-team.ru>
User-Agent: Mutt/1.5.21 (2010-09-15)
Subject: Re: [musl] pthread shouldn't ignore errors from syscall futex()

On Wed, May 20, 2020 at 08:38:35PM +0300, Konstantin Khlebnikov wrote:
> On 20/05/2020 19.05, Rich Felker wrote:
> >On Wed, May 20, 2020 at 03:31:46PM +0300, Konstantin Khlebnikov wrote:
> >>Userspace implementations of mutexes (including glibc) in some cases
> >>retries operation without checking error code from syscall futex.
> >>
> >>Example which loops inside second call rather than hung (or die) peacefully:
> >>
> >>#include <stdlib.h>
> >>#include <pthread.h>
> >>
> >>int main(int argc, char **argv)
> >>{
> >>	char buf[sizeof(pthread_mutex_t) + 1];
> >>	pthread_mutex_t *mutex = (pthread_mutex_t *)(buf + 1);
> >>
> >>	pthread_mutex_init(mutex, NULL);
> >>	pthread_mutex_lock(mutex);
> >>	pthread_mutex_lock(mutex);
> >>}
> >>
> >>Thread in lkml:
> >>https://lore.kernel.org/lkml/158955700764.647498.18025770126733698386.stgit@buzz/T/
> >>
> >>Related bug in glibc:
> >>https://sourceware.org/bugzilla/show_bug.cgi?id=25997
> >
> >In general, this behavior is intentional. If running on a system where
> >futexx is broken (incomplete implementation of Linux syscall API,
> >Linux built with flags that break futex which is possible on some
> >archs, etc.), or if the kernel cannot perform the wait because of an
> >OOM condition in the kernel (Linux is *not* written to be resilent
> >against OOM and it shows), the behavior degrades to spinlocks rather
> >than crashing. Aborting the application because of OOM conditions in
> >the kernel is simply not acceptable.
> 
> Yes, OOM condition in cgroup before linux 4.19 definitely could lead to
> returning EFAULT by almost any syscall. This is worth to document in
> futex manpage.
> 
> But EINVAL from futex() always meant arguments were wrong.
> 
> Ignoring unknown errors feels wrong anyway. That just hides bugs.
> And provokes appearing these incomplete/buggy implementations of futex.

Ignoring errors is never wrong unless the error can happen in a
correct program or ignoring it in an incorrect program produces a
vulnerability that affects the user's safety.

> Also degrading silently to spin-locks isn't very safe.
> Not all schedulers guarantee progress if waiter spins.
> At least add some delay or yield into that fallback waiting loop.

We are not going to add costs to valid programs to make up for bad
behavior by buggy programs unless it's mitigating a security issue
potentially leading to code execution, information disclosure, loss of
data integrity, or something similarly severe. "Box bogged down at
100% cpu because you ran a buggy program" is not something that merits
libc trying to mitigate it.

> >It would be possible to try to distinguish the causes of futex failure
> >and handle the unaligned case specially, but this would put more code
> >in hot paths, impacting size and possibly performance in valid
> >programs for the sake of catching a non-security bug in invalid ones.
> >This does not seem like a useful tradeoff.
> 
> I've proposed to send SIGBUS from syscall when futex address is unligned.
> (In LKML thread, see link above)

And they rightly rejected it. Arguably every EFAULT should also
produce SIGBUS or SIGSEGV, but they don't, because the policy has
always been to report erroneous userspace addresses to userspace
rather than crashing. Changing this would be a major change in kernel
policy, and a break in stability policy. For example there may even be
applications calling futex on a misaligned address to *test whether*
it's supported usage, relying on the existing contract to report it.

> >Assuming the buggy program actually calls pthread_mutex_init rather
> >than just using an uninitialized/zero-initialized mutex object at
> >misaligned address, pthread_mutex_init (and likewise other pthread
> >object init functions) could possibly trap on the error (with no
> >syscall, just looking for a misaligned address mod _Alignof() the
> >object type) to catch it. I'm not sure if this is worthwhile though
> >since, while being UB, it doesn't seem to be UB with any security
> >impact.
> 
> Yeah, I'm worried more about debugability and CO2 emission =)

Then proposing this for the init functions is probably a good idea. In
musl we generally like to trap on UB, but only when it either happens
naturally (like null arg dereference) or can be done at low cost,
ideally not in a hot path. So this seems like a reasonable solution to
achieve what you want.

Rich