mailing list of musl libc
 help / color / mirror / code / Atom feed
From: Rich Felker <dalias@libc.org>
To: musl@lists.openwall.com
Subject: Re: [PATCH] fix tls offsets when p_vaddr%p_align != 0 for TLS_ABOVE_TP
Date: Wed, 15 May 2019 20:20:51 -0400	[thread overview]
Message-ID: <20190516002051.GX23599@brightrain.aerifal.cx> (raw)
In-Reply-To: <20190514020131.GC16415@port70.net>

On Tue, May 14, 2019 at 04:01:31AM +0200, Szabolcs Nagy wrote:
> this came up because lld changed tls alignment on aarch64 as a
> workaround for a bionic abi issue https://reviews.llvm.org/D53906
> but lld does not handle p_vaddr%p_align!=0 right so it broke on glibc
> https://reviews.llvm.org/D61824
> 
> the patch is untested (bfd linker cannot seem to create problematic
> elf objects), but at least there are no regressions with libc-test.

> >From 8c94fcbc9faeb8b07132506757c3d3973652420e Mon Sep 17 00:00:00 2001
> From: Szabolcs Nagy <nsz@port70.net>
> Date: Mon, 13 May 2019 18:47:11 +0000
> Subject: [PATCH] fix tls offsets when p_vaddr%p_align != 0 for TLS_ABOVE_TP
> 
> currently the bfd linker does not seem to create tls segments where
> p_vaddr%p_align != 0, but this is valid in ELF and then the runtime
> computed tls offset must satisfy
> 
>   offset%p_align == (base+p_vaddr)%p_align
> 
> and in case of local exec tls (main executable) the smallest such
> offset must be used (otherwise it is incompatible with the offset
> computed by the static linker). the !TLS_ABOVE_TP case handled this
> correctly (the offset is negative then in the formula).
> 
> the ldso code for TLS_ABOVE_TP is changed so the static tls offset
> of each module satisfies the formula and tls_offset always points
> to the end of the allocated static tls area (and not aligned up to
> tls_align or MIN_TLS_ALIGN).

I guess this saves some wasted memory?

> the tls_offset computation was wrong
> when multiple modules were loaded with static tls and in some the
> tls segment p_memsz%p_align != 0.

I don't understand this part. Are you saying we're currently
misaligning TLS for some libraries now? 

> ---
>  ldso/dynlink.c       | 13 ++++++-------
>  src/env/__init_tls.c |  3 ++-
>  2 files changed, 8 insertions(+), 8 deletions(-)
> 
> diff --git a/ldso/dynlink.c b/ldso/dynlink.c
> index 42a5470d..6dc39483 100644
> --- a/ldso/dynlink.c
> +++ b/ldso/dynlink.c
> @@ -1126,9 +1126,9 @@ static struct dso *load_library(const char *name, struct dso *needed_by)
>  		p->tls_id = ++tls_cnt;
>  		tls_align = MAXP2(tls_align, p->tls.align);
>  #ifdef TLS_ABOVE_TP
> -		p->tls.offset = tls_offset + ( (tls_align-1) &
> -			-(tls_offset + (uintptr_t)p->tls.image) );
> -		tls_offset += p->tls.size;
> +		p->tls.offset = tls_offset + ( (p->tls.align-1) &
> +			(-tls_offset + (uintptr_t)p->tls.image) );
> +		tls_offset = p->tls.offset + p->tls.size;

Is there a motivation for the seemingly independent change from use of
tls_align to use of p->tls.align here?

>  #else
>  		tls_offset += p->tls.size + p->tls.align - 1;
>  		tls_offset -= (tls_offset + (uintptr_t)p->tls.image)
> @@ -1797,10 +1797,9 @@ _Noreturn void __dls3(size_t *sp)
>  		app.tls_id = tls_cnt = 1;
>  #ifdef TLS_ABOVE_TP
>  		app.tls.offset = GAP_ABOVE_TP;
> -		app.tls.offset += -GAP_ABOVE_TP & (app.tls.align-1);
> -		tls_offset = app.tls.offset + app.tls.size
> -			+ ( -((uintptr_t)app.tls.image + app.tls.size)
> -			& (app.tls.align-1) );
> +		app.tls.offset += (-GAP_ABOVE_TP + (uintptr_t)app.tls.image)
> +			& (app.tls.align-1);
> +		tls_offset = app.tls.offset + app.tls.size;
>  #else
>  		tls_offset = app.tls.offset = app.tls.size
>  			+ ( -((uintptr_t)app.tls.image + app.tls.size)
> diff --git a/src/env/__init_tls.c b/src/env/__init_tls.c
> index 5f12500c..772baba3 100644
> --- a/src/env/__init_tls.c
> +++ b/src/env/__init_tls.c
> @@ -115,7 +115,8 @@ static void static_init_tls(size_t *aux)
>  		& (main_tls.align-1);
>  #ifdef TLS_ABOVE_TP
>  	main_tls.offset = GAP_ABOVE_TP;
> -	main_tls.offset += -GAP_ABOVE_TP & (main_tls.align-1);
> +	main_tls.offset += (-GAP_ABOVE_TP + (uintptr_t)main_tls.image)
> +		& (main_tls.align-1);
>  #else
>  	main_tls.offset = main_tls.size;
>  #endif
> -- 
> 2.21.0
> 

I think you're probably right about all these things and I want to
apply this, but I also want to understand it a bit better first.

Rich


  reply	other threads:[~2019-05-16  0:20 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-05-14  2:01 Szabolcs Nagy
2019-05-16  0:20 ` Rich Felker [this message]
2019-05-16  7:48   ` Szabolcs Nagy
2019-05-16 13:22     ` Rich Felker
2019-05-16 22:51       ` Szabolcs Nagy
2019-05-17  1:50         ` Rich Felker
2019-05-17 12:32           ` Szabolcs Nagy
2019-05-17 16:01             ` Fangrui Song

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20190516002051.GX23599@brightrain.aerifal.cx \
    --to=dalias@libc.org \
    --cc=musl@lists.openwall.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).