From: Hongliang Wang <wanghongliang@loongson.cn>
To: Szabolcs Nagy <nsz@port70.net>, musl@lists.openwall.com
Subject: Re: [musl] add loongarch64 port v9.
Date: Fri, 17 Nov 2023 15:19:03 +0800 [thread overview]
Message-ID: <8cf01ee8-4c45-340c-8a17-4df1154de322@loongson.cn> (raw)
In-Reply-To: <20231116171824.GK1427497@port70.net>
在 2023/11/17 上午1:18, Szabolcs Nagy 写道:
> * Szabolcs Nagy <nsz@port70.net> [2023-11-16 17:44:20 +0100]:
>
>> * Hongliang Wang <wanghongliang@loongson.cn> [2023-11-16 10:54:44 +0800]:
>>> Thank you for your suggestion, I have modified the dynamic linker
>>> name according to the basic ABI types are specified in the ABI
>>> document of the LoongArch, and post 0001-add-loongarch64-port-v9.patch,
>>> as shown in the attachment.
>>>
>>> Based on 0001-add-loongarch64-port-v8.patch,the modifications for
>>> 0001-add-loongarch64-port-v9.patch are as follows:
>>
>> note, the new names must match gcc, which seems to be the case:
>>
>> https://gcc.gnu.org/git/?p=gcc.git;a=blob;f=gcc/config/loongarch/gnu-user.h;h=9616d6e8a0b8629720a844fd0122dafef682a4cb;hb=HEAD#l36
>
> this seems to be new for gcc-14 and not present in any
> released version of gcc, so in principle gcc can be
> changed.
>
> we just have to make sure whatever is committed in musl
> is compatible with whatever that gets released in gcc-14
> also backport that change to gcc-13 and gcc-12 which
> currently have a broken dynamic linker spec:
> +#define MUSL_DYNAMIC_LINKER \
> + "/lib" ABI_GRLEN_SPEC "/ld-musl-loongarch-" ABI_SPEC ".so.1"
>
We will backport the change to gcc-13 and gcc-12.
>>
>> they do *not* match the names in the cited abi document:
>>
>> https://loongson.github.io/LoongArch-Documentation/LoongArch-ELF-ABI-EN.html#_program_interpreter_path
>>
>> (uses /lib64/..-loongarch-.. instead of /lib/..-loongarch64-..)
Currently, the abi document only describes the glibc's
program interpreter path, and we will update the musl's
program interpreter path to the abi document later.
>>
>> the diff looks good.
>>
>>>
>>> ---
>>> arch/loongarch64/reloc.h | 10 ++++++----
>>> configure | 4 +++-
>>> 2 files changed, 9 insertions(+), 5 deletions(-)
>>>
>>> diff --git a/arch/loongarch64/reloc.h b/arch/loongarch64/reloc.h
>>> index a4482b48..6907de8e 100644
>>> --- a/arch/loongarch64/reloc.h
>>> +++ b/arch/loongarch64/reloc.h
>>> @@ -1,7 +1,9 @@
>>> -#ifdef __loongarch_soft_float
>>> -#define FP_SUFFIX "-sf"
>>> -#else
>>> -#define FP_SUFFIX ""
>>> +#if defined __loongarch_double_float
>>> +#define FP_SUFFIX "-lp64d"
>>> +#elif defined __loongarch_single_float
>>> +#define FP_SUFFIX "-lp64f"
>>> +#elif defined __loongarch_soft_float
>>> +#define FP_SUFFIX "-lp64s"
>>> #endif
>>>
>>> #define LDSO_ARCH "loongarch64" FP_SUFFIX
>>> diff --git a/configure b/configure
>>> index 55d179f1..93b06287 100755
>>> --- a/configure
>>> +++ b/configure
>>> @@ -673,7 +673,9 @@ trycppif __AARCH64EB__ "$t" && SUBARCH=${SUBARCH}_be
>>> fi
>>>
>>> if test "$ARCH" = "loongarch64" ; then
>>> -trycppif __loongarch_soft_float "$t" && SUBARCH=${SUBARCH}-sf
>>> +trycppif __loongarch_double_float "$t" && SUBARCH=${SUBARCH}-lp64d
>>> +trycppif __loongarch_single_float "$t" && SUBARCH=${SUBARCH}-lp64f
>>> +trycppif __loongarch_soft_float "$t" && SUBARCH=${SUBARCH}-lp64s
>>> printf "checking whether compiler support FCSRs... "
>>> echo "__asm__(\"movfcsr2gr \$t0,\$fcsr0\");" > "$tmpc"
>>> if $CC -c -o /dev/null "$tmpc" >/dev/null 2>&1 ; then
>>> --
>>>
>>> Please review again, and point them out if any questions need to be
>>> modified, thanks.
>>>
>>>
>>> Hongliang Wang
>>>
>>> 在 2023/11/14 下午9:16, Jingyun Hua 写道:
>>>> hi,
>>>>
>>>> I have a suggestion regarding the musl dynamic linker name for the
>>>> LoongArch architecture:
>>>>
>>>> The basic ABI types are specified in the ABI document of the LoongArch,
>>>> which can be seen:
>>>> https://loongson.github.io/LoongArch-Documentation/LoongArch-ELF-ABI-EN.html
>>>>
>>>> and the dynamic linker name of glibc distinguishes lp64d, lp64s and
>>>> lp64f with abi types.The v8 patch only defines the value of SUBARCH
>>>> as "-sf" for “__loongarch_soft_float”, which means that only
>>>> “ld-musl-loongarch64.so.1” or “ld-musl-loongarch64-sf.so.1” can be
>>>> generated when compiled with the v8 patch.
>>>>
>>>> So I think the musl dynamic linker name for the LoongArch architecture
>>>> needs to be modified, maybe defined like glibc?
>>>>
>>>> Regards,
>>>> Jingyun Hua
>>>>
>>>> On 11/9/23 9:15 PM, Jingyun Hua wrote:
>>>>> Hi all,
>>>>>
>>>>> Thank you for taking the time to check this email. I'm very interested
>>>>> in musl and alpine, and I'd like to know the current status of this
>>>>> patch and what we can do for it next.
>>>>>
>>>>> After Hongliang updated this patch from v7 to v8, in order to verify it,
>>>>> I built the alpine linux from scratch base on this musl v8 patch on the
>>>>> LoongArch64 machine, and successfully compiled projects such as gcc,
>>>>> rust, llvm, go, etc.. Currently, more than 8,500 packages have been
>>>>> built and running normally in my local repository.
>>>>>
>>>>> The patch has indeed been on hold for a long time. Is it in the
>>>>> immediate plans? Is there anything that needs to be modified and
>>>>> improved?
>>>>>
>>>>> Looking forward to your reply, thanks!
>>>>>
>>>>> Regards,
>>>>> Jingyun Hua
>>>>>
>>>>> 在 2023/9/26 上午11:28, Hongliang Wang 写道:
>>>>>> Hi,
>>>>>>
>>>>>> I have fixed the issues listed below, and post
>>>>>> 0001-add-loongarch64-port-v8.patch,as shown in the attachment.
>>>>>> please review again, if there are anything that need to be modified,
>>>>>> please point out, thank you.
>>>>>>
>>>>>> Hongliang Wang
>>>>>>
>>>>>> 在 2023/9/22 上午9:37, Hongliang Wang 写道:
>>>>>>> Hi,
>>>>>>>
>>>>>>> Thank you for your review, I will modify it according to the review
>>>>>>> suggestions and post v8 patch later.
>>>>>>>
>>>>>>> Hongliang Wang
>>>>>>>
>>>>>>> 在 2023/9/20 下午9:16, Szabolcs Nagy 写道:
>>>>>>>> * Jianmin Lv <lvjianmin@loongson.cn> [2023-09-20 15:45:39 +0800]:
>>>>>>>>> Sorry to bother you, but I just want to know if
>>>>>>>>> there is any progress on
>>>>>>>>> this, because Alpine is blocking by this patch for a
>>>>>>>>> long time. As Hongliang
>>>>>>>>> has explained questions you mentioned one by one, if
>>>>>>>>> any question need to be
>>>>>>>>> discussed, please point them out, so that the patch
>>>>>>>>> can be handled further.
>>>>>>>>
>>>>>>>> i think you should post a v8 patch to move
>>>>>>>> this forward. (not on github, but here)
>>>>>>>>
>>>>>>>>> On 2023/8/15 下午4:17, Hongliang Wang wrote:
>>>>>>>>>> 在 2023/8/13 上午9:41, Rich Felker 写道:
>>>>>>>>>>> On Sat, Aug 05, 2023 at 11:43:08AM -0400, Rich Felker wrote:
>>>>>>>>>>>> On Sat, Aug 05, 2023 at 02:18:35PM +0800, 翟小娟 wrote:
>>>>>>>>>>>> -+#define __BYTE_ORDER 1234
>>>>>>>>>>>> ++#define __BYTE_ORDER __LITTLE_ENDIAN
>>>>>>>>>>>
>>>>>>>>>>> This is gratuitous, mismatches what is done on other archs, and is
>>>>>>>>>>> less safe.
>>>>>>>>>>>
>>>>>>>>>> The modification is based on the following review suggestion(
>>>>>>>>>> WANG Xuerui <i@xen0n.name> reviewed in
>>>>>>>>>> https://www.openwall.com/lists/musl/2022/10/12/1):
>>>>>>>>>>
>>>>>>>>>> `#define __BYTE_ORDER __LITTLE_ENDIAN` could be more consistent
>>>>>>>>>> with other arches.
>>>>>>>>
>>>>>>>> please use 1234.
>>>>>>
>>>>>> --- a/arch/loongarch64/bits/alltypes.h.in
>>>>>> +++ b/arch/loongarch64/bits/alltypes.h.in
>>>>>> @@ -2,7 +2,7 @@
>>>>>> #define _Int64 long
>>>>>> #define _Reg long
>>>>>>
>>>>>> -#define __BYTE_ORDER __LITTLE_ENDIAN
>>>>>> +#define __BYTE_ORDER 1234
>>>>>> #define __LONG_MAX 0x7fffffffffffffffL
>>>>>>
>>>>>>
>>>>>>>>
>>>>>>>>>>>> -+TYPEDEF unsigned nlink_t;
>>>>>>>>>>>> ++TYPEDEF unsigned int nlink_t;
>>>>>>>>>>>
>>>>>>>>>>> Gratuitous and in opposite direction of
>>>>>>>>>>> coding style used elsewhere in
>>>>>>>>>>> musl. There are a few other instances of this too.
>>>>>>>>>>>
>>>>>>>>>> Based on the following review question(WANG Xuerui <i@xen0n.name>
>>>>>>>>>> reviewed in https://www.openwall.com/lists/musl/2022/10/12/1):
>>>>>>>>>>
>>>>>>>>>> `unsigned int`? Same for other bare `unsigned` usages.
>>>>>>>>>>
>>>>>>>>>> I fixed it to a explicit definition.
>>>>>>>>
>>>>>>>> please use plain unsigned, not unsigned int.
>>>>>>
>>>>>> --- a/arch/loongarch64/bits/alltypes.h.in
>>>>>> +++ b/arch/loongarch64/bits/alltypes.h.in
>>>>>> @@ -2,7 +2,7 @@
>>>>>>
>>>>>> -TYPEDEF unsigned int nlink_t;
>>>>>> +TYPEDEF unsigned nlink_t;
>>>>>> TYPEDEF int blksize_t;
>>>>>>
>>>>>>>>
>>>>>>>>>>>> -+ register uintptr_t tp __asm__("tp");
>>>>>>>>>>>> -+ __asm__ ("" : "=r" (tp) );
>>>>>>>>>>>> ++ uintptr_t tp;
>>>>>>>>>>>> ++ __asm__ __volatile__("move %0, $tp" : "=r"(tp));
>>>>>>>>>>>> + return tp;
>>>>>>>>>>>
>>>>>>>>>>> Not clear what the motivation is here. Is it
>>>>>>>>>>> working around a compiler
>>>>>>>>>>> bug? The original form was more optimal.
>>>>>>>>>>
>>>>>>>>>> The modification is based on the following review suggestion(
>>>>>>>>>> WANG Xuerui <i@xen0n.name> reviewed in
>>>>>>>>>> https://www.openwall.com/lists/musl/2022/10/12/1):
>>>>>>>>>>
>>>>>>>>>> While the current approach works, it's a bit fragile [1], and
>>>>>>>>>> the simple and plain riscv version works too:
>>>>>>>>>>
>>>>>>>>>> uintptr_t tp;
>>>>>>>>>> __asm__ ("move %0, $tp" : "=r"(tp));
>>>>>>>>>>
>>>>>>>>>> [1]:https://gcc.gnu.org/onlinedocs/gcc/Local-Register-Variables.html#Local-Register-Variables
>>>>>>>>>>
>>>>>>>>
>>>>>>>> this looks ok to me.
>>>>>>>>
>>>>>>>> (original code looks sketchy to me.)
>>>>>>>>
>>>>>>>>>>>> +#define CRTJMP(pc,sp) __asm__ __volatile__( \
>>>>>>>>>>>> -+ "move $sp,%1 ; jr %0" : : "r"(pc), "r"(sp) : "memory" )
>>>>>>>>>>>> -+
>>>>>>>>>>>> -+#define GETFUNCSYM(fp, sym, got) __asm__ ( \
>>>>>>>>>>>> -+ ".hidden " #sym "\n" \
>>>>>>>>>>>> -+ ".align 8 \n" \
>>>>>>>>>>>> -+ " la.local $t1, "#sym" \n" \
>>>>>>>>>>>> -+ " move %0, $t1 \n" \
>>>>>>>>>>>> -+ : "=r"(*(fp)) : : "memory" )
>>>>>>>>>>>> ++ "move $sp, %1 ; jr %0" : : "r"(pc), "r"(sp) : "memory" )
>>>>>>>>>>>
>>>>>>>>>>> Not clear why this was changed. It was never
>>>>>>>>>>> discussed afaik. It looks
>>>>>>>>>>> somewhat dubious removing GETFUNCSYM and using the maybe-unreliable
>>>>>>>>>>> default definition.
>>>>>>>>>>>
>>>>>>>>>> Based on the following review question(
>>>>>>>>>> WANG Xuerui <i@xen0n.name> reviewed
>>>>>>>>>> in https://www.openwall.com/lists/musl/2022/10/12/1):
>>>>>>>>>>
>>>>>>>>>> Does the generic version residing in ldso/dlstart.c not work?
>>>>>>>>>>
>>>>>>>>>> I found the code logic is consistent with the generic version, So
>>>>>>>>>> I removed the definition here and replaced it with generic version.
>>>>>>>>
>>>>>>>> i would change it back to the asm.
>>>>>>>>
>>>>>>>> generic version should work, but maybe we don't
>>>>>>>> want to trust the compiler here (there may be
>>>>>>>> ways to compile the generic code that is not
>>>>>>>> compatible with incompletely relocated libc.so)
>>>>>>>> the asm is safer.
>>>>>>>>
>>>>>> --- a/arch/loongarch64/reloc.h
>>>>>> +++ b/arch/loongarch64/reloc.h
>>>>>> @@ -18,3 +18,10 @@
>>>>>>
>>>>>> #define CRTJMP(pc,sp) __asm__ __volatile__( \
>>>>>> "move $sp, %1 ; jr %0" : : "r"(pc), "r"(sp) : "memory" )
>>>>>> +
>>>>>> +#define GETFUNCSYM(fp, sym, got) __asm__ ( \
>>>>>> + ".hidden " #sym "\n" \
>>>>>> + ".align 8 \n" \
>>>>>> + " la.local $t1, "#sym" \n" \
>>>>>> + " move %0, $t1 \n" \
>>>>>> + : "=r"(*(fp)) : : "memory" )
>>>>>>
>>>>>>>>>>> It also looks like v5->v6 rewrote sigsetjmp.
>>>>>>>>>>>
>>>>>>>>>> Based on the following review question(
>>>>>>>>>> WANG Xuerui <i@xen0n.name> reviewed
>>>>>>>>>> in https://www.openwall.com/lists/musl/2022/10/12/1):
>>>>>>>>>>
>>>>>>>>>> This is crazy complicated compared to the riscv port, why is the
>>>>>>>>>> juggling between a0/a1 and t5/t6 necessary?
>>>>>>>>>>
>>>>>>>>>> I optimized the code implementations.
>>>>>>>>
>>>>>>>> this should be ok.
>>>>>>>>
>>>>>>>>>>> v6->v7:
>>>>>>>>>>>
>>>>>>>>>>> sigcontext/mcontext_t change mostly makes sense, but the
>>>>>>>>>>> namespace-safe and full mcontext_t differ in
>>>>>>>>>>> their use of the aligned
>>>>>>>>>>> attribute and it's not clear that the
>>>>>>>>>>> attribute is needed (since the
>>>>>>>>>>> offset is naturally aligned and any instance
>>>>>>>>>>> of the object is produced
>>>>>>>>>>> by the kernel, not by userspace).
>>>>>>>>>>>
>>>>>>>>>>>> -+ long __uc_pad;
>>>>>>>>>>>
>>>>>>>>>>> This change to ucontext_t actually makes the
>>>>>>>>>>> struct ABI-incompatible
>>>>>>>>>>> in the namespace-safe version without the
>>>>>>>>>>> alignment attribute. IIRC I
>>>>>>>>>>> explicitly requested the explicit padding
>>>>>>>>>>> field to avoid this kind of
>>>>>>>>>>> footgun. Removing it does not seem like a good idea.
>>>>>>>>>>>
>>>>>>>>>> Initially, we add __uc_pad to ensure uc_mcontext
>>>>>>>>>> is 16 byte alignment.
>>>>>>>>>> Now, we added __attribute__((__aligned__(16))) to
>>>>>>>>>> uc_mcontext.__extcontext[],this can ensure
>>>>>>>>>> uc_mcontext is also 16 byte
>>>>>>>>>> alignment. so __uc_pad is not used.
>>>>>>>>>>
>>>>>>>>>> Remove __uc_pad, from the point of struct
>>>>>>>>>> layout, musl and kernel are
>>>>>>>>>> consistent. otherwise, I think it may bring a sense of inconsistency
>>>>>>>>>> between kernel and musl. Due to Loongarch is not
>>>>>>>>>> merged into musl now,
>>>>>>>>>> Remove it no compatibility issues.
>>>>>>>>
>>>>>>>> please add back the padding field.
>>>>>>
>>>>>> --- a/arch/loongarch64/bits/signal.h
>>>>>> +++ b/arch/loongarch64/bits/signal.h
>>>>>> @@ -40,6 +40,7 @@ typedef struct __ucontext
>>>>>> struct __ucontext *uc_link;
>>>>>> stack_t uc_stack;
>>>>>> sigset_t uc_sigmask;
>>>>>> + long __uc_pad;
>>>>>> mcontext_t uc_mcontext;
>>>>>> } ucontext_t;
>>>>>>
>>>>>>>>
>>>>>>>>>>> In stat.h:
>>>>>>>>>>>
>>>>>>>>>>>> -+ unsigned long __pad;
>>>>>>>>>>>> ++ unsigned long __pad1;
>>>>>>>>>>>
>>>>>>>>>>> This is gratuitous and makes the definition
>>>>>>>>>>> gratuitously mismatch what
>>>>>>>>>>> will become the "generic" version of
>>>>>>>>>>> bits/stat.h. There is no contract
>>>>>>>>>>> for applications to be able to access these
>>>>>>>>>>> padding fields by name, so
>>>>>>>>>>> no motivation to make their names match
>>>>>>>>>>> glibc's names or the kernel's.
>>>>>>>>>>>
>>>>>>>>>> OK.
>>>>>>>>
>>>>>>>> please fix it.
>>>>>>
>>>>>> --- a/arch/loongarch64/bits/stat.h
>>>>>> +++ b/arch/loongarch64/bits/stat.h
>>>>>> @@ -6,7 +6,7 @@ struct stat {
>>>>>> uid_t st_uid;
>>>>>> gid_t st_gid;
>>>>>> dev_t st_rdev;
>>>>>> - unsigned long __pad1;
>>>>>> + unsigned long __pad;
>>>>>> off_t st_size;
>>>>>> blksize_t st_blksize;
>>>>>> int __pad2;
>>>>>>
>>>>>>>>
>>>>>>>>>>> In fenv.S:
>>>>>>>>>>>
>>>>>>>>>>>> ++#ifdef __clang__
>>>>>>>>>>>> ++#define FCSR $fcsr0
>>>>>>>>>>>> ++#else
>>>>>>>>>>>> ++#define FCSR $r0
>>>>>>>>>>>> ++#endif
>>>>>>>>>>>
>>>>>>>>>>> It's not clear to me what's going on here. Is there a clang
>>>>>>>>>>> incompatibility in the assembler language you're working around? Or
>>>>>>>>>>> what? If so that seems like a tooling bug that should be fixed.
>>>>>>>>>>>
>>>>>>>>>> The GNU assembler cannot correctly recognize $fcsr0, but the
>>>>>>>>>> LLVM IAS does not have this issue, so make a distinction.
>>>>>>>>>> This issue has been fixed in GNU assembler 2.41. but for compatible
>>>>>>>>>> with GNU assembler 2.40 and below, $r0 need reserved.
>>>>>>>>
>>>>>>>> it sounds like the correct asm is $fcsr0, so that's
>>>>>>>> what should be used on all assemblers, not just on
>>>>>>>> clang as.
>>>>>>>>
>>>>>>>> only broken old gnu as should use different syntax.
>>>>>>>> for this a configure test is needed that adds a
>>>>>>>> CFLAG like -DBROKEN_LOONGARCH_FCSR_ASM when fails.
>>>>>>>> and use that for the ifdef.
>>>>>>>>
>>>>>> --- a/configure
>>>>>> +++ b/configure
>>>>>>
>>>>>> if test "$ARCH" = "loongarch64" ; then
>>>>>> trycppif __loongarch_soft_float "$t" && SUBARCH=${SUBARCH}-sf
>>>>>> +printf "checking whether compiler support FCSRs... "
>>>>>> +echo "__asm__(\"movfcsr2gr \$t0,\$fcsr0\");" > "$tmpc"
>>>>>> +if $CC -c -o /dev/null "$tmpc" >/dev/null 2>&1 ; then
>>>>>> +printf "yes\n"
>>>>>> +else
>>>>>> +printf "no\n"
>>>>>> +CFLAGS_AUTO="$CFLAGS_AUTO -DBROKEN_LOONGARCH_FCSR_ASM"
>>>>>> +fi
>>>>>> fi
>>>>>>
>>>>>> --- a/src/fenv/loongarch64/fenv.S
>>>>>> +++ b/src/fenv/loongarch64/fenv.S
>>>>>> @@ -1,9 +1,9 @@
>>>>>> #ifndef __loongarch_soft_float
>>>>>>
>>>>>> -#ifdef __clang__
>>>>>> -#define FCSR $fcsr0
>>>>>> -#else
>>>>>> +#ifdef BROKEN_LOONGARCH_FCSR_ASM
>>>>>> #define FCSR $r0
>>>>>> +#else
>>>>>> +#define FCSR $fcsr0
>>>>>> #endif
>>>>>>
>>>>>> .global feclearexcept
>>>>>>
>>>>>>>>>>
>>>>>>>>>> The linux kernel also has a similar distinction:
>>>>>>>>>>
>>>>>>>>>> commit 38bb46f94544c5385bc35aa2bfc776dcf53a7b5d
>>>>>>>>>> Author: WANG Xuerui <git@xen0n.name>
>>>>>>>>>> Date: Thu Jun 29 20:58:43 2023 +0800
>>>>>>>>>>
>>>>>>>>>> LoongArch: Prepare for assemblers with
>>>>>>>>>> proper FCSR class support
>>>>>>>>>>
>>>>>>>>>> The GNU assembler (as of 2.40) mis-treats
>>>>>>>>>> FCSR operands as GPRs, but
>>>>>>>>>> the LLVM IAS does not. Probe for this and
>>>>>>>>>> refer to FCSRs as"$fcsrNN"
>>>>>>>>>> if support is present.
>>>>>>>>>>
>>>>>>>>>> Signed-off-by: WANG Xuerui <git@xen0n.name>
>>>>>>>>>> Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
>>>>>>>>>>
>>>>>>>>>>> Otherwise, everything looks as expected, I
>>>>>>>>>>> think. I'm okay with making
>>>>>>>>>>> any fixups for merging rather than throwing
>>>>>>>>>>> this back on your team for
>>>>>>>>>>> more revisions, but can you please follow up and clarify the above?
>>>>>>>>
>>>>>>>> all issues look minor.
>>>>>>>>
>>>>>>>> if you post a v8 it likely gets into a release faster.
>>>>>>>>
>>>>>
>>>>
>>
>>> >From 2d5f9717fda5c16f10d805ce8a26f1e78de440ea Mon Sep 17 00:00:00 2001
>>> From: wanghongliang <wanghongliang@loongson.cn>
>>> Date: Wed, 15 Nov 2023 01:31:51 +0800
>>> Subject: [PATCH] add loongarch64 port v9.
>>>
>>> Author: Xiaojuan Zhai <zhaixiaojuan@loongson.cn>
>>> Author: Meidan Li <limeidan@loongson.cn>
>>> Author: Guoqi Chen <chenguoqi@loongson.cn>
>>> Author: Xiaolin Zhao <zhaoxiaolin@loongson.cn>
>>> Author: Fan peng <fanpeng@loongson.cn>
>>> Author: Jiantao Shan <shanjiantao@loongson.cn>
>>> Author: Xuhui Qiang <qiangxuhui@loongson.cn>
>>> Author: Jingyun Hua <huajingyun@loongson.cn>
>>> Author: Liu xue <liuxue@loongson.cn>
>>> Author: Hongliang Wang <wanghongliang@loongson.cn>
>>>
>>> Signed-off-by: wanghongliang <wanghongliang@loongson.cn>
>>> ---
>>> arch/loongarch64/atomic_arch.h | 53 ++++
>>> arch/loongarch64/bits/alltypes.h.in | 18 ++
>>> arch/loongarch64/bits/fenv.h | 20 ++
>>> arch/loongarch64/bits/float.h | 16 ++
>>> arch/loongarch64/bits/posix.h | 2 +
>>> arch/loongarch64/bits/ptrace.h | 4 +
>>> arch/loongarch64/bits/reg.h | 2 +
>>> arch/loongarch64/bits/setjmp.h | 1 +
>>> arch/loongarch64/bits/signal.h | 92 +++++++
>>> arch/loongarch64/bits/stat.h | 18 ++
>>> arch/loongarch64/bits/stdint.h | 20 ++
>>> arch/loongarch64/bits/syscall.h.in | 303 +++++++++++++++++++++
>>> arch/loongarch64/bits/user.h | 5 +
>>> arch/loongarch64/crt_arch.h | 13 +
>>> arch/loongarch64/pthread_arch.h | 11 +
>>> arch/loongarch64/reloc.h | 29 ++
>>> arch/loongarch64/syscall_arch.h | 137 ++++++++++
>>> configure | 15 +
>>> include/elf.h | 104 ++++++-
>>> src/fenv/loongarch64/fenv.S | 78 ++++++
>>> src/ldso/loongarch64/dlsym.s | 7 +
>>> src/setjmp/loongarch64/longjmp.S | 32 +++
>>> src/setjmp/loongarch64/setjmp.S | 34 +++
>>> src/signal/loongarch64/restore.s | 10 +
>>> src/signal/loongarch64/sigsetjmp.s | 25 ++
>>> src/thread/loongarch64/__set_thread_area.s | 7 +
>>> src/thread/loongarch64/__unmapself.s | 7 +
>>> src/thread/loongarch64/clone.s | 28 ++
>>> src/thread/loongarch64/syscall_cp.s | 29 ++
>>> 29 files changed, 1119 insertions(+), 1 deletion(-)
>>> create mode 100644 arch/loongarch64/atomic_arch.h
>>> create mode 100644 arch/loongarch64/bits/alltypes.h.in
>>> create mode 100644 arch/loongarch64/bits/fenv.h
>>> create mode 100644 arch/loongarch64/bits/float.h
>>> create mode 100644 arch/loongarch64/bits/posix.h
>>> create mode 100644 arch/loongarch64/bits/ptrace.h
>>> create mode 100644 arch/loongarch64/bits/reg.h
>>> create mode 100644 arch/loongarch64/bits/setjmp.h
>>> create mode 100644 arch/loongarch64/bits/signal.h
>>> create mode 100644 arch/loongarch64/bits/stat.h
>>> create mode 100644 arch/loongarch64/bits/stdint.h
>>> create mode 100644 arch/loongarch64/bits/syscall.h.in
>>> create mode 100644 arch/loongarch64/bits/user.h
>>> create mode 100644 arch/loongarch64/crt_arch.h
>>> create mode 100644 arch/loongarch64/pthread_arch.h
>>> create mode 100644 arch/loongarch64/reloc.h
>>> create mode 100644 arch/loongarch64/syscall_arch.h
>>> create mode 100644 src/fenv/loongarch64/fenv.S
>>> create mode 100644 src/ldso/loongarch64/dlsym.s
>>> create mode 100644 src/setjmp/loongarch64/longjmp.S
>>> create mode 100644 src/setjmp/loongarch64/setjmp.S
>>> create mode 100644 src/signal/loongarch64/restore.s
>>> create mode 100644 src/signal/loongarch64/sigsetjmp.s
>>> create mode 100644 src/thread/loongarch64/__set_thread_area.s
>>> create mode 100644 src/thread/loongarch64/__unmapself.s
>>> create mode 100644 src/thread/loongarch64/clone.s
>>> create mode 100644 src/thread/loongarch64/syscall_cp.s
>>>
>>> diff --git a/arch/loongarch64/atomic_arch.h b/arch/loongarch64/atomic_arch.h
>>> new file mode 100644
>>> index 00000000..2225d027
>>> --- /dev/null
>>> +++ b/arch/loongarch64/atomic_arch.h
>>> @@ -0,0 +1,53 @@
>>> +#define a_ll a_ll
>>> +static inline int a_ll(volatile int *p)
>>> +{
>>> + int v;
>>> + __asm__ __volatile__ (
>>> + "ll.w %0, %1"
>>> + : "=r"(v)
>>> + : "ZC"(*p));
>>> + return v;
>>> +}
>>> +
>>> +#define a_sc a_sc
>>> +static inline int a_sc(volatile int *p, int v)
>>> +{
>>> + int r;
>>> + __asm__ __volatile__ (
>>> + "sc.w %0, %1"
>>> + : "=r"(r), "=ZC"(*p)
>>> + : "0"(v) : "memory");
>>> + return r;
>>> +}
>>> +
>>> +#define a_ll_p a_ll_p
>>> +static inline void *a_ll_p(volatile void *p)
>>> +{
>>> + void *v;
>>> + __asm__ __volatile__ (
>>> + "ll.d %0, %1"
>>> + : "=r"(v)
>>> + : "ZC"(*(void *volatile *)p));
>>> + return v;
>>> +}
>>> +
>>> +#define a_sc_p a_sc_p
>>> +static inline int a_sc_p(volatile void *p, void *v)
>>> +{
>>> + long r;
>>> + __asm__ __volatile__ (
>>> + "sc.d %0, %1"
>>> + : "=r"(r), "=ZC"(*(void *volatile *)p)
>>> + : "0"(v)
>>> + : "memory");
>>> + return r;
>>> +}
>>> +
>>> +#define a_barrier a_barrier
>>> +static inline void a_barrier()
>>> +{
>>> + __asm__ __volatile__ ("dbar 0" : : : "memory");
>>> +}
>>> +
>>> +#define a_pre_llsc a_barrier
>>> +#define a_post_llsc a_barrier
>>> diff --git a/arch/loongarch64/bits/alltypes.h.in b/arch/loongarch64/bits/alltypes.h.in
>>> new file mode 100644
>>> index 00000000..06db4096
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/alltypes.h.in
>>> @@ -0,0 +1,18 @@
>>> +#define _Addr long
>>> +#define _Int64 long
>>> +#define _Reg long
>>> +
>>> +#define __BYTE_ORDER 1234
>>> +#define __LONG_MAX 0x7fffffffffffffffL
>>> +
>>> +#ifndef __cplusplus
>>> +TYPEDEF int wchar_t;
>>> +#endif
>>> +
>>> +TYPEDEF float float_t;
>>> +TYPEDEF double double_t;
>>> +
>>> +TYPEDEF struct { long long __ll; long double __ld; } max_align_t;
>>> +
>>> +TYPEDEF unsigned nlink_t;
>>> +TYPEDEF int blksize_t;
>>> diff --git a/arch/loongarch64/bits/fenv.h b/arch/loongarch64/bits/fenv.h
>>> new file mode 100644
>>> index 00000000..99e916e1
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/fenv.h
>>> @@ -0,0 +1,20 @@
>>> +#define FE_INEXACT 0x010000
>>> +#define FE_UNDERFLOW 0x020000
>>> +#define FE_OVERFLOW 0x040000
>>> +#define FE_DIVBYZERO 0x080000
>>> +#define FE_INVALID 0x100000
>>> +
>>> +#define FE_ALL_EXCEPT 0x1F0000
>>> +
>>> +#define FE_TONEAREST 0x000
>>> +#define FE_TOWARDZERO 0x100
>>> +#define FE_UPWARD 0x200
>>> +#define FE_DOWNWARD 0x300
>>> +
>>> +typedef unsigned fexcept_t;
>>> +
>>> +typedef struct {
>>> + unsigned int __cw;
>>> +} fenv_t;
>>> +
>>> +#define FE_DFL_ENV ((const fenv_t *) -1)
>>> diff --git a/arch/loongarch64/bits/float.h b/arch/loongarch64/bits/float.h
>>> new file mode 100644
>>> index 00000000..63e86d44
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/float.h
>>> @@ -0,0 +1,16 @@
>>> +#define FLT_EVAL_METHOD 0
>>> +
>>> +#define LDBL_TRUE_MIN 6.47517511943802511092443895822764655e-4966L
>>> +#define LDBL_MIN 3.36210314311209350626267781732175260e-4932L
>>> +#define LDBL_MAX 1.18973149535723176508575932662800702e+4932L
>>> +#define LDBL_EPSILON 1.92592994438723585305597794258492732e-34L
>>> +
>>> +#define LDBL_MANT_DIG 113
>>> +#define LDBL_MIN_EXP (-16381)
>>> +#define LDBL_MAX_EXP 16384
>>> +
>>> +#define LDBL_DIG 33
>>> +#define LDBL_MIN_10_EXP (-4931)
>>> +#define LDBL_MAX_10_EXP 4932
>>> +
>>> +#define DECIMAL_DIG 36
>>> diff --git a/arch/loongarch64/bits/posix.h b/arch/loongarch64/bits/posix.h
>>> new file mode 100644
>>> index 00000000..c37b94c1
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/posix.h
>>> @@ -0,0 +1,2 @@
>>> +#define _POSIX_V6_LP64_OFF64 1
>>> +#define _POSIX_V7_LP64_OFF64 1
>>> diff --git a/arch/loongarch64/bits/ptrace.h b/arch/loongarch64/bits/ptrace.h
>>> new file mode 100644
>>> index 00000000..dce2fa51
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/ptrace.h
>>> @@ -0,0 +1,4 @@
>>> +#define PTRACE_GET_THREAD_AREA 25
>>> +#define PTRACE_SET_THREAD_AREA 26
>>> +#define PTRACE_GET_WATCH_REGS 0xd0
>>> +#define PTRACE_SET_WATCH_REGS 0xd1
>>> diff --git a/arch/loongarch64/bits/reg.h b/arch/loongarch64/bits/reg.h
>>> new file mode 100644
>>> index 00000000..2633f39d
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/reg.h
>>> @@ -0,0 +1,2 @@
>>> +#undef __WORDSIZE
>>> +#define __WORDSIZE 64
>>> diff --git a/arch/loongarch64/bits/setjmp.h b/arch/loongarch64/bits/setjmp.h
>>> new file mode 100644
>>> index 00000000..4bfa374d
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/setjmp.h
>>> @@ -0,0 +1 @@
>>> +typedef unsigned long __jmp_buf[23];
>>> diff --git a/arch/loongarch64/bits/signal.h b/arch/loongarch64/bits/signal.h
>>> new file mode 100644
>>> index 00000000..e1d256e7
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/signal.h
>>> @@ -0,0 +1,92 @@
>>> +#if defined(_POSIX_SOURCE) || defined(_POSIX_C_SOURCE) \
>>> + || defined(_XOPEN_SOURCE) || defined(_GNU_SOURCE) || defined(_BSD_SOURCE)
>>> +
>>> +#if defined(_XOPEN_SOURCE) || defined(_GNU_SOURCE) || defined(_BSD_SOURCE)
>>> +#define MINSIGSTKSZ 4096
>>> +#define SIGSTKSZ 16384
>>> +#endif
>>> +
>>> +#if defined(_GNU_SOURCE) || defined(_BSD_SOURCE)
>>> +typedef unsigned long greg_t, gregset_t[32];
>>> +
>>> +struct sigcontext {
>>> + unsigned long sc_pc;
>>> + unsigned long sc_regs[32];
>>> + unsigned int sc_flags;
>>> + unsigned long sc_extcontext[] __attribute__((__aligned__(16)));
>>> +};
>>> +
>>> +typedef struct {
>>> + unsigned long __pc;
>>> + unsigned long __gregs[32];
>>> + unsigned int __flags;
>>> + unsigned long __extcontext[] __attribute__((__aligned__(16)));
>>> +} mcontext_t;
>>> +#else
>>> +typedef struct {
>>> + unsigned long __space[34];
>>> +} mcontext_t;
>>> +#endif
>>> +
>>> +struct sigaltstack {
>>> + void *ss_sp;
>>> + int ss_flags;
>>> + size_t ss_size;
>>> +};
>>> +
>>> +typedef struct __ucontext
>>> +{
>>> + unsigned long __uc_flags;
>>> + struct __ucontext *uc_link;
>>> + stack_t uc_stack;
>>> + sigset_t uc_sigmask;
>>> + long __uc_pad;
>>> + mcontext_t uc_mcontext;
>>> +} ucontext_t;
>>> +
>>> +#define SA_NOCLDSTOP 1
>>> +#define SA_NOCLDWAIT 2
>>> +#define SA_SIGINFO 4
>>> +#define SA_ONSTACK 0x08000000
>>> +#define SA_RESTART 0x10000000
>>> +#define SA_NODEFER 0x40000000
>>> +#define SA_RESETHAND 0x80000000
>>> +#define SA_RESTORER 0x0
>>> +
>>> +#endif
>>> +
>>> +#define SIGHUP 1
>>> +#define SIGINT 2
>>> +#define SIGQUIT 3
>>> +#define SIGILL 4
>>> +#define SIGTRAP 5
>>> +#define SIGABRT 6
>>> +#define SIGIOT SIGABRT
>>> +#define SIGBUS 7
>>> +#define SIGFPE 8
>>> +#define SIGKILL 9
>>> +#define SIGUSR1 10
>>> +#define SIGSEGV 11
>>> +#define SIGUSR2 12
>>> +#define SIGPIPE 13
>>> +#define SIGALRM 14
>>> +#define SIGTERM 15
>>> +#define SIGSTKFLT 16
>>> +#define SIGCHLD 17
>>> +#define SIGCONT 18
>>> +#define SIGSTOP 19
>>> +#define SIGTSTP 20
>>> +#define SIGTTIN 21
>>> +#define SIGTTOU 22
>>> +#define SIGURG 23
>>> +#define SIGXCPU 24
>>> +#define SIGXFSZ 25
>>> +#define SIGVTALRM 26
>>> +#define SIGPROF 27
>>> +#define SIGWINCH 28
>>> +#define SIGIO 29
>>> +#define SIGPOLL SIGIO
>>> +#define SIGPWR 30
>>> +#define SIGSYS 31
>>> +#define SIGUNUSED SIGSYS
>>> +#define _NSIG 65
>>> diff --git a/arch/loongarch64/bits/stat.h b/arch/loongarch64/bits/stat.h
>>> new file mode 100644
>>> index 00000000..b7f4221b
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/stat.h
>>> @@ -0,0 +1,18 @@
>>> +struct stat {
>>> + dev_t st_dev;
>>> + ino_t st_ino;
>>> + mode_t st_mode;
>>> + nlink_t st_nlink;
>>> + uid_t st_uid;
>>> + gid_t st_gid;
>>> + dev_t st_rdev;
>>> + unsigned long __pad;
>>> + off_t st_size;
>>> + blksize_t st_blksize;
>>> + int __pad2;
>>> + blkcnt_t st_blocks;
>>> + struct timespec st_atim;
>>> + struct timespec st_mtim;
>>> + struct timespec st_ctim;
>>> + unsigned __unused[2];
>>> +};
>>> diff --git a/arch/loongarch64/bits/stdint.h b/arch/loongarch64/bits/stdint.h
>>> new file mode 100644
>>> index 00000000..60c12499
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/stdint.h
>>> @@ -0,0 +1,20 @@
>>> +typedef int32_t int_fast16_t;
>>> +typedef int32_t int_fast32_t;
>>> +typedef uint32_t uint_fast16_t;
>>> +typedef uint32_t uint_fast32_t;
>>> +
>>> +#define INT_FAST16_MIN INT32_MIN
>>> +#define INT_FAST32_MIN INT32_MIN
>>> +
>>> +#define INT_FAST16_MAX INT32_MAX
>>> +#define INT_FAST32_MAX INT32_MAX
>>> +
>>> +#define UINT_FAST16_MAX UINT32_MAX
>>> +#define UINT_FAST32_MAX UINT32_MAX
>>> +
>>> +#define INTPTR_MIN INT64_MIN
>>> +#define INTPTR_MAX INT64_MAX
>>> +#define UINTPTR_MAX UINT64_MAX
>>> +#define PTRDIFF_MIN INT64_MIN
>>> +#define PTRDIFF_MAX INT64_MAX
>>> +#define SIZE_MAX UINT64_MAX
>>> diff --git a/arch/loongarch64/bits/syscall.h.in b/arch/loongarch64/bits/syscall.h.in
>>> new file mode 100644
>>> index 00000000..0980e533
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/syscall.h.in
>>> @@ -0,0 +1,303 @@
>>> +#define __NR_io_setup 0
>>> +#define __NR_io_destroy 1
>>> +#define __NR_io_submit 2
>>> +#define __NR_io_cancel 3
>>> +#define __NR_io_getevents 4
>>> +#define __NR_setxattr 5
>>> +#define __NR_lsetxattr 6
>>> +#define __NR_fsetxattr 7
>>> +#define __NR_getxattr 8
>>> +#define __NR_lgetxattr 9
>>> +#define __NR_fgetxattr 10
>>> +#define __NR_listxattr 11
>>> +#define __NR_llistxattr 12
>>> +#define __NR_flistxattr 13
>>> +#define __NR_removexattr 14
>>> +#define __NR_lremovexattr 15
>>> +#define __NR_fremovexattr 16
>>> +#define __NR_getcwd 17
>>> +#define __NR_lookup_dcookie 18
>>> +#define __NR_eventfd2 19
>>> +#define __NR_epoll_create1 20
>>> +#define __NR_epoll_ctl 21
>>> +#define __NR_epoll_pwait 22
>>> +#define __NR_dup 23
>>> +#define __NR_dup3 24
>>> +#define __NR3264_fcntl 25
>>> +#define __NR_inotify_init1 26
>>> +#define __NR_inotify_add_watch 27
>>> +#define __NR_inotify_rm_watch 28
>>> +#define __NR_ioctl 29
>>> +#define __NR_ioprio_set 30
>>> +#define __NR_ioprio_get 31
>>> +#define __NR_flock 32
>>> +#define __NR_mknodat 33
>>> +#define __NR_mkdirat 34
>>> +#define __NR_unlinkat 35
>>> +#define __NR_symlinkat 36
>>> +#define __NR_linkat 37
>>> +#define __NR_umount2 39
>>> +#define __NR_mount 40
>>> +#define __NR_pivot_root 41
>>> +#define __NR_nfsservctl 42
>>> +#define __NR3264_statfs 43
>>> +#define __NR3264_fstatfs 44
>>> +#define __NR3264_truncate 45
>>> +#define __NR3264_ftruncate 46
>>> +#define __NR_fallocate 47
>>> +#define __NR_faccessat 48
>>> +#define __NR_chdir 49
>>> +#define __NR_fchdir 50
>>> +#define __NR_chroot 51
>>> +#define __NR_fchmod 52
>>> +#define __NR_fchmodat 53
>>> +#define __NR_fchownat 54
>>> +#define __NR_fchown 55
>>> +#define __NR_openat 56
>>> +#define __NR_close 57
>>> +#define __NR_vhangup 58
>>> +#define __NR_pipe2 59
>>> +#define __NR_quotactl 60
>>> +#define __NR_getdents64 61
>>> +#define __NR3264_lseek 62
>>> +#define __NR_read 63
>>> +#define __NR_write 64
>>> +#define __NR_readv 65
>>> +#define __NR_writev 66
>>> +#define __NR_pread64 67
>>> +#define __NR_pwrite64 68
>>> +#define __NR_preadv 69
>>> +#define __NR_pwritev 70
>>> +#define __NR3264_sendfile 71
>>> +#define __NR_pselect6 72
>>> +#define __NR_ppoll 73
>>> +#define __NR_signalfd4 74
>>> +#define __NR_vmsplice 75
>>> +#define __NR_splice 76
>>> +#define __NR_tee 77
>>> +#define __NR_readlinkat 78
>>> +#define __NR_sync 81
>>> +#define __NR_fsync 82
>>> +#define __NR_fdatasync 83
>>> +#define __NR_sync_file_range 84
>>> +#define __NR_timerfd_create 85
>>> +#define __NR_timerfd_settime 86
>>> +#define __NR_timerfd_gettime 87
>>> +#define __NR_utimensat 88
>>> +#define __NR_acct 89
>>> +#define __NR_capget 90
>>> +#define __NR_capset 91
>>> +#define __NR_personality 92
>>> +#define __NR_exit 93
>>> +#define __NR_exit_group 94
>>> +#define __NR_waitid 95
>>> +#define __NR_set_tid_address 96
>>> +#define __NR_unshare 97
>>> +#define __NR_futex 98
>>> +#define __NR_set_robust_list 99
>>> +#define __NR_get_robust_list 100
>>> +#define __NR_nanosleep 101
>>> +#define __NR_getitimer 102
>>> +#define __NR_setitimer 103
>>> +#define __NR_kexec_load 104
>>> +#define __NR_init_module 105
>>> +#define __NR_delete_module 106
>>> +#define __NR_timer_create 107
>>> +#define __NR_timer_gettime 108
>>> +#define __NR_timer_getoverrun 109
>>> +#define __NR_timer_settime 110
>>> +#define __NR_timer_delete 111
>>> +#define __NR_clock_settime 112
>>> +#define __NR_clock_gettime 113
>>> +#define __NR_clock_getres 114
>>> +#define __NR_clock_nanosleep 115
>>> +#define __NR_syslog 116
>>> +#define __NR_ptrace 117
>>> +#define __NR_sched_setparam 118
>>> +#define __NR_sched_setscheduler 119
>>> +#define __NR_sched_getscheduler 120
>>> +#define __NR_sched_getparam 121
>>> +#define __NR_sched_setaffinity 122
>>> +#define __NR_sched_getaffinity 123
>>> +#define __NR_sched_yield 124
>>> +#define __NR_sched_get_priority_max 125
>>> +#define __NR_sched_get_priority_min 126
>>> +#define __NR_sched_rr_get_interval 127
>>> +#define __NR_restart_syscall 128
>>> +#define __NR_kill 129
>>> +#define __NR_tkill 130
>>> +#define __NR_tgkill 131
>>> +#define __NR_sigaltstack 132
>>> +#define __NR_rt_sigsuspend 133
>>> +#define __NR_rt_sigaction 134
>>> +#define __NR_rt_sigprocmask 135
>>> +#define __NR_rt_sigpending 136
>>> +#define __NR_rt_sigtimedwait 137
>>> +#define __NR_rt_sigqueueinfo 138
>>> +#define __NR_rt_sigreturn 139
>>> +#define __NR_setpriority 140
>>> +#define __NR_getpriority 141
>>> +#define __NR_reboot 142
>>> +#define __NR_setregid 143
>>> +#define __NR_setgid 144
>>> +#define __NR_setreuid 145
>>> +#define __NR_setuid 146
>>> +#define __NR_setresuid 147
>>> +#define __NR_getresuid 148
>>> +#define __NR_setresgid 149
>>> +#define __NR_getresgid 150
>>> +#define __NR_setfsuid 151
>>> +#define __NR_setfsgid 152
>>> +#define __NR_times 153
>>> +#define __NR_setpgid 154
>>> +#define __NR_getpgid 155
>>> +#define __NR_getsid 156
>>> +#define __NR_setsid 157
>>> +#define __NR_getgroups 158
>>> +#define __NR_setgroups 159
>>> +#define __NR_uname 160
>>> +#define __NR_sethostname 161
>>> +#define __NR_setdomainname 162
>>> +#define __NR_getrlimit 163
>>> +#define __NR_setrlimit 164
>>> +#define __NR_getrusage 165
>>> +#define __NR_umask 166
>>> +#define __NR_prctl 167
>>> +#define __NR_getcpu 168
>>> +#define __NR_gettimeofday 169
>>> +#define __NR_settimeofday 170
>>> +#define __NR_adjtimex 171
>>> +#define __NR_getpid 172
>>> +#define __NR_getppid 173
>>> +#define __NR_getuid 174
>>> +#define __NR_geteuid 175
>>> +#define __NR_getgid 176
>>> +#define __NR_getegid 177
>>> +#define __NR_gettid 178
>>> +#define __NR_sysinfo 179
>>> +#define __NR_mq_open 180
>>> +#define __NR_mq_unlink 181
>>> +#define __NR_mq_timedsend 182
>>> +#define __NR_mq_timedreceive 183
>>> +#define __NR_mq_notify 184
>>> +#define __NR_mq_getsetattr 185
>>> +#define __NR_msgget 186
>>> +#define __NR_msgctl 187
>>> +#define __NR_msgrcv 188
>>> +#define __NR_msgsnd 189
>>> +#define __NR_semget 190
>>> +#define __NR_semctl 191
>>> +#define __NR_semtimedop 192
>>> +#define __NR_semop 193
>>> +#define __NR_shmget 194
>>> +#define __NR_shmctl 195
>>> +#define __NR_shmat 196
>>> +#define __NR_shmdt 197
>>> +#define __NR_socket 198
>>> +#define __NR_socketpair 199
>>> +#define __NR_bind 200
>>> +#define __NR_listen 201
>>> +#define __NR_accept 202
>>> +#define __NR_connect 203
>>> +#define __NR_getsockname 204
>>> +#define __NR_getpeername 205
>>> +#define __NR_sendto 206
>>> +#define __NR_recvfrom 207
>>> +#define __NR_setsockopt 208
>>> +#define __NR_getsockopt 209
>>> +#define __NR_shutdown 210
>>> +#define __NR_sendmsg 211
>>> +#define __NR_recvmsg 212
>>> +#define __NR_readahead 213
>>> +#define __NR_brk 214
>>> +#define __NR_munmap 215
>>> +#define __NR_mremap 216
>>> +#define __NR_add_key 217
>>> +#define __NR_request_key 218
>>> +#define __NR_keyctl 219
>>> +#define __NR_clone 220
>>> +#define __NR_execve 221
>>> +#define __NR3264_mmap 222
>>> +#define __NR3264_fadvise64 223
>>> +#define __NR_swapon 224
>>> +#define __NR_swapoff 225
>>> +#define __NR_mprotect 226
>>> +#define __NR_msync 227
>>> +#define __NR_mlock 228
>>> +#define __NR_munlock 229
>>> +#define __NR_mlockall 230
>>> +#define __NR_munlockall 231
>>> +#define __NR_mincore 232
>>> +#define __NR_madvise 233
>>> +#define __NR_remap_file_pages 234
>>> +#define __NR_mbind 235
>>> +#define __NR_get_mempolicy 236
>>> +#define __NR_set_mempolicy 237
>>> +#define __NR_migrate_pages 238
>>> +#define __NR_move_pages 239
>>> +#define __NR_rt_tgsigqueueinfo 240
>>> +#define __NR_perf_event_open 241
>>> +#define __NR_accept4 242
>>> +#define __NR_recvmmsg 243
>>> +#define __NR_arch_specific_syscall 244
>>> +#define __NR_wait4 260
>>> +#define __NR_prlimit64 261
>>> +#define __NR_fanotify_init 262
>>> +#define __NR_fanotify_mark 263
>>> +#define __NR_name_to_handle_at 264
>>> +#define __NR_open_by_handle_at 265
>>> +#define __NR_clock_adjtime 266
>>> +#define __NR_syncfs 267
>>> +#define __NR_setns 268
>>> +#define __NR_sendmmsg 269
>>> +#define __NR_process_vm_readv 270
>>> +#define __NR_process_vm_writev 271
>>> +#define __NR_kcmp 272
>>> +#define __NR_finit_module 273
>>> +#define __NR_sched_setattr 274
>>> +#define __NR_sched_getattr 275
>>> +#define __NR_renameat2 276
>>> +#define __NR_seccomp 277
>>> +#define __NR_getrandom 278
>>> +#define __NR_memfd_create 279
>>> +#define __NR_bpf 280
>>> +#define __NR_execveat 281
>>> +#define __NR_userfaultfd 282
>>> +#define __NR_membarrier 283
>>> +#define __NR_mlock2 284
>>> +#define __NR_copy_file_range 285
>>> +#define __NR_preadv2 286
>>> +#define __NR_pwritev2 287
>>> +#define __NR_pkey_mprotect 288
>>> +#define __NR_pkey_alloc 289
>>> +#define __NR_pkey_free 290
>>> +#define __NR_statx 291
>>> +#define __NR_io_pgetevents 292
>>> +#define __NR_rseq 293
>>> +#define __NR_kexec_file_load 294
>>> +#define __NR_pidfd_send_signal 424
>>> +#define __NR_io_uring_setup 425
>>> +#define __NR_io_uring_enter 426
>>> +#define __NR_io_uring_register 427
>>> +#define __NR_open_tree 428
>>> +#define __NR_move_mount 429
>>> +#define __NR_fsopen 430
>>> +#define __NR_fsconfig 431
>>> +#define __NR_fsmount 432
>>> +#define __NR_fspick 433
>>> +#define __NR_pidfd_open 434
>>> +#define __NR_clone3 435
>>> +#define __NR_close_range 436
>>> +#define __NR_openat2 437
>>> +#define __NR_pidfd_getfd 438
>>> +#define __NR_faccessat2 439
>>> +#define __NR_process_madvise 440
>>> +#define __NR_fcntl __NR3264_fcntl
>>> +#define __NR_statfs __NR3264_statfs
>>> +#define __NR_fstatfs __NR3264_fstatfs
>>> +#define __NR_truncate __NR3264_truncate
>>> +#define __NR_ftruncate __NR3264_ftruncate
>>> +#define __NR_lseek __NR3264_lseek
>>> +#define __NR_sendfile __NR3264_sendfile
>>> +#define __NR_mmap __NR3264_mmap
>>> +#define __NR_fadvise64 __NR3264_fadvise64
>>> diff --git a/arch/loongarch64/bits/user.h b/arch/loongarch64/bits/user.h
>>> new file mode 100644
>>> index 00000000..5a71d132
>>> --- /dev/null
>>> +++ b/arch/loongarch64/bits/user.h
>>> @@ -0,0 +1,5 @@
>>> +#define ELF_NGREG 45
>>> +#define ELF_NFPREG 33
>>> +
>>> +typedef unsigned long elf_greg_t, elf_gregset_t[ELF_NGREG];
>>> +typedef double elf_fpreg_t, elf_fpregset_t[ELF_NFPREG];
>>> diff --git a/arch/loongarch64/crt_arch.h b/arch/loongarch64/crt_arch.h
>>> new file mode 100644
>>> index 00000000..e0760d9e
>>> --- /dev/null
>>> +++ b/arch/loongarch64/crt_arch.h
>>> @@ -0,0 +1,13 @@
>>> +__asm__(
>>> +".text \n"
>>> +".global " START "\n"
>>> +".type " START ", @function\n"
>>> +START ":\n"
>>> +" move $fp, $zero\n"
>>> +" move $a0, $sp\n"
>>> +".weak _DYNAMIC\n"
>>> +".hidden _DYNAMIC\n"
>>> +" la.local $a1, _DYNAMIC\n"
>>> +" bstrins.d $sp, $zero, 3, 0\n"
>>> +" b " START "_c\n"
>>> +);
>>> diff --git a/arch/loongarch64/pthread_arch.h b/arch/loongarch64/pthread_arch.h
>>> new file mode 100644
>>> index 00000000..28fbfcd1
>>> --- /dev/null
>>> +++ b/arch/loongarch64/pthread_arch.h
>>> @@ -0,0 +1,11 @@
>>> +static inline uintptr_t __get_tp()
>>> +{
>>> + uintptr_t tp;
>>> + __asm__ __volatile__("move %0, $tp" : "=r"(tp));
>>> + return tp;
>>> +}
>>> +
>>> +#define TLS_ABOVE_TP
>>> +#define GAP_ABOVE_TP 0
>>> +#define DTP_OFFSET 0
>>> +#define MC_PC __pc
>>> diff --git a/arch/loongarch64/reloc.h b/arch/loongarch64/reloc.h
>>> new file mode 100644
>>> index 00000000..6907de8e
>>> --- /dev/null
>>> +++ b/arch/loongarch64/reloc.h
>>> @@ -0,0 +1,29 @@
>>> +#if defined __loongarch_double_float
>>> +#define FP_SUFFIX "-lp64d"
>>> +#elif defined __loongarch_single_float
>>> +#define FP_SUFFIX "-lp64f"
>>> +#elif defined __loongarch_soft_float
>>> +#define FP_SUFFIX "-lp64s"
>>> +#endif
>>> +
>>> +#define LDSO_ARCH "loongarch64" FP_SUFFIX
>>> +
>>> +#define TPOFF_K 0
>>> +
>>> +#define REL_PLT R_LARCH_JUMP_SLOT
>>> +#define REL_COPY R_LARCH_COPY
>>> +#define REL_DTPMOD R_LARCH_TLS_DTPMOD64
>>> +#define REL_DTPOFF R_LARCH_TLS_DTPREL64
>>> +#define REL_TPOFF R_LARCH_TLS_TPREL64
>>> +#define REL_RELATIVE R_LARCH_RELATIVE
>>> +#define REL_SYMBOLIC R_LARCH_64
>>> +
>>> +#define CRTJMP(pc,sp) __asm__ __volatile__( \
>>> + "move $sp, %1 ; jr %0" : : "r"(pc), "r"(sp) : "memory" )
>>> +
>>> +#define GETFUNCSYM(fp, sym, got) __asm__ ( \
>>> + ".hidden " #sym "\n" \
>>> + ".align 8 \n" \
>>> + " la.local $t1, "#sym" \n" \
>>> + " move %0, $t1 \n" \
>>> + : "=r"(*(fp)) : : "memory" )
>>> diff --git a/arch/loongarch64/syscall_arch.h b/arch/loongarch64/syscall_arch.h
>>> new file mode 100644
>>> index 00000000..4d5e1885
>>> --- /dev/null
>>> +++ b/arch/loongarch64/syscall_arch.h
>>> @@ -0,0 +1,137 @@
>>> +#define __SYSCALL_LL_E(x) (x)
>>> +#define __SYSCALL_LL_O(x) (x)
>>> +
>>> +#define SYSCALL_CLOBBERLIST \
>>> + "$t0", "$t1", "$t2", "$t3", \
>>> + "$t4", "$t5", "$t6", "$t7", "$t8", "memory"
>>> +
>>> +static inline long __syscall0(long n)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0");
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "=r"(a0)
>>> + : "r"(a7)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall1(long n, long a)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall2(long n, long a, long b)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall3(long n, long a, long b, long c)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> + register long a2 __asm__("$a2") = c;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1), "r"(a2)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall4(long n, long a, long b, long c, long d)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> + register long a2 __asm__("$a2") = c;
>>> + register long a3 __asm__("$a3") = d;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1), "r"(a2), "r"(a3)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall5(long n, long a, long b, long c, long d, long e)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> + register long a2 __asm__("$a2") = c;
>>> + register long a3 __asm__("$a3") = d;
>>> + register long a4 __asm__("$a4") = e;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1), "r"(a2), "r"(a3), "r"(a4)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall6(long n, long a, long b, long c, long d, long e, long f)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> + register long a2 __asm__("$a2") = c;
>>> + register long a3 __asm__("$a3") = d;
>>> + register long a4 __asm__("$a4") = e;
>>> + register long a5 __asm__("$a5") = f;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1), "r"(a2), "r"(a3), "r"(a4), "r"(a5)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +static inline long __syscall7(long n, long a, long b, long c, long d, long e, long f, long g)
>>> +{
>>> + register long a7 __asm__("$a7") = n;
>>> + register long a0 __asm__("$a0") = a;
>>> + register long a1 __asm__("$a1") = b;
>>> + register long a2 __asm__("$a2") = c;
>>> + register long a3 __asm__("$a3") = d;
>>> + register long a4 __asm__("$a4") = e;
>>> + register long a5 __asm__("$a5") = f;
>>> + register long a6 __asm__("$a6") = g;
>>> +
>>> + __asm__ __volatile__ (
>>> + "syscall 0"
>>> + : "+r"(a0)
>>> + : "r"(a7), "r"(a1), "r"(a2), "r"(a3), "r"(a4), "r"(a5), "r"(a6)
>>> + : SYSCALL_CLOBBERLIST);
>>> + return a0;
>>> +}
>>> +
>>> +#define VDSO_USEFUL
>>> +#define VDSO_CGT_SYM "__vdso_clock_gettime"
>>> +#define VDSO_CGT_VER "LINUX_5.10"
>>> +
>>> +#define IPC_64 0
>>> diff --git a/configure b/configure
>>> index 0b966ede..93b06287 100755
>>> --- a/configure
>>> +++ b/configure
>>> @@ -328,6 +328,7 @@ i?86*) ARCH=i386 ;;
>>> x86_64-x32*|x32*|x86_64*x32) ARCH=x32 ;;
>>> x86_64-nt64*) ARCH=nt64 ;;
>>> x86_64*) ARCH=x86_64 ;;
>>> +loongarch64*) ARCH=loongarch64 ;;
>>> m68k*) ARCH=m68k ;;
>>> mips64*|mipsisa64*) ARCH=mips64 ;;
>>> mips*) ARCH=mips ;;
>>> @@ -671,6 +672,20 @@ if test "$ARCH" = "aarch64" ; then
>>> trycppif __AARCH64EB__ "$t" && SUBARCH=${SUBARCH}_be
>>> fi
>>>
>>> +if test "$ARCH" = "loongarch64" ; then
>>> +trycppif __loongarch_double_float "$t" && SUBARCH=${SUBARCH}-lp64d
>>> +trycppif __loongarch_single_float "$t" && SUBARCH=${SUBARCH}-lp64f
>>> +trycppif __loongarch_soft_float "$t" && SUBARCH=${SUBARCH}-lp64s
>>> +printf "checking whether compiler support FCSRs... "
>>> +echo "__asm__(\"movfcsr2gr \$t0,\$fcsr0\");" > "$tmpc"
>>> +if $CC -c -o /dev/null "$tmpc" >/dev/null 2>&1 ; then
>>> +printf "yes\n"
>>> +else
>>> +printf "no\n"
>>> +CFLAGS_AUTO="$CFLAGS_AUTO -DBROKEN_LOONGARCH_FCSR_ASM"
>>> +fi
>>> +fi
>>> +
>>> if test "$ARCH" = "m68k" ; then
>>> if trycppif "__HAVE_68881__" ; then : ;
>>> elif trycppif "__mcffpu__" ; then SUBARCH="-fp64"
>>> diff --git a/include/elf.h b/include/elf.h
>>> index 23f2c4bc..7114f262 100644
>>> --- a/include/elf.h
>>> +++ b/include/elf.h
>>> @@ -315,7 +315,8 @@ typedef struct {
>>> #define EM_RISCV 243
>>> #define EM_BPF 247
>>> #define EM_CSKY 252
>>> -#define EM_NUM 253
>>> +#define EM_LOONGARCH 258
>>> +#define EM_NUM 259
>>>
>>> #define EM_ALPHA 0x9026
>>>
>>> @@ -699,6 +700,11 @@ typedef struct {
>>> #define NT_MIPS_FP_MODE 0x801
>>> #define NT_MIPS_MSA 0x802
>>> #define NT_VERSION 1
>>> +#define NT_LOONGARCH_CPUCFG 0xa00
>>> +#define NT_LOONGARCH_CSR 0xa01
>>> +#define NT_LOONGARCH_LSX 0xa02
>>> +#define NT_LOONGARCH_LASX 0xa03
>>> +#define NT_LOONGARCH_LBT 0xa04
>>>
>>>
>>>
>>> @@ -3293,6 +3299,102 @@ enum
>>> #define R_RISCV_SET32 56
>>> #define R_RISCV_32_PCREL 57
>>>
>>> +#define EF_LARCH_ABI_MODIFIER_MASK 0x07
>>> +#define EF_LARCH_ABI_SOFT_FLOAT 0x01
>>> +#define EF_LARCH_ABI_SINGLE_FLOAT 0x02
>>> +#define EF_LARCH_ABI_DOUBLE_FLOAT 0x03
>>> +#define EF_LARCH_OBJABI_V1 0x40
>>> +
>>> +#define R_LARCH_NONE 0
>>> +#define R_LARCH_32 1
>>> +#define R_LARCH_64 2
>>> +#define R_LARCH_RELATIVE 3
>>> +#define R_LARCH_COPY 4
>>> +#define R_LARCH_JUMP_SLOT 5
>>> +#define R_LARCH_TLS_DTPMOD32 6
>>> +#define R_LARCH_TLS_DTPMOD64 7
>>> +#define R_LARCH_TLS_DTPREL32 8
>>> +#define R_LARCH_TLS_DTPREL64 9
>>> +#define R_LARCH_TLS_TPREL32 10
>>> +#define R_LARCH_TLS_TPREL64 11
>>> +#define R_LARCH_IRELATIVE 12
>>> +#define R_LARCH_MARK_LA 20
>>> +#define R_LARCH_MARK_PCREL 21
>>> +#define R_LARCH_SOP_PUSH_PCREL 22
>>> +#define R_LARCH_SOP_PUSH_ABSOLUTE 23
>>> +#define R_LARCH_SOP_PUSH_DUP 24
>>> +#define R_LARCH_SOP_PUSH_GPREL 25
>>> +#define R_LARCH_SOP_PUSH_TLS_TPREL 26
>>> +#define R_LARCH_SOP_PUSH_TLS_GOT 27
>>> +#define R_LARCH_SOP_PUSH_TLS_GD 28
>>> +#define R_LARCH_SOP_PUSH_PLT_PCREL 29
>>> +#define R_LARCH_SOP_ASSERT 30
>>> +#define R_LARCH_SOP_NOT 31
>>> +#define R_LARCH_SOP_SUB 32
>>> +#define R_LARCH_SOP_SL 33
>>> +#define R_LARCH_SOP_SR 34
>>> +#define R_LARCH_SOP_ADD 35
>>> +#define R_LARCH_SOP_AND 36
>>> +#define R_LARCH_SOP_IF_ELSE 37
>>> +#define R_LARCH_SOP_POP_32_S_10_5 38
>>> +#define R_LARCH_SOP_POP_32_U_10_12 39
>>> +#define R_LARCH_SOP_POP_32_S_10_12 40
>>> +#define R_LARCH_SOP_POP_32_S_10_16 41
>>> +#define R_LARCH_SOP_POP_32_S_10_16_S2 42
>>> +#define R_LARCH_SOP_POP_32_S_5_20 43
>>> +#define R_LARCH_SOP_POP_32_S_0_5_10_16_S2 44
>>> +#define R_LARCH_SOP_POP_32_S_0_10_10_16_S2 45
>>> +#define R_LARCH_SOP_POP_32_U 46
>>> +#define R_LARCH_ADD8 47
>>> +#define R_LARCH_ADD16 48
>>> +#define R_LARCH_ADD24 49
>>> +#define R_LARCH_ADD32 50
>>> +#define R_LARCH_ADD64 51
>>> +#define R_LARCH_SUB8 52
>>> +#define R_LARCH_SUB16 53
>>> +#define R_LARCH_SUB24 54
>>> +#define R_LARCH_SUB32 55
>>> +#define R_LARCH_SUB64 56
>>> +#define R_LARCH_GNU_VTINHERIT 57
>>> +#define R_LARCH_GNU_VTENTRY 58
>>> +#define R_LARCH_B16 64
>>> +#define R_LARCH_B21 65
>>> +#define R_LARCH_B26 66
>>> +#define R_LARCH_ABS_HI20 67
>>> +#define R_LARCH_ABS_LO12 68
>>> +#define R_LARCH_ABS64_LO20 69
>>> +#define R_LARCH_ABS64_HI12 70
>>> +#define R_LARCH_PCALA_HI20 71
>>> +#define R_LARCH_PCALA_LO12 72
>>> +#define R_LARCH_PCALA64_LO20 73
>>> +#define R_LARCH_PCALA64_HI12 74
>>> +#define R_LARCH_GOT_PC_HI20 75
>>> +#define R_LARCH_GOT_PC_LO12 76
>>> +#define R_LARCH_GOT64_PC_LO20 77
>>> +#define R_LARCH_GOT64_PC_HI12 78
>>> +#define R_LARCH_GOT_HI20 79
>>> +#define R_LARCH_GOT_LO12 80
>>> +#define R_LARCH_GOT64_LO20 81
>>> +#define R_LARCH_GOT64_HI12 82
>>> +#define R_LARCH_TLS_LE_HI20 83
>>> +#define R_LARCH_TLS_LE_LO12 84
>>> +#define R_LARCH_TLS_LE64_LO20 85
>>> +#define R_LARCH_TLS_LE64_HI12 86
>>> +#define R_LARCH_TLS_IE_PC_HI20 87
>>> +#define R_LARCH_TLS_IE_PC_LO12 88
>>> +#define R_LARCH_TLS_IE64_PC_LO20 89
>>> +#define R_LARCH_TLS_IE64_PC_HI12 90
>>> +#define R_LARCH_TLS_IE_HI20 91
>>> +#define R_LARCH_TLS_IE_LO12 92
>>> +#define R_LARCH_TLS_IE64_LO20 93
>>> +#define R_LARCH_TLS_IE64_HI12 94
>>> +#define R_LARCH_TLS_LD_PC_HI20 95
>>> +#define R_LARCH_TLS_LD_HI20 96
>>> +#define R_LARCH_TLS_GD_PC_HI20 97
>>> +#define R_LARCH_TLS_GD_HI20 98
>>> +#define R_LARCH_32_PCREL 99
>>> +#define R_LARCH_RELAX 100
>>> +
>>> #ifdef __cplusplus
>>> }
>>> #endif
>>> diff --git a/src/fenv/loongarch64/fenv.S b/src/fenv/loongarch64/fenv.S
>>> new file mode 100644
>>> index 00000000..54064e01
>>> --- /dev/null
>>> +++ b/src/fenv/loongarch64/fenv.S
>>> @@ -0,0 +1,78 @@
>>> +#ifndef __loongarch_soft_float
>>> +
>>> +#ifdef BROKEN_LOONGARCH_FCSR_ASM
>>> +#define FCSR $r0
>>> +#else
>>> +#define FCSR $fcsr0
>>> +#endif
>>> +
>>> +.global feclearexcept
>>> +.type feclearexcept,@function
>>> +feclearexcept:
>>> + li.w $t0, 0x1f0000
>>> + and $a0, $a0, $t0
>>> + movfcsr2gr $t1, FCSR
>>> + andn $t1, $t1, $a0
>>> + movgr2fcsr FCSR, $t1
>>> + li.w $a0, 0
>>> + jr $ra
>>> +
>>> +.global feraiseexcept
>>> +.type feraiseexcept,@function
>>> +feraiseexcept:
>>> + li.w $t0, 0x1f0000
>>> + and $a0, $a0, $t0
>>> + movfcsr2gr $t1, FCSR
>>> + or $t1, $t1, $a0
>>> + movgr2fcsr FCSR, $t1
>>> + li.w $a0, 0
>>> + jr $ra
>>> +
>>> +.global fetestexcept
>>> +.type fetestexcept,@function
>>> +fetestexcept:
>>> + li.w $t0, 0x1f0000
>>> + and $a0, $a0, $t0
>>> + movfcsr2gr $t1, FCSR
>>> + and $a0, $t1, $a0
>>> + jr $ra
>>> +
>>> +.global fegetround
>>> +.type fegetround,@function
>>> +fegetround:
>>> + movfcsr2gr $t0, FCSR
>>> + andi $a0, $t0, 0x300
>>> + jr $ra
>>> +
>>> +.global __fesetround
>>> +.hidden __fesetround
>>> +.type __fesetround,@function
>>> +__fesetround:
>>> + li.w $t0, 0x300
>>> + and $a0, $a0, $t0
>>> + movfcsr2gr $t1, FCSR
>>> + andn $t1, $t1, $t0
>>> + or $t1, $t1, $a0
>>> + movgr2fcsr FCSR, $t1
>>> + li.w $a0, 0
>>> + jr $ra
>>> +
>>> +.global fegetenv
>>> +.type fegetenv,@function
>>> +fegetenv:
>>> + movfcsr2gr $t0, FCSR
>>> + st.w $t0, $a0, 0
>>> + li.w $a0, 0
>>> + jr $ra
>>> +
>>> +.global fesetenv
>>> +.type fesetenv,@function
>>> +fesetenv:
>>> + addi.d $t0, $a0, 1
>>> + beq $t0, $r0, 1f
>>> + ld.w $t0, $a0, 0
>>> +1: movgr2fcsr FCSR, $t0
>>> + li.w $a0, 0
>>> + jr $ra
>>> +
>>> +#endif
>>> diff --git a/src/ldso/loongarch64/dlsym.s b/src/ldso/loongarch64/dlsym.s
>>> new file mode 100644
>>> index 00000000..26fabcdb
>>> --- /dev/null
>>> +++ b/src/ldso/loongarch64/dlsym.s
>>> @@ -0,0 +1,7 @@
>>> +.global dlsym
>>> +.hidden __dlsym
>>> +.type dlsym,@function
>>> +dlsym:
>>> + move $a2, $ra
>>> + la.global $t0, __dlsym
>>> + jr $t0
>>> diff --git a/src/setjmp/loongarch64/longjmp.S b/src/setjmp/loongarch64/longjmp.S
>>> new file mode 100644
>>> index 00000000..896d2e26
>>> --- /dev/null
>>> +++ b/src/setjmp/loongarch64/longjmp.S
>>> @@ -0,0 +1,32 @@
>>> +.global _longjmp
>>> +.global longjmp
>>> +.type _longjmp,@function
>>> +.type longjmp,@function
>>> +_longjmp:
>>> +longjmp:
>>> + ld.d $ra, $a0, 0
>>> + ld.d $sp, $a0, 8
>>> + ld.d $r21,$a0, 16
>>> + ld.d $fp, $a0, 24
>>> + ld.d $s0, $a0, 32
>>> + ld.d $s1, $a0, 40
>>> + ld.d $s2, $a0, 48
>>> + ld.d $s3, $a0, 56
>>> + ld.d $s4, $a0, 64
>>> + ld.d $s5, $a0, 72
>>> + ld.d $s6, $a0, 80
>>> + ld.d $s7, $a0, 88
>>> + ld.d $s8, $a0, 96
>>> +#ifndef __loongarch_soft_float
>>> + fld.d $fs0, $a0, 104
>>> + fld.d $fs1, $a0, 112
>>> + fld.d $fs2, $a0, 120
>>> + fld.d $fs3, $a0, 128
>>> + fld.d $fs4, $a0, 136
>>> + fld.d $fs5, $a0, 144
>>> + fld.d $fs6, $a0, 152
>>> + fld.d $fs7, $a0, 160
>>> +#endif
>>> + sltui $a0, $a1, 1
>>> + add.d $a0, $a0, $a1
>>> + jr $ra
>>> diff --git a/src/setjmp/loongarch64/setjmp.S b/src/setjmp/loongarch64/setjmp.S
>>> new file mode 100644
>>> index 00000000..d158a3d2
>>> --- /dev/null
>>> +++ b/src/setjmp/loongarch64/setjmp.S
>>> @@ -0,0 +1,34 @@
>>> +.global __setjmp
>>> +.global _setjmp
>>> +.global setjmp
>>> +.type __setjmp,@function
>>> +.type _setjmp,@function
>>> +.type setjmp,@function
>>> +__setjmp:
>>> +_setjmp:
>>> +setjmp:
>>> + st.d $ra, $a0, 0
>>> + st.d $sp, $a0, 8
>>> + st.d $r21,$a0, 16
>>> + st.d $fp, $a0, 24
>>> + st.d $s0, $a0, 32
>>> + st.d $s1, $a0, 40
>>> + st.d $s2, $a0, 48
>>> + st.d $s3, $a0, 56
>>> + st.d $s4, $a0, 64
>>> + st.d $s5, $a0, 72
>>> + st.d $s6, $a0, 80
>>> + st.d $s7, $a0, 88
>>> + st.d $s8, $a0, 96
>>> +#ifndef __loongarch_soft_float
>>> + fst.d $fs0, $a0, 104
>>> + fst.d $fs1, $a0, 112
>>> + fst.d $fs2, $a0, 120
>>> + fst.d $fs3, $a0, 128
>>> + fst.d $fs4, $a0, 136
>>> + fst.d $fs5, $a0, 144
>>> + fst.d $fs6, $a0, 152
>>> + fst.d $fs7, $a0, 160
>>> +#endif
>>> + move $a0, $zero
>>> + jr $ra
>>> diff --git a/src/signal/loongarch64/restore.s b/src/signal/loongarch64/restore.s
>>> new file mode 100644
>>> index 00000000..f8e6daeb
>>> --- /dev/null
>>> +++ b/src/signal/loongarch64/restore.s
>>> @@ -0,0 +1,10 @@
>>> +.global __restore_rt
>>> +.global __restore
>>> +.hidden __restore_rt
>>> +.hidden __restore
>>> +.type __restore_rt,@function
>>> +.type __restore,@function
>>> +__restore_rt:
>>> +__restore:
>>> + li.w $a7, 139
>>> + syscall 0
>>> diff --git a/src/signal/loongarch64/sigsetjmp.s b/src/signal/loongarch64/sigsetjmp.s
>>> new file mode 100644
>>> index 00000000..992ab1a4
>>> --- /dev/null
>>> +++ b/src/signal/loongarch64/sigsetjmp.s
>>> @@ -0,0 +1,25 @@
>>> +.global sigsetjmp
>>> +.global __sigsetjmp
>>> +.type sigsetjmp,@function
>>> +.type __sigsetjmp,@function
>>> +sigsetjmp:
>>> +__sigsetjmp:
>>> + beq $a1, $zero, 1f
>>> + st.d $ra, $a0, 168
>>> + st.d $s0, $a0, 176
>>> + move $s0, $a0
>>> +
>>> + la.global $t0, setjmp
>>> + jirl $ra, $t0, 0
>>> +
>>> + move $a1, $a0 # Return from 'setjmp' or 'longjmp'
>>> + move $a0, $s0
>>> + ld.d $ra, $a0, 168
>>> + ld.d $s0, $a0, 176
>>> +
>>> +.hidden __sigsetjmp_tail
>>> + la.global $t0, __sigsetjmp_tail
>>> + jr $t0
>>> +1:
>>> + la.global $t0, setjmp
>>> + jr $t0
>>> diff --git a/src/thread/loongarch64/__set_thread_area.s b/src/thread/loongarch64/__set_thread_area.s
>>> new file mode 100644
>>> index 00000000..ffdd52f1
>>> --- /dev/null
>>> +++ b/src/thread/loongarch64/__set_thread_area.s
>>> @@ -0,0 +1,7 @@
>>> +.global __set_thread_area
>>> +.hidden __set_thread_area
>>> +.type __set_thread_area,@function
>>> +__set_thread_area:
>>> + move $tp, $a0
>>> + move $a0, $zero
>>> + jr $ra
>>> diff --git a/src/thread/loongarch64/__unmapself.s b/src/thread/loongarch64/__unmapself.s
>>> new file mode 100644
>>> index 00000000..1de334af
>>> --- /dev/null
>>> +++ b/src/thread/loongarch64/__unmapself.s
>>> @@ -0,0 +1,7 @@
>>> +.global __unmapself
>>> +.type __unmapself, @function
>>> +__unmapself:
>>> + li.d $a7, 215 # call munmap
>>> + syscall 0
>>> + li.d $a7, 93 # call exit
>>> + syscall 0
>>> diff --git a/src/thread/loongarch64/clone.s b/src/thread/loongarch64/clone.s
>>> new file mode 100644
>>> index 00000000..db9015e6
>>> --- /dev/null
>>> +++ b/src/thread/loongarch64/clone.s
>>> @@ -0,0 +1,28 @@
>>> +#__clone(func, stack, flags, arg, ptid, tls, ctid)
>>> +# a0, a1, a2, a3, a4, a5, a6
>>> +# sys_clone(flags, stack, ptid, ctid, tls)
>>> +# a0, a1, a2, a3, a4
>>> +
>>> +.global __clone
>>> +.hidden __clone
>>> +.type __clone,@function
>>> +__clone:
>>> + # Save function pointer and argument pointer on new thread stack
>>> + addi.d $a1, $a1, -16
>>> + st.d $a0, $a1, 0 # save function pointer
>>> + st.d $a3, $a1, 8 # save argument pointer
>>> + or $a0, $a2, $zero
>>> + or $a2, $a4, $zero
>>> + or $a3, $a6, $zero
>>> + or $a4, $a5, $zero
>>> + ori $a7, $zero, 220
>>> + syscall 0 # call clone
>>> +
>>> + beqz $a0, 1f # whether child process
>>> + jirl $zero, $ra, 0 # parent process return
>>> +1:
>>> + ld.d $t8, $sp, 0 # function pointer
>>> + ld.d $a0, $sp, 8 # argument pointer
>>> + jirl $ra, $t8, 0 # call the user's function
>>> + ori $a7, $zero, 93
>>> + syscall 0 # child process exit
>>> diff --git a/src/thread/loongarch64/syscall_cp.s b/src/thread/loongarch64/syscall_cp.s
>>> new file mode 100644
>>> index 00000000..0fbc7a47
>>> --- /dev/null
>>> +++ b/src/thread/loongarch64/syscall_cp.s
>>> @@ -0,0 +1,29 @@
>>> +.global __cp_begin
>>> +.hidden __cp_begin
>>> +.global __cp_end
>>> +.hidden __cp_end
>>> +.global __cp_cancel
>>> +.hidden __cp_cancel
>>> +.hidden __cancel
>>> +.global __syscall_cp_asm
>>> +.hidden __syscall_cp_asm
>>> +.type __syscall_cp_asm,@function
>>> +
>>> +__syscall_cp_asm:
>>> +__cp_begin:
>>> + ld.w $a0, $a0, 0
>>> + bnez $a0, __cp_cancel
>>> + move $t8, $a1 # reserve system call number
>>> + move $a0, $a2
>>> + move $a1, $a3
>>> + move $a2, $a4
>>> + move $a3, $a5
>>> + move $a4, $a6
>>> + move $a5, $a7
>>> + move $a7, $t8
>>> + syscall 0
>>> +__cp_end:
>>> + jr $ra
>>> +__cp_cancel:
>>> + la.local $t8, __cancel
>>> + jr $t8
>>> --
>>> 2.37.1
>>>
prev parent reply other threads:[~2023-11-17 7:19 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-04-18 1:28 [musl] add loongarch64 port v7 王洪亮
2023-04-18 9:38 ` Szabolcs Nagy
2023-04-18 10:47 ` Szabolcs Nagy
2023-04-18 11:32 ` 王洪亮
2023-05-10 3:36 ` 王洪亮
2023-06-01 12:44 ` wanghongliang
2023-06-25 3:43 ` Hongliang Wang
2023-07-19 6:55 ` Hongliang Wang
2023-08-05 6:18 ` 翟小娟
2023-08-05 15:43 ` Rich Felker
2023-08-13 1:41 ` Rich Felker
2023-08-15 9:24 ` Hongliang Wang
2023-08-15 9:32 ` alice
2023-08-25 9:27 ` 翟小娟
[not found] ` <99d954ca-faee-2cac-97af-7fc2ecdb9a89@loongson.cn>
2023-09-20 7:45 ` Jianmin Lv
2023-09-20 13:16 ` Szabolcs Nagy
2023-09-22 1:37 ` Hongliang Wang
2023-09-26 3:28 ` [musl] add loongarch64 port v8 Hongliang Wang
2023-10-08 3:05 ` 花静云
2023-11-09 13:15 ` Jingyun Hua
2023-11-14 13:16 ` Jingyun Hua
2023-11-16 2:54 ` [musl] add loongarch64 port v9 Hongliang Wang
2023-11-16 16:10 ` Rich Felker
2023-11-17 7:20 ` Hongliang Wang
2023-11-17 17:25 ` Rich Felker
2023-11-18 4:19 ` Jingyun Hua
2023-11-20 6:11 ` Hongliang Wang
2023-12-08 8:23 ` [musl] Re:[musl] add loongarch64 port v8 Hongliang Wang
2023-12-18 7:44 ` Jingyun Hua
2024-01-29 1:26 ` [musl] " Hongliang Wang
2024-01-29 3:14 ` Rich Felker
2024-01-29 7:37 ` Hongliang Wang
2023-11-16 16:44 ` [musl] add loongarch64 port v9 Szabolcs Nagy
2023-11-16 17:18 ` Szabolcs Nagy
2023-11-17 7:19 ` Hongliang Wang [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8cf01ee8-4c45-340c-8a17-4df1154de322@loongson.cn \
--to=wanghongliang@loongson.cn \
--cc=musl@lists.openwall.com \
--cc=nsz@port70.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this public inbox
https://git.vuxu.org/mirror/musl/
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).