From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-3.3 required=5.0 tests=MAILING_LIST_MULTI, RCVD_IN_DNSWL_MED,RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 17815 invoked from network); 21 Oct 2023 00:02:47 -0000 Received: from second.openwall.net (193.110.157.125) by inbox.vuxu.org with ESMTPUTF8; 21 Oct 2023 00:02:47 -0000 Received: (qmail 19940 invoked by uid 550); 21 Oct 2023 00:02:44 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-ID: Reply-To: musl@lists.openwall.com Received: (qmail 19900 invoked from network); 21 Oct 2023 00:02:43 -0000 Date: Sat, 21 Oct 2023 11:02:27 +1100 (AEDT) From: Damian McGuckin To: musl@lists.openwall.com Message-ID: <1de79c9a-b970-5285-3a90-a3d5a8df90e9@esi.com.au> MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset=US-ASCII Subject: [musl] Floating Point Operations Cycles/Latency for ARM + RISC-V + POWER10 What modern CPUs have a penalty for double precision floating point arithmetic on scalars compared to single precision once they are in a register, i.e. ignoring memory fetch issues. I have Agner Fog's excellent document for X86-64 which basically says that 32 bit and 64 bit operations for scalars take the same amount of time. I am looking for the same type of information for ARM and RISC-V. I found the data for 32-bit in the online documentation. But nothing bout 64 bit. I cannot find anything on this topic on RISC-V or POWER10. Maybe I am not searching on the right terms. Note that I am after the raw performance, not say the relative performance of say the MUSL sin() routine compared with the MUSL sinf(). Thanks - Damian