From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RCVD_IN_DNSWL_NONE autolearn=ham autolearn_force=no version=3.4.2 Received: from minnie.tuhs.org (minnie.tuhs.org [45.79.103.53]) by inbox.vuxu.org (OpenSMTPD) with ESMTP id 91a35a8f for ; Mon, 21 Oct 2019 19:38:26 +0000 (UTC) Received: by minnie.tuhs.org (Postfix, from userid 112) id 5EF339B9E9; Tue, 22 Oct 2019 05:38:25 +1000 (AEST) Received: from minnie.tuhs.org (localhost [127.0.0.1]) by minnie.tuhs.org (Postfix) with ESMTP id 8F50B94772; Tue, 22 Oct 2019 05:38:11 +1000 (AEST) Received: by minnie.tuhs.org (Postfix, from userid 112) id 836B994772; Tue, 22 Oct 2019 05:38:09 +1000 (AEST) Received: from wopr.sciops.net (wopr.sciops.net [216.126.196.60]) by minnie.tuhs.org (Postfix) with ESMTPS id DDFA293D8C for ; Tue, 22 Oct 2019 05:38:08 +1000 (AEST) Received: (qmail 77115 invoked by uid 1001); 21 Oct 2019 12:38:04 -0700 Date: Mon, 21 Oct 2019 12:38:04 -0700 From: Kurt H Maier To: Paul Winalski Message-ID: <20191021193804.GA75174@wopr> References: <201910191440.x9JEe8PB035921@tahoe.cs.Dartmouth.EDU> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Subject: Re: [TUHS] How to do differential/integral on a PDP-7, was: Space Travel X-BeenThere: tuhs@minnie.tuhs.org X-Mailman-Version: 2.1.26 Precedence: list List-Id: The Unix Heritage Society mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: The Eunuchs Hysterical Society Errors-To: tuhs-bounces@minnie.tuhs.org Sender: "TUHS" On Mon, Oct 21, 2019 at 03:21:34PM -0400, Paul Winalski wrote: > > I'm much more alarmed by the lack of memory error detection and > correction on a lot of modern computers. This is one of my big > concerns with the use of GPUs for heavy-duty computation. GPUs > typically don't have memory with error detection because the worst > that happens if there's a memory error in the GPU is you get a bad > pixel or two displayed. I'd not like to cross a bridge whose design > software used CUDA. > This might be true of gaming cards and low-end workstation cards, but the higher-end Quadro cards and all the dedicated GPGPUs have had at least ECC since at least the Maxwell era. Of course, nobody does a single run and stamps the drawings, so the process itself should catch these problems, but any correctly-configured CAD workstation or compute cluster has error-correcting memory, both on the system and in the accelerators. AMD's Radeon Pro and Intel's Xeon Phi accelerators do ECC as well. khm