From: tfb@tfeb.org
To: Larry McVoy
Cc: tuhs@minnie.tuhs.org
Date: Fri, 29 Jun 2018 16:32:59 +0100
Subject: Re: [TUHS] PDP-11 legacy, C, and modern architectures

On 28 Jun 2018, at 18:09, Larry McVoy wrote:
>
> I'm not sure how people keep missing the original point. Which was:
> the market won't choose a bunch of wimpy cpus when it can get faster
> ones. It wasn't about the physics (which I'm not arguing with), it
> was about a choice between lots of wimpy cpus and a smaller number of
> fast cpus. The market wants the latter, as Ted said, Sun bet heavily
> on the former and is no more.

[I said I wouldn't reply more: I'm weak.]

I think we have been talking at cross-purposes, which is probably my fault. I think you've been using 'wimpy' to mean 'intentionally slower than they could be', while I have been using it to mean 'of very tiny computational power compared to the power of the whole system'. Your usage is probably more correct in terms of the way the term has been used historically.

But I think my usage tells you something important: that the performance of individual cores will, inevitably, become increasingly tiny compared to the performance of the system they are in, and will almost certainly become asymptotically constant (i.e. however much money you spend on an individual core, it will not be very much faster than the one you can buy off the shelf). So, if you want to keep seeing performance improvements (especially if you want to keep seeing exponential improvements for any significant time), then you have no choice but to start thinking about parallelism.
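To put very rough numbers on that, here is a back-of-the-envelope sketch (mine, not anything from the thread; the core counts, relative speeds and serial fractions are all invented purely for illustration) of the Amdahl's-law arithmetic behind the fast-versus-wimpy trade-off:

    # Hypothetical Amdahl's-law sketch: all figures are made up.

    def speedup(cores, serial_fraction, core_speed=1.0):
        """Speedup over one baseline core: the serial part runs on a
        single core, the parallel part is spread across all of them."""
        serial_time = serial_fraction / core_speed
        parallel_time = (1.0 - serial_fraction) / (core_speed * cores)
        return 1.0 / (serial_time + parallel_time)

    # A few fast cores versus many cores each a quarter as fast.
    for s in (0.0, 0.05, 0.25):
        fast = speedup(8, s, core_speed=1.0)
        wimpy = speedup(64, s, core_speed=0.25)
        print(f"serial fraction {s:.2f}: "
              f"8 fast cores {fast:5.1f}x, 64 quarter-speed cores {wimpy:5.1f}x")

Once the per-core curve flattens, core count is the only knob left, and how much turning it buys you depends entirely on how parallel the work is.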
The place I work now is an example of this. Our machines have the fastest cores we could get. But we need nearly half a million of them to do the work we want to do (this is across three systems).

I certainly don't want to argue that choosing intentionally slower cores than you can get is a good idea in general (although there are cases where it may be, including, perhaps, some HPC workloads).

---

However, let me add something about the Sun T-series machines, which were 'wimpy cores' in the 'intentionally slower' sense. When these started appearing I worked in a canonical Sun customer of the time: a big retail bank. And the reason we did not buy lots of them was nothing to do with how fast they were (which was more than fast enough); it was because Sun's software was inadequate.

To see why, consider what retail banks' IT looked like in the late 2000s. We had a great mass of applications, the majority of which ran on individual Solaris instances (at least two, live and DR, per application). A very high proportion (not all) of these applications had utterly negligible computational requirements. But they had very strong requirements on availability, or at least the parts of the business which owned them said they did, and we could not argue with that, especially given that this was 2008 and we knew that if we had a visible outage there was a fair chance it would be misread as the bank failing, resulting in a cascade failure of the banking system and the inevitable zombie apocalypse. No one wanted that.

Some consolidation had already been done: we had a bunch of 25ks, many of which were split into lots of domains. The smallest domain on a 25k was a single CPU board, which was 4 sockets and therefore 8 or 16 cores (I forget how many cores there were per socket). I think you could not partition a 25k like that completely, because you ran out of IO assemblies, so some domains had to be bigger.

This smallest domain was huge overkill for many of these applications, and 25ks were terribly expensive as well.

So, along came the first T-series boxes and they were just obviously ideal: we could consolidate lots and lots of these things onto a single T-series box, with a DR partner, and it would cost some tiny fraction of what a 25k cost, and use almost no power (DC power was and is a real problem).

But we didn't do that: we did some experiments, and some things moved, I think, but on the whole we didn't move. The reason we didn't move was nothing, at all, to do with performance; it was, as I said, software, and in particular virtualisation. Sun had two approaches to this, neither of which solved the problems that everyone had.

At the firmware level there were LDOMs (which I think did not work very well early on, or may not have existed yet), which let you cut up a machine into lots of smaller ones with a hypervisor in the usual way.
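For concreteness, carving a box up that way looked roughly like this (my sketch, not something we ran: the guest names and sizes are invented, and the ldm(1M) subcommands are those of the later Logical Domains releases, so they may not match what the very early firmware offered):

    #!/usr/bin/env python3
    """Rough sketch of carving one T-series box into guest domains by
    driving the Logical Domains manager. Guest names and resource sizes
    are hypothetical; ldm(1M) subcommands are from later LDoms releases."""

    import subprocess

    def ldm(*args):
        # Run one ldm(1M) command on the control domain.
        subprocess.run(["ldm", *args], check=True)

    def create_guest(name, vcpus, memory):
        ldm("add-domain", name)
        ldm("add-vcpu", str(vcpus), name)
        ldm("add-memory", memory, name)
        # A usable guest also needs virtual disk and network devices
        # (ldm add-vdisk / ldm add-vnet) before it can boot anything.
        ldm("bind-domain", name)
        ldm("start-domain", name)

    # One small guest per consolidated application (names invented):
    for app in ("payments-live", "statements-live", "branch-gw-live"):
        create_guest(app, vcpus=8, memory="8G")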
But all of these smaller machines shared the same hardware, of course. So if you had a serious problem on the machine, then all of your LDOMs went away, and all of the services on that machine had an outage, at once. This was not the case on a 25k: if a CPU or an IO board died it would affect the domain it was part of, but everything else would carry on.

At the OS level there were zones (containers). Zones had the advantage that they could look like Solaris 8 (the machine itself, and therefore the LDOMs it got split into, could only run Solaris 10), which is what all the old applications were running, and they could be very fine-grained. But they weren't really very isolated from each other (especially in hindsight), they didn't look *enough* like Solaris 8 for people to be willing to certify the applications on them, and they still had the all-your-eggs-in-one-basket problem if the hardware died.

The thing that really killed it was the eggs-in-one-basket problem. We had previous experience with consolidating a lot of applications onto one OS & hardware instance, and no-one wanted to go anywhere near that. If you needed to get an outage (say to install a critical security patch, or because of failing hardware) you had to negotiate it with *all* the application teams, all of whom had different requirements and all of whom regarded their application as the most important thing the bank ran (some of them might have been right). It could very easily take more than a year to get an outage on the big shared-services machines, and when the outage happened you would have at least 50 people involved to stop and restart everything. It was just a scarring nightmare.

So, to move to the T-series machines, what we would have needed was a way of partitioning the machine such that the partitions ran Solaris 8 natively, and such that the partitions could be moved, live, to other systems to deal with the eggs-in-one-basket problem. Sun didn't have that, or anything near it (they knew this, I think, and they got closer later on, but it was too late).

So, the machines failed for us. But this failure was nothing, at all, to do with performance, let alone performance per core, which was generally more than adequate. Lots of wimpy, low-power CPUs were in fact exactly what we needed: we just needed the right software on top of them, and that software was not there.

(As an addendum: what eventually happened / is happening, I think, is that applications are getting recertified on Linux/x86 sitting on top of ESX, *which can move VMs live between hosts*, thus solving the problem.)