From mboxrd@z Thu Jan 1 00:00:00 1970
From: bogus@does.not.exist.com ()
Date: Mon, 4 May 2009 19:42:15 +0000
Subject: No subject
Message-ID:
Topicbox-Message-UUID: 0082827c-ead5-11e9-9d60-3106f5b1d025

have thousands. We're currently operating without auth (in part due to configuration issues), so I don't know how well it will scale.

The other aspect here is that in current configurations, every "run" has a different machine configuration based on what you request from the job scheduler and what you actually get. We pretty much get different IP addresses every time, with different front ends, different file servers, etc.

Again though - the idea is to use file systems more pervasively within the applications as well -- so there may be multiple file servers per node providing different services depending on workload needs at the particular point of computation. Read our MTAGS paper from last year's supercomputing conference to get a bigger-picture view of how we see services coming, going, migrating, and adapting to changing application usage and failure.

-eric