Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

Oak Ridge lab: Behold, I Am TITAN, hear my 20 petaflop ROAR
The Register ^ | 29th October 2012 15:03 GMT | Timothy Prickett Morgan

Posted on 10/30/2012 1:45:12 PM PDT by Ernest_at_the_Beach

<

Oak Ridge National Laboratories is really eager to talk about the "Titan" supercomputer, which weighs in at more than 20 petaflops of performance and which has just been activated this month. In fact it's too eager to wait until the SC12 supercomputing trade show in November to give out all the feeds and speeds. Cray and Nvidia, which want to sell baby clones of Titan as fast as they can, are similarly impatient. And so, with Titan actually up and running, they are unveiling Titan super now rather than later.

The top techies from Cray, Nvidia, and Oak Ridge were in the Big Apple ahead of today's launch and discussed the new machine over breakfast with El Reg. And you can understand the eagerness of Oak Ridge to talk about the big, bad box, which the US Department of Energy hopes will facilitate significant breakthroughs in research in the physics, combustion, materials science, nuclear energy, and combustion.

Titan is not a brand new machine, but is rather an upgrade of the existing "Jaguar" supercomputer at Oak Ridge, which is a tricky bit of work to performance as researchers are continuing to run jobs on the system.

Jaguar came onto the scene in the fall of 2009 and was based on Cray's XT5 systems using the "SeaStar+" interconnect across its nodes. Jaguar was the top-ranked machine on the November 2009 and June 2010 editions of the Top 500 supercomputer rankings, with 2.33 petaflops of peak theoretical performance. After an upgrade earlier this year - moving to the "Gemini" XE interconnect and the sixteen-core "Interlagos" Opteron 6274 processors in half the sockets in the nodes and 960 of Nvidia's Tesla M2090 GPU coprocessors - the Jaguar system was upgraded to 298,592 x86 cores and a total 2.63 petaflops across those CPU and GPU processors.

The Jaguar and Titan machines both have 18,688 compute nodes, but with the Titan machine, each sixteen-core Opteron compute node is getting twice as much main memory (32GB per compute node) and is also being paired up with a Tesla K20 "Kepler" GPU coprocessor (which has 6GB of its own GDDR5 memory). The Titan machine has a total of 299,008 Opteron cores, and total system memory is boosted to 710TB across this system (up from 300TB in Jaguar), and the number of login and I/O nodes has also been doubled to 512 nodes.

With all of those Tesla K20 coprocessors, the total number-crunching capacity of the machine is expected to be more than 20 petaflops, says Jeffrey Nichols, associate laboratory director at Oak Ridge, which means that the K20 GPU coprocessors are delivering at least 17.4 petaflops of that raw computing performance since the Opteron side of the nodes is delivering around 2.63 petaflops.

Oak Ridge National Laboratory in Tennessee

Nichols says that Oak Ridge started working on the design of the Titan machine back in the spring 2009, and that is one of the reasons why the lab is so eager get its hands on Titan and put it through its paces. The machine is being put through its reliability and stability acceptance tests now, which is why formal performance specs for the box have not been released yet. (You can bet there will be a Linpack result for the machine to be included in the November 2012 Top 500 list, which will come out during the SC12 event.)

The computing side of the Jaguar machine burned 6.95 megawatts to deliver that 2.33 petaflops of peak performance, which came in at 1.76 petaflops on the Linpack parallel Fortran benchmark test. Just based on peak performance, that works out to 335.4 megaflops per watt. To ramp the Jaguar machine up to more than 20 petaflops just using Opteron processors and the Gemini interconnect, if you could do it, would consume an estimated 30 megawatts and take more than 800 cabinets. This is obviously not feasible, even for Oak Ridge, which buys its juice from the local Tennessee Valley Authority.

The Titan supercomputer at Oak Ridge

The Titan supercomputer at Oak Ridge

Nvidia has not given out precise performance specs for the K20 GPU coprocessor, except to say it will have more than 1 teraflops of double-precision oomph. So it is hard to say what the power efficiency of the Titan supercomputer would be. All Nichols would say is that the machine would deliver "well north" of 20 petaflops. What Nvidia did say in its announcement for Titan (but not during the meeting with El Reg) is that the new super is more than 10 times faster and is five times more energy efficient than Jaguar. To get to that 10X raw performance, the K20 GPU has to deliver around 1.1 teraflops of double precision performance.

With the Titan machine burning 9 megawatts of juice, according to Oak Ridge, this machine will offer a considerably improved 2,576 megaflops per watt if it reaches 23.1 petaflops of performance. Our math shows higher performance per watt than Nvidia's press release. Go figure.

The important thing as far as Cray, Nvidia, and Oak Ridge are concerned is that this performance is on par with the power efficiency of the BlueGene/Q PowerPC-based massively parallel supercomputer from IBM, which has been the most efficient super available up until now. However, until Linpack is run and we see how the architectural improvements in the K20 GPU coprocessors, such as dynamic parallelism and Hyper-Q, pan out we won't be able to see which type of machine, BlueGene or x86-Tesla hybrid, has the edge in power efficiency on Linpack or other workloads.

Up until now, GPU-accelerated machines have been able to deliver cheaper flops and very good bang for the buck, too, but a lot of the raw computing capabilities of the GPU were not used by applications and the power efficiency of ceepie-geepie hybrids was not spectacular.

The storage side of the Titan machine will be upgraded as well, says Nichols, with the current setup having 10PB of storage and about 280GB/sec of aggregate I/O bandwidth coming out of the storage into the supercomputer. The RFP for the future storage system to match Titan has just been sent out, according to Nichols, with Oak Ridge expecting to double up or triple up the storage capacity and pushing up to 1 TB/sec of aggregate bandwidth into the system.

Nichols says that the role of flash memory in the future file system, if any, has not yet been determined, but says that obviously by putting high speed flash right into compute nodes, you could dramatically speed up checkpointing operations on clusters because of the higher I/O rates flash have over disk storage. With checkpointing, you are saving a state of a node and its work at a given recovery point. You don't need to store the raw data there, which is on the parallel disk array.

Next page: The push to exascale and out to zettascale


TOPICS: Computers/Internet; Science
KEYWORDS: hitech

1 posted on 10/30/2012 1:45:14 PM PDT by Ernest_at_the_Beach
[ Post Reply | Private Reply | View Replies]

To: ShadowAce; SunkenCiv

ping


2 posted on 10/30/2012 1:46:05 PM PDT by Ernest_at_the_Beach ((The Global Warming Hoax was a Criminal Act....where is Al Gore?))
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

3 posted on 10/30/2012 1:48:31 PM PDT by abb ("What ISN'T in the news is often more important than what IS." Ed Biersmith, 1942 -)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

Want.

Can’t afford....


4 posted on 10/30/2012 1:48:41 PM PDT by Vendome (Don't take life so seriously, you won't live through it anyway)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

Soon to be available for every 7 year old in PlayStation 7 by XMAS 2017!


5 posted on 10/30/2012 1:50:50 PM PDT by Cvengr (Adversity in life and death is inevitable. Thru faith in Christ, stress is optional.)
[ Post Reply | Private Reply | To 2 | View Replies]

To: Ernest_at_the_Beach
And you can understand the eagerness of Oak Ridge to talk about the big, bad box, which the US Department of Energy hopes will facilitate significant breakthroughs in research in the physics, combustion, materials science, nuclear energy, and combustion.

I'm sorry, Dave. I'm afraid I can't do that.

6 posted on 10/30/2012 1:51:37 PM PDT by Moonman62 (The US has become a government with a country, rather than a country with a government.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Moonman62
And you can understand the eagerness of Oak Ridge to talk about the big, bad box, which the US Department of Energy hopes will facilitate significant breakthroughs in research in the physics, combustion, materials science, nuclear energy, and combustion.

I see the folks at Oak Ridge are already starting on their redundancy protocol... That, or the writer REALLY likes combustion.

7 posted on 10/30/2012 3:02:44 PM PDT by kingu (Everything starts with slashing the size and scope of the federal government.)
[ Post Reply | Private Reply | To 6 | View Replies]

To: Vendome
Want. Can’t afford....

I'm happy with what I have, whatever it is. Check this ( with cygwin ):

$ time factor 431238384397394731494949493865668771 431238384397394731494949493865668771: 22175407456858039 19446694958654684789

real 5m47.188s
user 5m47.071s
sys 0m0.046s

Amazing! Incredible! Beyond MY wildest imagination, I can assure you.

8 posted on 10/30/2012 3:30:17 PM PDT by dr_lew
[ Post Reply | Private Reply | To 4 | View Replies]

To: dr_lew

Sigh. How about:

$ time factor 431238384397394731494949493865668771
431238384397394731494949493865668771: 22175407456858039 19446694958654684789

real 5m47.188s
user 5m47.071s
sys 0m0.046s

( The preview took a “natural” line break after the argument of “factor”. )


9 posted on 10/30/2012 3:42:17 PM PDT by dr_lew
[ Post Reply | Private Reply | To 8 | View Replies]

To: Ernest_at_the_Beach

Now *that’s* a computer. ;’)


10 posted on 10/30/2012 4:17:44 PM PDT by SunkenCiv (https://secure.freerepublic.com/donate/)
[ Post Reply | Private Reply | To 2 | View Replies]

To: SunkenCiv
the sixteen-core "Interlagos" Opteron 6274 processors

Going to check if that is what Microcenter & Newegg have.

Here we go:

AMD Opteron 6274 Interlagos 2.2GHz 16MB L3 Cache Socket G34 115W 16-Core Server Processor OS6274WKTGGGUWOF

$599.99

11 posted on 10/30/2012 5:44:22 PM PDT by Ernest_at_the_Beach ((The Global Warming Hoax was a Criminal Act....where is Al Gore?))
[ Post Reply | Private Reply | To 10 | View Replies]

To: dr_lew

What are we calculating ?


12 posted on 10/30/2012 5:48:42 PM PDT by Ernest_at_the_Beach ((The Global Warming Hoax was a Criminal Act....where is Al Gore?))
[ Post Reply | Private Reply | To 9 | View Replies]

To: SunkenCiv
Microcenter has the Interlagos :

Opteron 6272 2.1GHz Socket G34 Processor OS6272WKTGGGUWOF

$519.99

13 posted on 10/30/2012 6:00:35 PM PDT by Ernest_at_the_Beach ((The Global Warming Hoax was a Criminal Act....where is Al Gore?))
[ Post Reply | Private Reply | To 10 | View Replies]

To: SunkenCiv
And we have:

Socket G34 Motherboards at your Orange County/Tustin Micro Center

********************************************


14 posted on 10/30/2012 6:06:01 PM PDT by Ernest_at_the_Beach ((The Global Warming Hoax was a Criminal Act....where is Al Gore?))
[ Post Reply | Private Reply | To 13 | View Replies]

To: Cvengr

>>Soon to be available for every 7 year old in PlayStation 7 by XMAS 2017!

But you won’t be able to play PS6 games on it either.

or PS5, 4, 3, 2 etc.


15 posted on 10/30/2012 8:25:32 PM PDT by Only1choice____Freedom (FDR had the New Deal. President 0bama has the Raw Deal.)
[ Post Reply | Private Reply | To 5 | View Replies]

To: Ernest_at_the_Beach

“factor x” gives the prime factors of an integer, “x”.
For example:

$ factor 15
15: 3 5

$ factor 221
221: 13 17

$ factor 8192
8192: 2 2 2 2 2 2 2 2 2 2 2 2 2

... Of course, these answers, and much larger ones, are given instantly. “factor” is an original UNIX command, but older implementations would respond “Ouch!” when given values of x with greater than 15 or so digits, so ...

factor 431238384397394731494949493865668771

... is a brave new world for me to have at my fingertips, even compared to a few years ago, let only the 1960’s and 70’s !


16 posted on 10/30/2012 8:40:25 PM PDT by dr_lew
[ Post Reply | Private Reply | To 12 | View Replies]

To: dr_lew

Of course, “let only the 1960’s and 70’s” s/b “let alone the 1960’s and 70’s”


17 posted on 10/30/2012 8:46:58 PM PDT by dr_lew
[ Post Reply | Private Reply | To 16 | View Replies]

To: Ernest_at_the_Beach

But can it sing, “Daisy” ?


18 posted on 10/30/2012 10:54:11 PM PDT by fieldmarshaldj (Resist We Much)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach
total system memory is boosted to 710TB

Yup, that Titan sucker is aware.

19 posted on 10/30/2012 11:09:08 PM PDT by The Cajun (Sarah Palin, Mark Levin......Nuff said.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach
Come on, Ernest. You're on my ping list. I know you got this ping.
20 posted on 10/31/2012 2:24:40 AM PDT by ShadowAce (Linux -- The Ultimate Windows Service Pack)
[ Post Reply | Private Reply | To 1 | View Replies]

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson