Free Republic
Browse · Search
General/Chat
Topics · Post Article

Waiting to exascale: Now that IBM has Summit-ed, who's to node what comes next?
The Register ^ | 14 June 2018 | Chris Mellor

Posted on 06/14/2018 11:04:26 AM PDT by ShadowAce

IBM's 200 petaFLOPS (200,000 trillion calculations per second) Summit supercomputer was unveiled at Oak Ridge National Laboratory last Friday and, scaled up, has proven itself capable of exascale computing in some applications.

That's 1,000 petaFLOPS or one quintillion floating point operations per second.

In comparison, the Cray/Intel Aurora supercomputer project clocked in at 180 petaFLOPS with 50,000 x86 nodes, interconnected with 200Gbit/s OmniPath 2.

These nodes were supposed to be augmented with Intel's Knights Hill version of its multicore Phi co-processor. However, the Knights Hill development was canned in November 2017. Aurora has given way to Aurora 2, due for delivery in 2021, which should be an exascale system with redesigned Phi processors.

The US Department of Energy is part-funding the development of exascale computers through its Coral-2 programme (Coral being "Collaboration of Oak Ridge, Argonne and Livermore", three national labs). The original Coral programme generated the Aurora and Summit systems, the latter of which was kicked off by IBM in 2014. Cray and Intel were awarded $200m in April 2015 to build Aurora. Though Aurora was due to be delivered this year, the failed co-processor design meant that wasn't possible.

Six bidders – AMD, Cray, HPE, IBM, Intel and Nvidia – were invited by the DoE to respond to a Coral-2 request for proposals and some or all did so by May 24. The individual bidders have not been revealed and the bids are being evaluated.

There are three server/HPC system builders – Cray, HPE and IBM – and three processor/co-processor vendors – AMD, Intel and Nvidia.

We may assume Cray/Intel are bidding for the Aurora follow-on (called A21), and we looked at aspects of an HPE exascale system, suggesting an HPE/AMD partnership might be feasible.

The Summit reveal provided hints about an IBM exascale system and that's what we're going to dig into.

Summit nodes

Summit has just 4,608 nodes, which are more powerful than Aurora's x86 ones, interconnected with dual-rail 100Gbit/s EDR InfiniBand. As Nicole Hemsoth pointed out at our sister publication, The Next Platform, the system also has far fewer nodes than the 18,688 of its Oak Ridge neighbour and previous US supercomputing speed record-holder, Titan, but nevertheless "deliver[s]... 5X to 10X more performance while only increasing power consumption from nine to 13 megawatts."

Each node, basically an AC922 server, has two 3.1GHz 22-core Power9 CPUs and six Tesla V100 GPUs, connected by NVLink 2. There is 1.6TB of memory per node.

The nodes are interconnected with Mellanox dual-rail EDR 100Gbit/s InfiniBand links, 200Gbit/s per node.

There is more than 10PB of main Summit memory and it uses IBM's Spectrum Scale filesystem, initially with about 3PB capacity and 30 GB/sec bandwidth. These numbers will rise to 250PB, 2.5TB/sec sequential and 2.2TB/sec random IO. Peak power usage is 13MW.

HPE has mentioned that exascale computers could have tens of thousands, if not hundreds of thousands, of nodes. But not if Summit could be scaled to an exaFLOPS machine – meaning a fivefold increase in performance.

That would mean 23,040 nodes using the current Power9/6xGPU node setup.
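The 23,040 figure follows from simple linear scaling, which can be checked with a short sketch. It assumes, as the article does, that performance grows in direct proportion to node count; the function name and constants below are illustrative, with the numbers taken from the article.

```python
# Figures quoted in the article; linear scaling is an assumption.
SUMMIT_PFLOPS = 200        # Summit peak, in petaFLOPS
SUMMIT_NODES = 4608        # Summit node count
EXAFLOPS_IN_PFLOPS = 1000  # 1 exaFLOPS = 1,000 petaFLOPS


def nodes_for_target(target_pflops, base_pflops=SUMMIT_PFLOPS,
                     base_nodes=SUMMIT_NODES):
    """Nodes needed to hit target_pflops if each node performs like a
    current Summit node (integer ceiling division avoids float error)."""
    return -(-(target_pflops * base_nodes) // base_pflops)


if __name__ == "__main__":
    print(nodes_for_target(EXAFLOPS_IN_PFLOPS))
```

Real systems rarely scale this cleanly, so treat the result as a ceiling-of-the-envelope estimate rather than a design figure.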

But Nvidia has moved on, having announced its HGX-2 2 petaFLOPS GPU grunt box with 16 Tesla V100s, the latest GPU architecture, connected using six NVSwitches. And there may well be a Volta GPU follow-on, with Ampere and Turing names floating around.

IBM is moving on too, developing a POWER10 CPU that is due to arrive in 2020. It could have 48 cores and support a faster NVLink 3 interconnect. The Coral-2 systems are meant to be deliverable from 2021.

Mellanox is developing an NDR 400Gbit/s InfiniBand switching interconnect.

Could Spectrum Scale have its performance pushed higher? There's no reason to doubt that.

Join the dots

Let's suggest a scaled-up Summit node using POWER10 CPUs, souped-up Nvidia GPUs with faster NVLink, NDR InfiniBand internode links, and a faster/larger Spectrum Scale could provide a pathway to exascale with fewer than 23,040 nodes.

If we scale up Summit nodes 2.5x in performance, using these technologies, then 9,216 of them would get us to 1 exaFLOPS. There's a 40MW limit for power consumption and scaling up Summit power usage by 2.5x gets us to 32.5MW (this is a simplistic extrapolation as power usage in general is directly proportional to number of nodes/performance if the node tech is the same). An enticing prospect.
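The arithmetic in that paragraph can be reproduced with a small sketch. It mirrors the article's own simplistic model: node count falls in proportion to the per-node speedup, and total power is Summit's 13MW multiplied by the same factor. The function name and defaults are hypothetical, and the model is an assumption rather than a power estimate for any real design.

```python
import math
from fractions import Fraction


def scaled_estimate(node_speedup, target_pflops=1000, base_pflops=200,
                    base_nodes=4608, base_power_mw=13, power_cap_mw=40):
    """Return (nodes needed, estimated power in MW, within power cap?).

    Assumes linear scaling throughout, as in the article's extrapolation.
    """
    speedup = Fraction(node_speedup)  # exact arithmetic, e.g. "2.5" -> 5/2
    nodes = Fraction(target_pflops * base_nodes, base_pflops) / speedup
    power_mw = base_power_mw * speedup
    return math.ceil(nodes), float(power_mw), power_mw <= power_cap_mw


if __name__ == "__main__":
    print(scaled_estimate("2.5"))
```

Passing the speedup as a string keeps the fraction exact; changing it shows how quickly the node count and the headroom under the 40MW limit move.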

There are other technologies that could help, such as high-bandwidth memory and storage-class memory. As long as these don't need application software rewrites, they could go into the mix too.

HPE's Machine-based exascale technology set is adventurous and exciting – for HPE. A scaled-up Summit is basically more of the same – less adventurous perhaps but possibly a safer bet.

Cray/Intel's Aurora A21 looks to be as risky as HPE's system, because Intel co-processor development has stalled – perhaps it will use its in-development GPUs – and Xeon under-performance, compared to POWER10, would necessitate many tens of thousands of nodes with unproven co-processor/GPU technology.

Big Blue could stalk the processor design halls in triumph: Yeah, Xeon. x86? More like ex-86. Feel the POWER, etc. etc. ®



TOPICS: Computers/Internet
KEYWORDS: exascale; summit

1 posted on 06/14/2018 11:04:26 AM PDT by ShadowAce
[ Post Reply | Private Reply | View Replies]

To: rdb3; Calvinist_Dark_Lord; JosephW; Only1choice____Freedom; amigatec; Ernest_at_the_Beach; ...

2 posted on 06/14/2018 11:05:01 AM PDT by ShadowAce (Linux - The Ultimate Windows Service Pack)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

Lotus Notes still won’t work right.


3 posted on 06/14/2018 11:06:12 AM PDT by wally_bert (This is the message phone company. I see you’re using our unit, now how about paying for it?)
[ Post Reply | Private Reply | To 2 | View Replies]

… 200,000 trillion calculations per second …
Isn’t that 200 quadrillion calculations per second? Or are we going by the British trillion?
4 posted on 06/14/2018 11:09:15 AM PDT by Olog-hai ("No Republican, no matter how liberal, is going to woo a Democratic vote." -- Ronald Reagan, 1960)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

Nano-Nano.....................


5 posted on 06/14/2018 11:10:40 AM PDT by Red Badger (When Obama and VJ go to prison for treason, will Roseanne get her show back?...)
[ Post Reply | Private Reply | To 2 | View Replies]

To: ShadowAce

Sweet! Now the NSA will be able to spy on the whole world in real-time!


6 posted on 06/14/2018 11:15:31 AM PDT by MeganC (There is nothing feminine about feminism.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

I’m only moderately impressed when computers are made faster by creating a larger “committee” of individual processors.

The challenge with these committees is connecting the individual processors together.


7 posted on 06/14/2018 11:15:32 AM PDT by cymbeline
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

ElectricPencil screams on this baby.


8 posted on 06/14/2018 11:16:35 AM PDT by Joe Bfstplk (A Texas Deplorable.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

It’s been more than 15 years since I have used an IBM product. Heck, I thought they had nearly vanished.


9 posted on 06/14/2018 11:16:52 AM PDT by CodeToad
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

My first and only computer that I learned to program FORTRAN 1962

The following is the text of an IBM Data Processing Division press technical fact sheet distributed on October 4, 1960.
The solid-state IBM 7090 is the most powerful data processing system now coming off production lines at International Business Machines Corporation. The fully-transistorized system has computing speeds six times faster than those of its vacuum-tube predecessor, the IBM 709, and seven and one-half times faster than those of the IBM 704. Announced in December, 1958, the first 7090 was installed in December, 1959.


11 posted on 06/14/2018 11:22:26 AM PDT by larryjohnson (FReepersonaltrainer)
[ Post Reply | Private Reply | To 1 | View Replies]

To: CodeToad

I still have a mid 1990’s vintage IBM desktop. Not windows/I10 worthy but still runs early Excel, the grandson’s games plus picture storage. With Chinese espionage, I would not trust products from Lenovo, who took over IBM’s personal computer business. IBM has always been strong on systems, just not consumer products.


12 posted on 06/14/2018 11:33:49 AM PDT by buckalfa (I was so much older then, but I'm younger than that now.)
[ Post Reply | Private Reply | To 9 | View Replies]

To: ShadowAce
It's a SNA world after all!
It's a SNA world after all!
It's a SNA world after all!
It's a SNA...SNA...world!!!

What does Netview say about it (while running on my TN3270 emulator to pull VTAM statistics)?

13 posted on 06/14/2018 11:35:50 AM PDT by Dubh_Ghlase
[ Post Reply | Private Reply | To 1 | View Replies]

To: cymbeline
The article does say that 1) Summit has far fewer nodes than its neighbor, Titan, and 2) Mellanox is working on 400Gbit/s IB fabric for the next generation.

Not only are these machines getting more powerful, they're getting smaller.

14 posted on 06/14/2018 11:39:34 AM PDT by ShadowAce (Linux - The Ultimate Windows Service Pack)
[ Post Reply | Private Reply | To 7 | View Replies]

To: larryjohnson

Brings back memories. I started with Cobol and Autocoder on the 7010 in the late 60’s just before the 360’s came in.


15 posted on 06/14/2018 12:00:45 PM PDT by duckman ( Not tired of winning!)
[ Post Reply | Private Reply | To 11 | View Replies]

To: ShadowAce

Bookmark


16 posted on 06/14/2018 12:02:34 PM PDT by publius911 ( If we let it, California will lead us all over the cliff.)
[ Post Reply | Private Reply | To 14 | View Replies]

To: duckman

I started with FORTRAN IV on a Cray 6400, moved to a Cray 6600, then 7600. This was 1966 to 1973.


17 posted on 06/14/2018 12:09:46 PM PDT by FroggyTheGremlim (Democrats: the political party of the undeadD)
[ Post Reply | Private Reply | To 15 | View Replies]

To: ShadowAce

“Not only are these machines getting more powerful, they’re getting smaller.”

Smaller equals faster — yes.

Are they running at higher clock speeds? I.e. are gate delays getting smaller?


18 posted on 06/14/2018 12:45:36 PM PDT by cymbeline
[ Post Reply | Private Reply | To 14 | View Replies]

To: ShadowAce

bump


20 posted on 06/14/2018 12:57:14 PM PDT by Albion Wilde (Trump is a real estate genius because he lives rent-free in so many heads.)
[ Post Reply | Private Reply | To 1 | View Replies]


Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.
