Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

Japanese Start Up Another Supercomputer
The Register ^ | 17th February 2012 23:12 GMT | Timothy Prickett Morgan -

Posted on 02/18/2012 7:49:56 PM PST by Ernest_at_the_Beach

Upstart supercomputer maker Appro International has started up another Xeon E5-based supercomputer, this one a hybrid CPU-GPU Xtreme-X machine installed at the University of Tsukuba in Japan.

The deal at Tsukuba was announced last September at about the same time that Intel was widely expected to launch the "Sandy Bridge-EP" Xeon E5 processors, which are now due for launch sometime before the end of the first quarter. It is not clear how Appro was able to jump to the head of the Xeon E5 line, but it is worth noting that Appro now uses Intel motherboards as well as processors in its Xtreme-X clusters.

Appro Tsubuka2 supercomputer

The University of Tsukuba's "Frontier" CPU-GPU supercomputer

The Tsukuba machine is nicknamed "Frontier" but officially called the much more boring HA-PACS, short for Highly Accelerated Parallel Advanced system for Computational Sciences. The feeds and speeds of the 802 teraflops Frontier machine were a little bit vague back in September, but the university confirmed to El Reg that it is comprised of 288 server nodes using the eight-core variants of the forthcoming Xeon E5.

Each node has two sockets and 128GB of main memory, for a total of 36TB of memory on the CPU side, and each also has four of Nvidia's Tesla M2090 server-cooled, fanless GPU coprocessors, with each GPU being equipped with 6GB of its own GDDR5 graphics memory. In terms of peak theoretical capacity, only about 11 percent of the aggregate performance of the cluster comes from the Sandy Bridge processors, with the rest coming from the Tesla GPUs.

In real-world situations thus far, however, hybrid supers have had very poor efficiency because of the trickiness of the links between the CPUs and the GPUs. Because the Xeon E5s support PCI-Express 3.0 peripheral slots – and do so right on the CPU chip itself – there is every reason to believe that the doubling of bandwidth between the GPU and CPU will help all hybrid Xeon-Tesla machines get closer to their peak theoretical performance, but we won't know until the Xeon E5 machines are out and supercomputer centers start conducting their tests and publishing their results.

While efficiency is important, so is the fact that by going heavy on the GPUs, the Frontier cluster fits in 26 racks and only burns 400 kilowatts of juice. That works out to about 2,005 megaflops per watt peak. By comparison, IBM's most efficient supercomputer – the BlueGene/Q massively parallel CPU beast launched last fall at SC11 – is designed to get 20 petaflops of peak performance out of 6.6 megawatts, or about 3,030 megaflops per watt.

Without the GPUs, x86 CPU clusters can't even come close to BlueGene/Q, but with them (as was the case with the hybrid Opteron-Cell "Roadrunner" super built by IBM for Los Alamos National Lab), hybrids can get better performance per watt. And with PCI-Express 3.0, you can hang four GPUs off a two-socket server and have enough bandwidth to talk between the two compute engines.

The Frontier cluster is configured with a dual-rail Quad Data Rate (QDR) InfiniBand network comprised of ConnectX-3 network interface cards from Mellanox on the servers, and Mellanox QDR switches lashing the machines together. The nodes are configured in a fat tree configuration. The servers are linked to a little more than a half petabyte of S2A storage arrays from DataDirect Networks.

As an adjunct to the cluster, the university is working on its own direct interconnect electronics to let the GPUs in the cluster to talk directly with each other and share data without having to bother the CPU. Tsukuba techies have cooked up a little something called the PCI Express Adaptive Communication Hub, or PEACH, which acts as a PCI controller linking the GPUs to each other. Boffins are at work on an improved PEACH2 hub, which is thrust onto an FPGA, and which will make use of Nvidia's GPU-Direct protocol to create what is in effect a GPU switch.

"The largest issue in accelerated computing is how to fill the gap between its powerful internal computation performance and relatively poor external communication performance," Taisuke Boku, deputy director of the Center for Computational Sciences at the university, explained in an email to El Reg. "To overcome this problem, we need to develop various new algorithms, shifting from traditional ones for our target applications. In some applications, we may need a paradigm shift from scratch toward a new generation of algorithms. HA-PACS will be the testbed for developing of these algorithms. For this purpose, we need to use the system constantly in large scale for selected applications."

The PEACH2 GPU-PCI switches (well, that is essentially what they are) will eventually be plugged into a bunch more nodes, and these will be plugged into the Frontier machine sometime in early 2013, adding something on the order of 200 teraflops and pushing the machine up above the teraflops barrier. These lessons learned from the GPU direct connections will lay the groundwork for exascale systems, the university hopes. ®


TOPICS: Computers/Internet
KEYWORDS: hitech
Title is from HardOCP...
1 posted on 02/18/2012 7:50:00 PM PST by Ernest_at_the_Beach
[ Post Reply | Private Reply | View Replies]

To: ShadowAce

fyi


2 posted on 02/18/2012 7:51:08 PM PST by Ernest_at_the_Beach ( Support Geert Wilders)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach
http://www.codesimian.com/burningTerminatorHead.jpg

Can't see this going wrong.

3 posted on 02/18/2012 8:28:43 PM PST by KC_Lion (I will NEVER vote for Romney, the GOP will go the way of the Whigs if they nominate him)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

Yes but can it prevent Win-dohs from crashing?


4 posted on 02/18/2012 8:36:08 PM PST by GrandJediMasterYoda (How ironic that Ann Coulter should write a book called Treason.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: GrandJediMasterYoda
LOL!

I don't think Win-dohs plays in this league.

5 posted on 02/18/2012 8:57:33 PM PST by Ernest_at_the_Beach ( Support Geert Wilders)
[ Post Reply | Private Reply | To 4 | View Replies]

To: Ernest_at_the_Beach

That picture reminds me of my old TRS80 Model 1. I miss it.


6 posted on 02/18/2012 9:38:27 PM PST by lwoodham (Time is what keeps everything from happening all at once.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

Yeah, but the old Crays just looked so much more badass.

7 posted on 02/18/2012 9:50:40 PM PST by martin_fierro (< |:)~)
[ Post Reply | Private Reply | To 1 | View Replies]

To: All
The funds raised in these FReepathons go to pay our current quarter expenses. But we're also going to try to replace some of our older servers and failing equipment this year so we're going to add a little extra to our FReepathon goals. John is estimating ten to fifteen thousand to do this and I'd like to get it all in place and working before the election cycle is fully heated up, so we'll try to bring in a little extra now, if we can, and the rest next quarter.

Jim Robinson



Click to Donate!

8 posted on 02/18/2012 10:17:40 PM PST by onyx (SUPPORT FREE REPUBLIC, DONATE MONTHLY. If you want on Sarah Palin's Ping List, let me know.)
[ Post Reply | Private Reply | To 7 | View Replies]

To: Ernest_at_the_Beach

WOW!


9 posted on 02/18/2012 10:18:24 PM PST by onyx (SUPPORT FREE REPUBLIC, DONATE MONTHLY. If you want on Sarah Palin's Ping List, let me know.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Ernest_at_the_Beach

Put this thing in a recording studio or video suite and watch them eat up it’s processing power and then start bitching about “no one ever building anything that’s got a ‘pair’.”


10 posted on 02/19/2012 4:51:04 AM PST by TalBlack ( Evil doesn't have a day job.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: rdb3; Calvinist_Dark_Lord; Salo; JosephW; Only1choice____Freedom; amigatec; stylin_geek; ...

11 posted on 02/20/2012 5:10:57 AM PST by ShadowAce (Linux -- The Ultimate Windows Service Pack)
[ Post Reply | Private Reply | To 1 | View Replies]

To: rdb3; Calvinist_Dark_Lord; Salo; JosephW; Only1choice____Freedom; amigatec; stylin_geek; ...

12 posted on 02/20/2012 5:11:33 AM PST by ShadowAce (Linux -- The Ultimate Windows Service Pack)
[ Post Reply | Private Reply | To 1 | View Replies]

To: lwoodham

They have the look of a SGI bank of servers.


13 posted on 02/20/2012 5:26:10 AM PST by bmwcyle (I am ready to serve Jesus on Earth because the GOP failed again)
[ Post Reply | Private Reply | To 6 | View Replies]

To: Ernest_at_the_Beach

Finally! Something fast enough to stream all that anime porn!


14 posted on 02/20/2012 5:30:37 AM PST by Egon (The difference between Theory and Practice: In Theory, there is no difference.)
[ Post Reply | Private Reply | To 1 | View Replies]

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson