PDA

View Full Version : Concept: A Cluster faster than Big Mac


Plague Bearer
11-08-2003, 03:29 PM
This was spurred on by a recent commentary over at Mac OS Rumors about Apple putzing around with the latest Firewire 1600 and 3200 chipsets.

The ingredients of this speculation are:
The inevitable availability of Firewire 1600 and 3200
TCP/IP over firewire available in Panther now. It's in Networking under Port configurations. Select NEW and look at the list. For those who didn't notice.

One of the things that slowed the installation of the Big Mac Cluster was the team had to install 1100 2Mb infiniband fiber channel network cards because 1000 Base T wasn't fast enough. This ate up several days in hardware installation not to mention the cost of additional hardware. One other issue to be aware of was that the fiber channel cards were optimized in a way that nearly saturated the PCI-X slot they were plugged into. A potential bottleneck in the future.

Disclaimer: I, not being familiar with fiber channel, have no Idea of what kind of throughput is possible with fiber channel. Are there faster flavors of it available?

Now let the "What if's" begin.

If Firewire 1600 is 50% faster than 1000 Base T and Firewire 3200 is 60% faster than 2Mb fiber channel. What kind of performance would this kind of network fabric offer? What sort of network topology would be necessary to prevent any one node from saturating the 400MB/s limit of Firewire 3200 or 200MB/s of Firewire 1600.

One downside would be the lack of firewire switches for this kind of network. Now there's an idea for a startup business.

Have fun.

Discuss.

G-News
11-08-2003, 03:46 PM
there's more to fiberchannel than just pure bandwidth.
Plus, they used infiniband interconnects, NOT Fiberchannel.
Infiniband is again a LOT faster than fiberchannel (talking about several GB/sec being moved, not just gbit/sec).
I don't know what latencies firewire 3200 would have, but I doubt it's anywhere near infiniband or similar clustering interfaces.

kupan787
11-08-2003, 04:09 PM
Also, Apple needs to get a model with ECC ram and then they could be taken a bit more seriously (or so that is my unerstanding). some have speculated that a G5 xServe will have ECC ram...

Flounder
11-08-2003, 05:59 PM
third place seems pretty serious to me.....

I don't know anything about this topic. What is "serious" about this ram?

kupan787
11-08-2003, 06:16 PM
Originally posted by Flounder
third place seems pretty serious to me.....

I don't know anything about this topic. What is "serious" about this ram?

Some quotes from MacNN on the subject of ECC:

G5 has non-ECC memory. Therefore it's possible to occur calculation failure. It's no problem for HPL, because HPL make sure correctness at the end of calculation, but it might be a fatal point in "real" calculation. During HPL tunings, there were some diffect node and pretty difficult to find it(sometimes incorrect calculation, sometimes dead lock of thread ..). It G5 had ECC memory, it would be much easier...

If anything it would slow it down slightly. However, ECC will self-correct some memory errors that non-ECC won't. ie. If there is a silent error, ECC will correct it silently and the calculations will merrily continue, correctly. OTOH, non-ECC memory will simply let the error propagate and the whole calculation will be garbage. This isn't a big deal for a person running 4 GB RAM in Photoshop. Indeed, Dr. V said it's not a problem for their cluster running 4 Terabytes of RAM doing calculations for a few hours either. However, it's potentially a huge deal for somebody running 4 Terabytes of memory in a cluster and trying to do a calculation that's supposed to go on for a week or whatever.

And this, from Dr. Srinidhi Varadarajan, Director of the Terascale Computing Facility at VT:

However, I do agree that lack of ECC is an issue, particularly when our fault tolerance work now enables large runs that can last as long as the application chooses.. We are switching to an ECC platform soon, so this will not be a long-term issue either.

Which sounds like the G5s are a temporary measure. Either VT knows that new machines from Apple are coming with ECC, or they are planning on siwtching to Opterons or something. This seems odd that they would spend $5 million on this setup, and then scrap it shortly after...

G-News
11-08-2003, 06:20 PM
they just bought a cluster for 5.2 million plus infrastructure and are switching to an ECC platform SOON?
WTF?

kupan787
11-08-2003, 07:02 PM
Originally posted by G-News
they just bought a cluster for 5.2 million plus infrastructure and are switching to an ECC platform SOON?
WTF?

That is what I was thinking. To paraprase someone over at macnn, they basicly think this was used to get on the list. Then they will resell the G5s to students, and purchase new G5 xServes which will have ECC support. The infiband and all that shodul just work on the xServes (right?) the only thing would be the racks they use now wouldn't be compatible.

All in all the post by the Director makes absolutly no sense.

Flounder
11-08-2003, 07:52 PM
thanks for the info, Kupan

G-News
11-09-2003, 06:02 AM
I simply don't believe that, I'm sorry.

Chucker
11-09-2003, 07:18 AM
Originally posted by kupan787
To paraprase someone over at macnn, they basicly think this was used to get on the list. Then they will resell the G5s to students, and purchase new G5 xServes which will have ECC support.

:err:
:no:
:rolleyes:

Krassy
11-09-2003, 01:58 PM
Originally posted by G-News
I simply don't believe that, I'm sorry.
i 2nd that.

Powerdoc
11-09-2003, 02:20 PM
Originally posted by Krassy
i 2nd that.

So do I. i doubt that the university will give them more money. If they had one, they will have choosen the Opteron.

I am not a mathematical expert, but there is a way to make software in order to detect error with key control. If they want to have secure calculations they can do it.
Even ECC memory do not prevent all errors, and are not the warrant of a 100 % proof calculation.

kupan787
11-09-2003, 02:44 PM
Originally posted by G-News
I simply don't believe that, I'm sorry.

Do you think I am making it up? Here is the link to the page over at MacNN:

http://forums.macnn.com/showthread.php?s=&threadid=183155&perpage=50&pagenumber=3

It is about half way down, and posted by Eug.

And one more post from over there:

]It probably means they're gonna simply buy Opterons or G5 Xserves or something in the near future.

I'm thinking they got the G5 Power Macs as an interim solution, to get on the list, and because they can be used in a number of different situations. ie. Use them for now, esp. for smaller calculations and then use them in small clusters or as desktops when they get more money to build a dedicated supercluster with ECC-endowed servers, which could in fact be G5 Xserves. It would make sense for Apple to go with ECC, esp. now that VT has proven clustering G5s is a viable option in terms of functionality, provides a good price performance ratio, and that Apple is a good reliable hardware vendor.

The question many are asking is why they didn't go with Linux Opteron right off the bat, but it seems they had issues with supply and support, as well as quoted costs. The G5 was the best solution at the time for their needs, but it's not a perfect solution either. Fortunately for them, they seem to have a lot of money to play with.

Krassy
11-09-2003, 02:55 PM
Originally posted by kupan787
Do you think I am making it up?

not but i don't believe the authors of your mentioned quotes... the only thing i can imagine is that they're switching to newer g5s some day but not earlier than 12 months from now - they bought a whole cooling system and made up these places for the towers.... and they are porting (and already have) their software to mac os x so they won't take opterons for sure. but anyway - i don't know this for sure ... and i don't believe this whole ecc-ram stuff is such a big problem ...

G-News
11-09-2003, 05:50 PM
this "in the near future" may be relative to supercomputer livecycles. In that case "the near future" could also be in 5 years.

I think this is simply a rumormonger going postal.

Stoo
11-09-2003, 06:02 PM
A cluster doesn't need insane years-long uptimes. Physical memory errors are sufficiently rare that 1100 G5s should last a couple of days or weeks. Even if one machine does return incorrect results, that would presumably just show up as an erroneous datapoint, as happens in empirical experiments.

Chagi
11-09-2003, 07:11 PM
Virginia Tech is going to be building a brand new center on campus, with completion around 2005/2006 - Institute for Critical Technology and Applied Science (ICTAS).

So when Dr. Varadarajan is talking about ECC "soon", I think it's logical to assume that they are already beginning to look ahead to the system(s) that will be implemented in the new facility.

Plague Bearer
11-10-2003, 11:02 PM
Originally posted by Stoo
A cluster doesn't need insane years-long uptimes. Physical memory errors are sufficiently rare that 1100 G5s should last a couple of days or weeks. Even if one machine does return incorrect results, that would presumably just show up as an erroneous datapoint, as happens in empirical experiments.


As I understood it, the software error correction they are using just reissues the suspect data packet to a different node for reprocessing.

If one or two RAM errors occurred per week, as I have heard others state, as the "normal frequency of occurrence" I don't see how this could have much effect on the total time of processing on a long run of data.

kupan787
11-11-2003, 12:16 AM
Originally posted by Plague Bearer
As I understood it, the software error correction they are using just reissues the suspect data packet to a different node for reprocessing.

If one or two RAM errors occurred per week, as I have heard others state, as the "normal frequency of occurrence" I don't see how this could have much effect on the total time of processing on a long run of data.

All I know is that over at Ars, people were up in arms about the fact VT was using a machine with no ECC. I heard stories that the whole cluster couldn't be used for more than an hour before an error in memory would screw up teh calculation. However, smaller segments running would be fine (say 25-50 machines out of the 1100).

G-News
11-11-2003, 03:52 AM
Well, the battlefront in Ars is often more flamed than factual.
I don't think Varadarajanrananan or whatever his wicked name is would have bought an ECC-less system, if he wouldn't have a solution for the issue.

Krassy
11-11-2003, 03:55 AM
Originally posted by kupan787
All I know is that over at Ars, people were up in arms about the fact VT was using a machine with no ECC. I heard stories that the whole cluster couldn't be used for more than an hour before an error in memory would screw up teh calculation. However, smaller segments running would be fine (say 25-50 machines out of the 1100).

as i said - i don't believe this... this would be a waste of money for VT. do you think they're that stupid? me not.

machem
11-11-2003, 11:19 AM
Do you guys know how people use clusters? Any idea?

Number one, most software includes some sort of error-correction, whether it is "dumb" (relying on previous-step data) or "smart" (self-checking algorithms which are independent of memory). ECC is nice, but gravy.

Second, noone is going to hog all 2200 procs at once. Batching, queuing and priorities all factor in here. What Big Mac does is allows more people to do more work, all at once.

Third, I guarantee the end users won't tolerate this thing being disassembled and re-built, even a year from now. I suspect most end users won't be up to speed on it until well after Christmas. This is even assuming that there are no glitches in s/w implementation, communication and/or training.

Another thing I didn't see mentioned re: Infiniband is its extremely low latency. This is a huge issue. 1000 ether would be fine for 90% of end users if its latency were much less. For communication-critical s/ws like QCD, however, this was a no-brainer, even with the additional time involved. Someone mentioned this additional assembly time as a hassle. Are you kidding me? going from delivery to assembly through testing an ready for real work in what, less than 8 weeks is insane. Simply exemplary. People who build and use clusters are frankly speechless over what VT did, even w/o the cost issues.

kupan787
11-11-2003, 11:27 AM
Originally posted by Krassy
as i said - i don't believe this... this would be a waste of money for VT. do you think they're that stupid? me not.

No I don't. And once again what I have posted (quotes) all came from the MacNN post (which I linked to above). It doesn't make sense to me that they would spend $5 million on a solution that they wouldn't really want just because the opteron solution would have cost more. You get what you need.

But on the other hand, the quote from the Director at VT was on an emailing list, so I really doubt it is fake. Believe what you want about the rest, but if he says they are going to ECC soon, I believe that.

G-News
11-11-2003, 02:14 PM
as I already said, soon in this case most certainly means a couple of YEARS.

Chagi
11-11-2003, 04:38 PM
Originally posted by machem
Someone mentioned this additional assembly time as a hassle. Are you kidding me? going from delivery to assembly through testing an ready for real work in what, less than 8 weeks is insane. Simply exemplary. People who build and use clusters are frankly speechless over what VT did, even w/o the cost issues.

I got a good laugh out of that criticism too. If you're spending millions of dollars on your new toy, is the cost of labour to initially set it up really all that significant?

machem
11-12-2003, 01:00 AM
Originally posted by Chagi
I got a good laugh out of that criticism too. If you're spending millions of dollars on your new toy, is the cost of labour to initially set it up really all that significant?

Time, not labor. Labor was free. And gladly given, I'm sure. ;)

Programmer
11-12-2003, 10:39 AM
Of course given the visibility that this has given Apple, I'm sure that they've invited their new favourite professor over to their labs to look at the new upcoming toys. Given the rumoured Xgrid as well, my guess is that they've been planning server machines (i.e. Xserves) which are G5-based, use ECC memory, and can use InfiniBand. If these are coming any time in the next 6 months I think this qualifies as "soon", and that could mean 90nm 970s... at higher clock rates.

Technology marches on, and staying in the top 3 supercomputers requires diligence and a large pocketbook.

G-News
11-12-2003, 11:20 AM
I seriously doubt that Apple is going to equip their machines with Infiniband interfaces per default anytime soon.
It's not like Infiniband is the per se standard for clustering, nor are these interfaces anything cheap.
The only thing I can imagine is Apple offering interface cards as BTO in the future.

G-News

Amorph
11-12-2003, 11:34 AM
Originally posted by G-News
I seriously doubt that Apple is going to equip their machines with Infiniband interfaces per default anytime soon.

All they have to offer, as I understand it, is a PCI-X slot or equivalent. If the customer wants Infiniband, they can help themselves.

That notwithstanding, I'm sure Apple is keenly interested in the driver that was custom written for that card VA Tech used...

O and A
11-12-2003, 02:26 PM
As I remember VT is planning another supercomputer with a new facility in 2005