HPC

Why we will not have a unified HPC and AI software environment, ever

No good reason for vendors to play ball with each other


Register Debate Welcome to the latest Register Debate in which writers discuss technology topics, and you the reader choose the winning argument. The format is simple: we propose a motion, the arguments for the motion will run this Monday and Wednesday, and the arguments against on Tuesday and Thursday. During the week you can cast your vote on which side you support using the poll embedded below, choosing whether you're in favour or against the motion. The final score will be announced on Friday, revealing whether the for or against argument was most popular.

This week's motion is: A unified, agnostic software environment can be achieved. We debate the question: can the industry ever have a truly open, unified, agnostic software environment in HPC and AI that can span multiple kinds of compute engines?

Arguing AGAINST the motion today is Dan Olds, Chief Research Officer for HPC/AI industry analyst firm Intersect360 Research.

There is no way in hell this will happen. Why? Because this is a world of human beings who are working in the interests of themselves and their organizations. APIs are sources of competitive advantage for many companies and, as such, not something that those suppliers should want to completely standardize – particularly when that standard is being driven by the largest and most influential supplier in the industry.

This doesn’t mean that standards shouldn’t exist. But they should, and will, only exist in situations where competitive advantage won’t accrue to a single (or small group) of players. Self-serving standards won’t survive in the long run – they stifle innovation and shackle new products and competition by forcing new entrants to adhere to the rules established by legacy products/vendors. But this is a self-correcting mechanism in most cases, the old standards get overrun by the sheer brilliance of new ways of doing things and they fade away into obscurity.

We have seen plays like this before. The one that I’m most intimately familiar with was IBM’s gambit in the early 2000s, along with SCO and Sequent, to build a unified version of the Unix operating system, called Project Monterey. It was supposed to be the best Unix ever, with features and performance that would quickly make it the dominant flavor.

To give you a little context, I was employed by IBM at this time, and I started my tech career several years earlier with Sequent. I attended an internal IBM RS/6000 (as their Power systems were called back then) briefing in Austin and saw slide after slide about the new tech they were injecting into their proprietary AIX variant of Unix. As I was one of the guys who was going to be explaining this to customers, I kept asking the same questions over and over: “Why are you continuing to go full speed ahead with AIX when IBM is supposed to be driving the Project Monterey bus?” and, “Will these wondrous innovations be shared with the upcoming Project Monterey?”

I didn’t receive clear answers to my queries, telling me that even though IBM was the biggest supporter of a unified Unix, they were at best hedging their bets by pushing AIX development or, at worst, completely hypocritical and looking to use Monterey to roil the Unix market.

I’m not trying to toss shade at Intel for pushing OneAPI or AMD for pushing ROCm. They will soon both be suppliers of a large number of compute engines, ranging from CPUs to GPUs to FPGAs (and in Intel’s case, custom ASICs for AI). Each of these have, for the most part, unique APIs, which is a recipe for eventual drowning under the load of making sure that everything can work with everything else.

In short, Intel needs to have oneAPI at least for its own products in order to keep churning them out and keep the complexity down internally. It will also help them tell a better story to customers in that a common API will make it easy for developers to utilize full Intel product stacks. If I’m Intel, this is the right move. Ditto for AMD and ROCm.

To think that other vendors will embrace oneAPI or ROCm is naïve

But to think that other vendors will embrace oneAPI or ROCm is naïve. Imagine that you are an up-and-comer in the tech field. You have a new accelerator that outperforms everything out there. You tell your engineers that you have decided that, for business reasons, your fancy new accelerator needs to utilize oneAPI or ROCm in order to maintain compatibility with what will be the possibly only instruction set that will matter in the future.

As your engineers start yelling at you, you catch fragments of sentences coming through the noise: ”You can’t be serious, this means we have to completely re-engineer our code!” and, “It’s our instructions that make us so much faster!” and, “Without our custom instructions, we can’t get X or Y to work without serious performance hits.” The many, many expletives in those sentences have been removed.

Shaken, you leave the meeting with their cursing and insults echoing in your ears. Perhaps you should reconsider your decision before it’s too late. Maybe there is some competitive advantage to be had in doing your own thing and not adhering to standards that throttle the performance of your new accelerator.

The lesson of history is that companies do not compromise to be one of the crowd when it is not in their best interest. ®

Cast your vote below. We'll close the poll on Thursday night and publish the final result on Friday. You can track the debate's progress here.

JavaScript Disabled

Please Enable JavaScript to use this feature.


Other stories you might like

  • FTC urged to protect data privacy of women visiting abortion clinics
    As Supreme Court set to overturn Roe v Wade, safeguards on location info now more vital than ever

    Democrat senators have urged America's Federal Trade Commission to do something to protect the privacy of women after it emerged details of visits to abortion clinics were being sold by data brokers.

    Women's healthcare is an especially thorny issue right now after the Supreme Court voted in a leaked draft majority opinion to overturn Roe v Wade, a landmark ruling that declared women's rights to have an abortion are protected by the Fourteenth Amendment of the US Constitution.

    If the nation's top judges indeed vote to strike down that 1973 decision, individual states, at least, can set their own laws governing women's reproductive rights. Thirteen states already have so-called "trigger laws" in place prohibiting abortions – mostly with exceptions in certain conditions, such as if the pregnancy or childbirth endangers the mother's life – that will go into effect if Roe v Wade is torn up. People living in those states would, in theory, have to travel to another state where abortion is legal to carry out the procedure lawfully, although laws are also planned to ban that.

    Continue reading
  • Zuckerberg sued for alleged role in Cambridge Analytica data-slurp scandal
    I can prove CEO was 'personally involved in Facebook’s failure to protect privacy', DC AG insists

    Cambridge Analytica is back to haunt Mark Zuckerberg: Washington DC's Attorney General filed a lawsuit today directly accusing the Meta CEO of personal involvement in the abuses that led to the data-slurping scandal. 

    DC AG Karl Racine filed [PDF] the civil suit on Monday morning, saying his office's investigations found ample evidence Zuck could be held responsible for that 2018 cluster-fsck. For those who've put it out of mind, UK-based Cambridge Analytica harvested tens of millions of people's info via a third-party Facebook app, revealing a – at best – somewhat slipshod handling of netizens' privacy by the US tech giant.

    That year, Racine sued Facebook, claiming the social network was well aware of the analytics firm's antics yet failed to do anything meaningful until the data harvesting was covered by mainstream media. Facebook repeatedly stymied document production attempts, Racine claimed, and the paperwork it eventually handed over painted a trail he said led directly to Zuck. 

    Continue reading
  • Florida's content-moderation law kept on ice, likely unconstitutional, court says
    So cool you're into free speech because that includes taking down misinformation

    While the US Supreme Court considers an emergency petition to reinstate a preliminary injunction against Texas' social media law HB 20, the US Eleventh Circuit Court of Appeals on Monday partially upheld a similar injunction against Florida's social media law, SB 7072.

    Both Florida and Texas last year passed laws that impose content moderation restrictions, editorial disclosure obligations, and user-data access requirements on large online social networks. The Republican governors of both states justified the laws by claiming that social media sites have been trying to censor conservative voices, an allegation that has not been supported by evidence.

    Multiple studies addressing this issue say right-wing folk aren't being censored. They have found that social media sites try to take down or block misinformation, which researchers say is more common from right-leaning sources.

    Continue reading
  • US-APAC trade deal leaves out Taiwan, military defense not ruled out
    All fun and games until the chip factories are in the crosshairs

    US President Joe Biden has heralded an Indo-Pacific trade deal signed by several nations that do not include Taiwan. At the same time, Biden warned China that America would help defend Taiwan from attack; it is home to a critical slice of the global chip industry, after all. 

    The agreement, known as the Indo-Pacific Economic Framework (IPEF), is still in its infancy, with today's announcement enabling the United States and the other 12 participating countries to begin negotiating "rules of the road that ensure [US businesses] can compete in the Indo-Pacific," the White House said. 

    Along with America, other IPEF signatories are Australia, Brunei, India, Indonesia, Japan, South Korea, Malaysia, New Zealand, the Philippines, Singapore, Thailand and Vietnam. Combined, the White House said, the 13 countries participating in the IPEF make up 40 percent of the global economy. 

    Continue reading
  • 381,000-plus Kubernetes API servers 'exposed to internet'
    Firewall isn't a made-up word from the Hackers movie, people

    A large number of servers running the Kubernetes API have been left exposed to the internet, which is not great: they're potentially vulnerable to abuse.

    Nonprofit security organization The Shadowserver Foundation recently scanned 454,729 systems hosting the popular open-source platform for managing and orchestrating containers, finding that more than 381,645 – or about 84 percent – are accessible via the internet to varying degrees thus providing a cracked door into a corporate network.

    "While this does not mean that these instances are fully open or vulnerable to an attack, it is likely that this level of access was not intended and these instances are an unnecessarily exposed attack surface," Shadowserver's team stressed in a write-up. "They also allow for information leakage on version and build."

    Continue reading
  • A peek into Gigabyte's GPU Arm for AI, HPC shops
    High-performance platform choices are going beyond the ubiquitous x86 standard

    Arm-based servers continue to gain momentum with Gigabyte Technology introducing a system based on Ampere's Altra processors paired with Nvidia A100 GPUs, aimed at demanding workloads such as AI training and high-performance compute (HPC) applications.

    The G492-PD0 runs either an Ampere Altra or Altra Max processor, the latter delivering 128 64-bit cores that are compatible with the Armv8.2 architecture.

    It supports 16 DDR4 DIMM slots, which would be enough space for up to 4TB of memory if all slots were filled with 256GB memory modules. The chassis also has space for no fewer than eight Nvidia A100 GPUs, which would make for a costly but very powerful system for those workloads that benefit from GPU acceleration.

    Continue reading

Biting the hand that feeds IT © 1998–2022