HPC

AMD, Nvidia, HPE tapped to triple the speed of US weather super with $35m upgrade

Scientists will focus on modelling hurricanes, wildfires, solar storms with Milan Epycs and Nv A100s


HPE will upgrade the US National Center for Atmospheric Research's (NCAR) supercomputer using AMD and Nvidia's latest CPUs and GPUs, creating a machine roughly three times as powerful as its current Intel-based beast.

That system running today, code-named Cheyenne, is four years old, and now its government-funded lab wants a bigger and faster super to forecast natural disasters unfolding on Earth. Uncle Sam shopped around and awarded the contract, worth more than $35m, to Hewlett Packard Enterprise.

The upgraded supercomputer has yet to be given an official name, and is expected to be up and running in 2022, when it will replace Cheyenne, we're told. Kids in the lab's home state of Wyoming will be asked to propose a name for the thing.

The computer will be based on HPE's Cray EX (previously Shasta) supercomputer blueprints, and is expected to reach a theoretical maximum performance of 19.87 petaFLOPS – that's much faster than the 5.34 petaFLOPS Cheyenne offers today.

"That is almost 3.5 times the speed of scientific computing performed by the Cheyenne supercomputer, and the equivalent of every man, woman, and child on the planet solving one equation every second for a month," NCAR said Wednesday. "Once operational, the HPE-powered system is expected to rank among the top 25 or so fastest supercomputers in the world."

Once operational, the HPE-powered system is expected to rank among the top 25 or so fastest supercomputers in the world

The new machine will sport 2,570 compute nodes: 2,488 of those will contain AMD's 7nm third-gen Epyc Milan processors – due to officially launch in March – and 82 remaining nodes will be a mixture of Milan chips and Nvidia's 7nm A100 GPUs. It'll have a total RAM capacity of a whopping 692TB. The connectivity between the nodes is powered by HPE's Slingshot interconnect architecture, boasting a bandwidth of 200Gbps per direction per network switch port.

By using GPUs to accelerate workloads, NCAR will be moving away from its previous CPU-only approach. Although Cheyenne has more 4,032 computation nodes, they're filled with Intel's Xeon Broadwell workhorses, and this system design is less efficient, in terms of power and silicon required, than more modern supercomputers that have a mix of CPUs and GPUs to crunch through math operations at high speed. Specialist hardware wins out over generic processors.

CINECA

Nvidia signs up for an Italian Job: Building for Europe the 'world's fastest AI supercomputer' by 2022

READ MORE

"In terms of hardware, the new system will be hugely helpful to AI and machine learning," David Hosansky, a spokesperson for NCAR, told The Register.

"The big benefit of GPUs is performing large numbers of computations simultaneously on one chip, resulting in much less power usage and hardware for the same number of parallel operations. GPUs have less on-board memory than CPUs, but the ones being used in this new system are top of the line in terms of both memory and number of cores. That will allow our scientists to load more data and train larger machine learning models than they could before."

NCAR scientists will use machine-learning algorithms to simulate models of freak weather events, including hurricanes, hail storms, wildfires, and solar storms. These models ingest huge amounts of weather data to output forecasts, and help scientists understand the impacts of climate change.

For example, AI is used to estimate the amount of moisture in vegetation and that data is fed into another model that combines real data from satellite observations to map out regions at risk of wildfires. Weather changes quickly, and these maps are regenerated every day.

"We are now working on using GOES-R satellite data and machine learning to produce hourly fuel moisture content maps over [the US]," NCAR scientist Branko Kosovic, director of the Weather Systems and Assessment Program, told El Reg. "Machine learning could be used also to develop more accurate and frequent high resolution maps of fuel characteristics. Furthermore, machine learning models could be developed to improve rate of fire spread parameterizations by combining observations and theoretical considerations and developments."

For more conversations with NCAR staff, check out our sister site, The Next Platform. ®

Broader topics


Other stories you might like

  • New York City rips out last city-owned public payphones
    Y'know, those large cellphones fixed in place that you share with everyone and have to put coins in. Y'know, those metal disks representing...

    New York City this week ripped out its last municipally-owned payphones from Times Square to make room for Wi-Fi kiosks from city infrastructure project LinkNYC.

    "NYC's last free-standing payphones were removed today; they'll be replaced with a Link, boosting accessibility and connectivity across the city," LinkNYC said via Twitter.

    Manhattan Borough President Mark Levine said, "Truly the end of an era but also, hopefully, the start of a new one with more equity in technology access!"

    Continue reading
  • Cheers ransomware hits VMware ESXi systems
    Now we can say extortionware has jumped the shark

    Another ransomware strain is targeting VMware ESXi servers, which have been the focus of extortionists and other miscreants in recent months.

    ESXi, a bare-metal hypervisor used by a broad range of organizations throughout the world, has become the target of such ransomware families as LockBit, Hive, and RansomEXX. The ubiquitous use of the technology, and the size of some companies that use it has made it an efficient way for crooks to infect large numbers of virtualized systems and connected devices and equipment, according to researchers with Trend Micro.

    "ESXi is widely used in enterprise settings for server virtualization," Trend Micro noted in a write-up this week. "It is therefore a popular target for ransomware attacks … Compromising ESXi servers has been a scheme used by some notorious cybercriminal groups because it is a means to swiftly spread the ransomware to many devices."

    Continue reading
  • Twitter founder Dorsey beats hasty retweet from the board
    We'll see you around the Block

    Twitter has officially entered the post-Dorsey age: its founder and two-time CEO's board term expired Wednesday, marking the first time the social media company hasn't had him around in some capacity.

    Jack Dorsey announced his resignation as Twitter chief exec in November 2021, and passed the baton to Parag Agrawal while remaining on the board. Now that board term has ended, and Dorsey has stepped down as expected. Agrawal has taken Dorsey's board seat; Salesforce co-CEO Bret Taylor has assumed the role of Twitter's board chair. 

    In his resignation announcement, Dorsey – who co-founded and is CEO of Block (formerly Square) – said having founders leading the companies they created can be severely limiting for an organization and can serve as a single point of failure. "I believe it's critical a company can stand on its own, free of its founder's influence or direction," Dorsey said. He didn't respond to a request for further comment today. 

    Continue reading
  • Snowflake stock drops as some top customers cut usage
    You might say its valuation is melting away

    IPO darling Snowflake's share price took a beating in an already bearish market for tech stocks after filing weaker than expected financial guidance amid a slowdown in orders from some of its largest customers.

    For its first quarter of fiscal 2023, ended April 30, Snowflake's revenue grew 85 percent year-on-year to $422.4 million. The company made an operating loss of $188.8 million, albeit down from $205.6 million a year ago.

    Although surpassing revenue expectations, the cloud-based data warehousing business saw its valuation tumble 16 percent in extended trading on Wednesday. Its stock price dived from $133 apiece to $117 in after-hours trading, and today is cruising back at $127. That stumble arrived amid a general tech stock sell-off some observers said was overdue.

    Continue reading
  • Amazon investors nuke proposed ethics overhaul and say yes to $212m CEO pay
    Workplace safety, labor organizing, sustainability and, um, wage 'fairness' all struck down in vote

    Amazon CEO Andy Jassy's first shareholder meeting was a rousing success for Amazon leadership and Jassy's bank account. But for activist investors intent on making Amazon more open and transparent, it was nothing short of a disaster.

    While actual voting results haven't been released yet, Amazon general counsel David Zapolsky told Reuters that stock owners voted down fifteen shareholder resolutions addressing topics including workplace safety, labor organizing, sustainability, and pay fairness. Amazon's board recommended voting no on all of the proposals.

    Jassy and the board scored additional victories in the form of shareholder approval for board appointments, executive compensation and a 20-for-1 stock split. Jassy's executive compensation package, which is tied to Amazon stock price and mostly delivered as stock awards over a multi-year period, was $212 million in 2021. 

    Continue reading
  • Confirmed: Broadcom, VMware agree to $61b merger
    Unless anyone out there can make a better offer. Oh, Elon?

    Broadcom has confirmed it intends to acquire VMware in a deal that looks set to be worth $61 billion, if it goes ahead: the agreement provides for a “go-shop” provision under which the virtualization giant may solicit alternative offers.

    Rumors of the proposed merger emerged earlier this week, amid much speculation, but neither of the companies was prepared to comment on the deal before today, when it was disclosed that the boards of directors of both organizations have unanimously approved the agreement.

    Michael Dell and Silver Lake investors, which own just over half of the outstanding shares in VMware between both, have apparently signed support agreements to vote in favor of the transaction, so long as the VMware board continues to recommend the proposed transaction with chip designer Broadcom.

    Continue reading

Biting the hand that feeds IT © 1998–2022