Archive

Category Archives for "IT Industry"

Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years

If you are looking for an alternative to Nvidia GPUs for AI inference – and who isn’t these days with generative AI being the hottest thing since a volcanic eruption – then you might want to give Groq a call.

The post Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years first appeared on The Next Platform.

Groq Says It Can Deploy 1 Million AI Inference Chips In Two Years was written by Timothy Prickett Morgan at The Next Platform.

Nvidia Proves The Enormous Potential For Generative AI

The exorbitant cost of GPU-accelerated systems for training and inference and latest to rush to find gold in mountains of corporate data are combining to exert tectonic forces on the datacenter landscape and push up a new Himalaya range – with Nvidia as its steepest and highest peak.

The post Nvidia Proves The Enormous Potential For Generative AI first appeared on The Next Platform.

Nvidia Proves The Enormous Potential For Generative AI was written by Timothy Prickett Morgan at The Next Platform.

Pushing The Limits Of HPC And AI Is Becoming A Sustainability Headache

As Moore’s law continues to slow, delivering more powerful HPC and AI clusters means building larger, more power hungry facilities.

The post Pushing The Limits Of HPC And AI Is Becoming A Sustainability Headache first appeared on The Next Platform.

Pushing The Limits Of HPC And AI Is Becoming A Sustainability Headache was written by Timothy Prickett Morgan at The Next Platform.

What To Do When You Can’t Get Nvidia H100 GPUs

In a world where allocations of “Hopper” H100 GPUs coming out of Nvidia’s factories are going out well into 2024, and the allocations for the impending “Antares” MI300X and MI300A GPUs are probably long since spoken for, anyone trying to build a GPU cluster to power a large language model for training or inference has to think outside of the box.

The post What To Do When You Can’t Get Nvidia H100 GPUs first appeared on The Next Platform.

What To Do When You Can’t Get Nvidia H100 GPUs was written by Timothy Prickett Morgan at The Next Platform.

Billion-Dollar AI Promise a Bright Spot in Gloomy Quarter for Cisco

Cisco navigated a rocky road in its first quarter of the year as evidenced by the dips in share price for the networking giant this morning.

The post Billion-Dollar AI Promise a Bright Spot in Gloomy Quarter for Cisco first appeared on The Next Platform.

Billion-Dollar AI Promise a Bright Spot in Gloomy Quarter for Cisco was written by Nicole Hemsoth Prickett at The Next Platform.

Microsoft Holds Chip Makers’ Feet To The Fire With Homegrown CPU And AI Chips

After many years of rumors, Microsoft has finally confirmed that it is following rivals Amazon Web Services and Google into the design of custom processors and accelerators for their clouds.

The post Microsoft Holds Chip Makers’ Feet To The Fire With Homegrown CPU And AI Chips first appeared on The Next Platform.

Microsoft Holds Chip Makers’ Feet To The Fire With Homegrown CPU And AI Chips was written by Timothy Prickett Morgan at The Next Platform.

Will Isambard 4 Be The UK’s First True Exascale Machine?

Here is a story you don’t hear very often: A supercomputing center was just given a blank check up to the peak power consumption of its facility to build a world-class AI/HPC supercomputer instead of a sidecar partition with some GPUs to play around with and wish its researchers had a lot more capacity.

The post Will Isambard 4 Be The UK’s First True Exascale Machine? first appeared on The Next Platform.

Will Isambard 4 Be The UK’s First True Exascale Machine? was written by Timothy Prickett Morgan at The Next Platform.

Top500 Supercomputers: Who Gets The Most Out Of Peak Performance?

The most exciting thing about the Top500 rankings of supercomputers that come out each June and November is not who is on the top of the list.

The post Top500 Supercomputers: Who Gets The Most Out Of Peak Performance? first appeared on The Next Platform.

Top500 Supercomputers: Who Gets The Most Out Of Peak Performance? was written by Timothy Prickett Morgan at The Next Platform.

Nvidia Pushes Hopper HBM Memory, And That Lifts GPU Performance

For very sound technical and economic reasons, processors of all kinds have been overprovisioned on compute and underprovisioned on memory bandwidth – and sometimes memory capacity depending on the device and depending on the workload – for decades.

The post Nvidia Pushes Hopper HBM Memory, And That Lifts GPU Performance first appeared on The Next Platform.

Nvidia Pushes Hopper HBM Memory, And That Lifts GPU Performance was written by Timothy Prickett Morgan at The Next Platform.

You Can Load Up On Cheap Cores With Updated Milan Epycs

There are two ways that CPU makers can deliver more bang for the buck, and those running distributed computing workloads can go either way – or somewhere in between – as they build out their server clusters.

The post You Can Load Up On Cheap Cores With Updated Milan Epycs first appeared on The Next Platform.

You Can Load Up On Cheap Cores With Updated Milan Epycs was written by Timothy Prickett Morgan at The Next Platform.

Supermicro Racks Up The System Revenues

There is wracking up the money, and racking up the servers – and Supermicro, which is sometimes an OEM and sometimes an ODM as well as a motherboard and component supplier to those who want to be either, is doing both here at the beginning of its fiscal 2024 year.

The post Supermicro Racks Up The System Revenues first appeared on The Next Platform.

Supermicro Racks Up The System Revenues was written by Timothy Prickett Morgan at The Next Platform.

Ventana Launches Veyron V2 RISC-V Into The Datacenter

It took the X86 architecture fifteen years get an appreciable share of datacenter compute, and it took the Arm architecture about ten years to get a foothold you could measure.

The post Ventana Launches Veyron V2 RISC-V Into The Datacenter first appeared on The Next Platform.

Ventana Launches Veyron V2 RISC-V Into The Datacenter was written by Timothy Prickett Morgan at The Next Platform.

AMD’s Instinct GPU Business Is Coiled To Spring

Timing is a funny thing. The summer of 2006 when AMD bought GPU maker ATI Technologies for $5.6 billion and took on both Intel in CPUs and Nvidia in GPUs was the same summer when researchers first started figuring out how to offload single-precision floating point math operations from CPUs to Nvidia GPUs to try to accelerate HPC simulation and modeling workloads.

The post AMD’s Instinct GPU Business Is Coiled To Spring first appeared on The Next Platform.

AMD’s Instinct GPU Business Is Coiled To Spring was written by Timothy Prickett Morgan at The Next Platform.

Amazon Gears Up To Profit Mightily From The Generative AI Boom

Because they are in the front of the line for acquiring Nvidia datacenter GPUs, the hyperscalers and cloud builders are going to be the ones who benefit mightily from shortages of matrix math engines that can train AI models and run inference against them.

The post Amazon Gears Up To Profit Mightily From The Generative AI Boom first appeared on The Next Platform.

Amazon Gears Up To Profit Mightily From The Generative AI Boom was written by Timothy Prickett Morgan at The Next Platform.

Intel Is Counting On AI Inference To Save The Xeon CPU

There is little question that generative AI as well as other kinds of machine learning are going to augment applications in every industry and in every part of the application stack in the coming years.

The post Intel Is Counting On AI Inference To Save The Xeon CPU first appeared on The Next Platform.

Intel Is Counting On AI Inference To Save The Xeon CPU was written by Timothy Prickett Morgan at The Next Platform.

1 5 6 7 8 9 147