Machine Learning

MLCommons, the consortium behind the MLPerf family of machine learning benchmarks, is announcing this morning that the organization will be developing a new desktop AI benchmarking suite under the MLPerf banner. Helmed by the body’s newly-formed MLPerf Client working group, the task force will be developing a client AI benchmark suit aimed at traditional desktop PCs, workstations, and laptops. According to the consortium, the first iteration of the MLPerf Client benchmark suite will be based on Meta’s Llama 2 LLM, with an initial focus on assembling a benchmark suite for Windows. The de facto industry standard benchmark for AI inference and training on servers and HPC systems, MLCommons has slowly been extending the MLPerf family of benchmarks to additional devices over the past several years...

AMD: Partial RDNA 3 Video Card Support Coming to Future ROCm Releases

AMD this morning is formally announcing the launch of the latest version of its GPU compute software stack, ROCm 5.7. Along with making several important updates to the software...

15 by Ryan Smith on 6/29/2023

Intel Discloses New Details On Meteor Lake VPU Block, Lays Out Vision For Client AI

While the first systems based on Intel’s forthcoming Meteor Lake (14th Gen Core) systems are still at least a few months out – and thus just a bit too...

17 by Ryan Smith on 5/29/2023

NVIDIA: Grace Hopper Has Entered Full Production & Announcing DGX GH200 AI Supercomputer

Teeing off an AI-heavy slate of announcements for NVIDIA, the company has confirmed that their Grace Hopper “superchip” has entered full production. The combination of a Grace CPU and...

8 by Ryan Smith on 5/29/2023

NVIDIA Announces H100 NVL - Max Memory Server Card for Large Language Models

While this year’s Spring GTC event doesn’t feature any new GPUs or GPU architectures from NVIDIA, the company is still in the process of rolling out new products based...

25 by Ryan Smith on 3/21/2023

NVIDIA Hopper GPU Architecture and H100 Accelerator Announced: Working Smarter and Harder

Depending on your point of view, the last two years have either gone by very slowly, or very quickly. While the COVID pandemic never seemed to end – and...

88 by Ryan Smith on 3/22/2022

Cerebras Completes Series F Funding, Another $250M for $4B Valuation

Every once in a while, a startup comes along with something out of left field. In the AI hardware generation, Cerebras holds that title, with their Wafer Scale Engine...

25 by Dr. Ian Cutress on 11/10/2021

NVIDIA Launches A2 Accelerator: Entry-Level Ampere For Edge Inference

Alongside a slew of software-related announcements this morning from NVIDIA as part of their fall GTC, the company has also quietly announced a new server GPU product for the...

16 by Ryan Smith on 11/9/2021

Hot Chips 2021 Live Blog: Machine Learning (Graphcore, Cerebras, SambaNova, Anton)

Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...

3 by Dr. Ian Cutress on 8/24/2021

Hot Chips 2021 Live Blog: Machine Learning (Esperanto, Enflame, Qualcomm)

Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...

0 by Dr. Ian Cutress on 8/24/2021

Hot Chips 2021 Keynote Live Blog: Designing Chips with AI, Synopsys

Welcome to Hot Chips! This is the annual conference all about the latest, greatest, and upcoming big silicon that gets us all excited. Stay tuned during Monday and Tuesday...

0 by Dr. Ian Cutress on 8/23/2021

Cadence Cerebrus to Enable Chip Design with ML: PPA Optimization in Hours, not Months

The design of most leading edge processors and ASICs rely on steps of optimization, with the three key optimization points being Performance, Power, and Area (and sometimes Cost). Once...

20 by Dr. Ian Cutress on 7/22/2021

Using AI to Build Better Processors: Google Was Just the Start, Says Synopsys

In light of the rate of innovation, chip design teams have spent tens of thousands of hours honing their skills over the decades. But getting the best human-designed processor...

100 by Dr. Ian Cutress on 6/23/2021

Xilinx Expands Versal AI to the Edge: Helping Solve the Silicon Shortage

Today Xilinx is announcing an expansion to its Versal family, focused specifically on low power and edge devices. Xilinx Versal is the productization of a combination of many different...

25 by Dr. Ian Cutress on 6/9/2021

MLPerf Inference v1.0: 2000 Suite Results, New Power Measurements

There has been a strong desire for a series of industry standard machine learning benchmarks, akin to the SPEC benchmarks for CPUs, in order to compare relative solutions. Over...

11 by Dr. Ian Cutress on 4/21/2021

Graphcore Series E Funding: $710m Total, $440m Cash-in-Hand

For those that aren’t following the AI industry, one of the key metrics to observe for a number of these AI semiconductor startups is the amount of funding they...

12 by Dr. Ian Cutress on 1/4/2021

Qualcomm's Cloud AI 100 Now Sampling: Up to 400TOPs at 75W

Today Qualcomm is revealing more information on last year’s announced “Cloud AI 100” inference chip and platform. The new inference platform by the company is said to have entered...

15 by Andrei Frumusanu on 9/16/2020

342 Transistors for Every Person In the World: Cerebras 2nd Gen Wafer Scale Engine Teased

One of the highlights of Hot Chips from 2019 was the startup Cerebras showcasing its product – a large ‘wafer-scale’ AI chip that was literally the size of a...

32 by Dr. Ian Cutress on 8/18/2020

Arm Announces Ethos-N78 NPU: Bigger And More Efficient

Yesterday Arm released the new Cortex-A78, Cortex-X1 CPUs and the new Mali-G78 GPU. Alongside the new “key” IPs from the company, we also saw the reveal of the newest...

34 by Andrei Frumusanu on 5/27/2020

AMD Unveils CDNA GPU Architecture: A Dedicated GPU Architecture for Data Centers

Over the last decade, the industry has seen a boom in demand for GPUs for the data center. Driven in large part by rapid progress in neural networking, deep...

26 by Ryan Smith on 3/5/2020

Log in

Don't have an account? Sign up now