EP-IT Data Science Seminars

Co-Design for Efficient & Adaptive ML

Name: Co-Design for Efficient & Adaptive ML
Start: 2024-06-26T11:00:00+02:00
End: 2024-06-26T12:00:00+02:00
Location: CERN

by Dr Yaman Umuroglu (AMD Research & Advanced Development)

Wednesday 26 Jun 2024, 11:00 → 12:00 Europe/Zurich

500/1-001 - Main Auditorium (CERN)

500/1-001 - Main Auditorium

CERN

400

Show room on map

Description

Beyond the well-known highlights in computer vision and natural language, AI is steadily expanding into new application domains. This Pervasive AI trend requires supporting diverse and fast-moving application requirements, ranging from specialized I/O to fault tolerance and limited resources, all the while retaining high performance and low latency. Adaptive compute architectures such as AMD FPGAs are an excellent fit for such requirements but require co-design of hardware and ML algorithms to reap the full benefits. In this talk, we will cover a breadth of co-design techniques, including their merits and challenges, from streaming dataflow architectures to quantization, from sparsity to full circuit co-design. By combining such techniques, we can enable nanosecond-latency and performance in the hundreds of millions of inferences per second. The proliferation of this technology is enabled via open-source AMD tools such as FINN, Brevitas and LogicNets, as well as the AMD-FastML collaborative project QONNX.

Yaman Umuroglu is a Senior Member of Technical Staff with AMD Research and Advanced Development. He holds a PhD degree from the Norwegian University of Science and Technology (NTNU) in domain-specific architectures for reconfigurable computing. His research takes a full-stack view of machine learning with neural networks with a focus on high-efficiency and high-performance implementations and spans hardware-network codesign, techniques for efficient arithmetic, sparsity and quantization.

Coffee will be served at 10:30.

Organised by

M. Girone, M. Elsing, L. Moneta, M. Pierini

Contact

EP-seminars.colloquia@cern.ch

Webcast

There is a live webcast for this event

98545267593

EP/IT Data Science seminar

Lorenzo Moneta

Pascal Pignereau, Markus Elsing, Maria Girone, Thomas Nik Bazl Fard, Caroline Cazenoves, EP Seminars and Colloquia, Maurizio Pierini

97200142

Join via phone

Choose timezone

Co-Design for Efficient & Adaptive ML

by Dr Yaman Umuroglu (AMD Research & Advanced Development)

500/1-001 - Main Auditorium

CERN