1–5 Sept 2025
ETH Zurich
Europe/Zurich timezone

wa-hls4ml: A Benchmark and Surrogate Models for hls4ml Resource and Latency Estimation

Not scheduled
1h
HIT G floor (gallery)

HIT G floor (gallery)

Speaker

Ben Hawks (Fermi National Accelerator Lab)

Description

As machine learning (ML) is increasingly implemented in hardware to address real-time challenges in scientific applications, the
development of advanced toolchains has significantly reduced the time required to iterate on various designs. These advancements have
solved major obstacles, but also exposed new challenges. For example, processes that were not previously considered bottlenecks, such
as hardware synthesis, are becoming limiting factors in the rapid iteration of designs. To mitigate these emerging constraints, multipleefforts have been undertaken to develop an ML-based surrogate model that estimates resource usage of synthesized ML accelerator
architectures. We introduce wa-hls4ml, a benchmark for ML accelerator resource and latency estimation, and its corresponding
initial dataset of over 680 000 fully connected and convolutional neural networks, all synthesized using hls4ml and targeting Xilinx
FPGAs. The benchmark evaluates the performance of resource and latency predictors against several common ML model architectures,
primarily originating from scientific domains, as exemplar models, and the average performance across a subset of the dataset.
Additionally, we introduce GNN- and transformer-based surrogate models that predict latency and resources for ML accelerators. We
present the architecture and performance of the models and find that the models generally predict latency and resources for the 75%
percentile within several percent of the synthesized resources on the synthetic test dataset.

Authors

Prof. Audrey Corbeil Therrien (Université de Sherbrooke) Ben Hawks (Fermi National Accelerator Lab) Dennis Plotnikov (Johns Hopkins University (US)) Dmitri Demler Donovan Sproule (Columbia University) Elham Khoda (University of Washington (US)) Giuseppe Di Guglielmo (Fermilab) Hamza Ezzaoui Rahali (University of Sherbrooke) Jason Weitz (Univ. of California San Diego (US)) Javier Mauricio Duarte (Univ. of California San Diego (US)) Dr Karla Tame-Narvaez (Fermilab National Accelerator Laboratory) Keegan Smith (Texas A&M University) Mohammad Medhi Rahimifar (University of Sherbrooke) Nhan Tran (Fermi National Accelerator Lab. (US)) Russell Marroquin (University of California San Diego) Vladimir Loncar (CERN)

Presentation materials