15–18 Oct 2024
Purdue University
America/Indiana/Indianapolis timezone

Scaling CNN Deployment on FPGAs: The path to 100k parameters with hls4ml

Not scheduled
20m
Steward Center 306 (Third floor) (Purdue University)

Steward Center 306 (Third floor)

Purdue University

128 Memorial Mall Dr, West Lafayette, IN 47907
Poster

Speaker

Stylianos Tzelepis (National Technical Univ. of Athens (GR))

Description

Deploying large CNNs on resource-constrained hardware such as FPGAs poses significant challenges, particularly in balancing high throughput with limited resources and power consumption. To address these challenges, hls4ml was leveraged to accelerate inference through a streaming architecture, in contrast to programmable engines with dedicated instruction sets commonly used to scale to accommodate large neural networks. Despite hls4ml's support for resource-efficient designs and layer-wise streaming via the Vivado HLS compiler, achieving a 100k-parameter CNN deployment remained a significant hurdle. This work details the key advancements implemented in hls4ml to overcome these barriers, with the long-term goal of scaling to 1M parameters. Our approach focuses on three core areas: migrating to the Vitis HLS compiler for enhanced performance, extending the separable convolutional layer implementation to reduce resource demands, and integrating the built-in FIFO depth optimization of Vitis HLS into the hls4ml workflow to efficiently manage resources while maintaining low latency and power efficiency.

Authors

Nicolo Ghielmetti (CERN) Sioni Paris Summers (CERN) Stylianos Tzelepis (National Technical Univ. of Athens (GR))

Presentation materials

There are no materials yet.