3–6 Oct 2022
Southern Methodist University
America/Chicago timezone

Large CNN for HLS4ML and Deepcalo

5 Oct 2022, 14:05
5m
Southern Methodist University

Southern Methodist University

Speakers

ChiJui Chen Lin-Chi Yang Yan-Lun Huang

Description

Convolutional neural networks (CNN) have been widely applied in a tremendous of applications that involve image processing, including particle physics. Deepcalo is a package designed for developing CNNs using ATLAS data at CERN, targeting tasks like energy regression of electrons and photons. Although it has been shown that CNNs used in Deepcalo can handle the task smoothly, the extensive computation resources and high-power consumption lead it hard to perform real-time inference during the experiment. As a result, it is limited in software simulation usage.

To accelerate the inference time and lower the power consumption, we implement those CNNs on FPGAs (Field Programmable Gate Arrays) with HLS4ML. HLS4ML is an automated tool for deploying machine-learning models on FPGAs, targeting ultra-low latency using fully-on-chip architecture. Based on HLS C++ codes by Dr. Dylan Rankin, we extend the HLS4ML library for supporting an automatic large CNNs conversion. In this work, we introduce a deeply-optimized workflow for implementing large CNNs on FPGAs. Implemented on an AlveoU50 FPGA running at 200 MHz, the accelerator infers with 0.039 of IQR75 loss in 0.6 ms.

Primary authors

Alexander Joseph Schuy (University of Washington (US)) Bo-Cheng Lai ChiJui Chen Dylan Sheldon Rankin (Massachusetts Inst. of Technology (US)) Lin-Chi Yang Philip Coleman Harris (Massachusetts Inst. of Technology (US)) Scott Hauck Shih-Chieh Hsu (University of Washington Seattle (US)) Yan-Lun Huang Ziang Yin (University of Washington (US))

Presentation materials