Fast Machine Learning for Science Workshop 2022

Name: Fast Machine Learning for Science Workshop 2022
Start: 2022-10-03T09:00:00-05:00
End: 2022-10-06T12:30:00-05:00
Location: Southern Methodist University

3–6 Oct 2022

Southern Methodist University

America/Chicago timezone

Quantized Neural Networks on FPGAs using HAWQ-V3 and hl4ml

5 Oct 2022, 14:45

15m

Southern Methodist University

Contributed Talks

Javier Ignacio Campos (Fermi National Accelerator Lab. (US))

Neural networks have been shown to be helpful in identifying events of interest in particle physics. However, to be used for live trigger decisions, they must meet demandingly low latencies and resource utilization for deployment on Field Programmable Gate Arrays (FPGAs). HAWQ-V3, a Hessian-based quantization-aware training framework, and hls4ml, an FPGA firmware implementation package, address these issues. HAWQ-V3 is a training framework enabling ultra-low and mixed-precision quantization. It introduced an approach to determining the relative quantization precision of each layer based on the layer's Hessian spectrum. More recently, it implements a computational graph with only integer addition, multiplication, and bit-shifting. We present a neural network classifier implemented with HAWQ-V3 for high-pT jets from simulations of LHC proton-proton collisions. We then introduce an extension for HAWQ-V3 to translate our classifier into the Quantized ONNX (QONNX) intermediate representation format, an extension of the Open Neural Network Exchange (ONNX) format, supporting arbitrary-precision and low-precision neural networks. We demonstrate how the conversion of HAWQ-V3 models leverages the PyTorch Just-in-Time compiler to trace and translate models to QONNX operators. We then proceed to hls4ml to create firmware implementation of our quantized neural network and review its estimated latency and resource utilization for an FPGA.

Javier Ignacio Campos (Fermi National Accelerator Lab. (US))

Slides

Fast Machine Learning for Science Workshop 2022

Quantized Neural Networks on FPGAs using HAWQ-V3 and hl4ml

Southern Methodist University

Speaker

Description

Author

Presentation materials

Choose timezone

Fast Machine Learning for Science Workshop 2022

Speaker

Description

Author

Presentation materials