Description
Neural networks with latency requirements on the order of microseconds, like those used at the CERN Large Hadron Collider, are typically deployed on FPGAs fully pipelined with an initiation interval (II) of 1. A major bottleneck for deploying such networks is area utilization, which is largely driven by the constant matrix-vector multiplication (CMVM) operations they require. In this work, we propose an efficient algorithm for implementing CMVM operations with distributed arithmetic (DA) on FPGAs that optimizes for area consumption and latency simultaneously. The algorithm achieves resource reductions comparable to state-of-the-art algorithms while being significantly faster to compute.
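To make the CMVM idea concrete, the sketch below shows the multiplier-free principle behind distributed arithmetic: because the matrix is constant, each multiplication can be decomposed at design time into shifts and additions. This is only an illustration under that general principle, not da4ml's actual algorithm, which additionally shares common subexpressions across outputs to reduce adder count.

```python
def shift_add_terms(c):
    """Decompose a non-negative constant into the bit positions of its
    binary representation, so c * x == sum(x << s for s in terms)."""
    terms = []
    shift = 0
    while c:
        if c & 1:
            terms.append(shift)
        c >>= 1
        shift += 1
    return terms

def cmvm(W, x):
    """Constant matrix-vector product computed with only shifts and adds,
    as a fixed adder network would on an FPGA (no hardware multipliers)."""
    y = []
    for row in W:
        acc = 0
        for c, xj in zip(row, x):
            sign = -1 if c < 0 else 1
            for s in shift_add_terms(abs(c)):
                acc += sign * (xj << s)
        y.append(acc)
    return y

# Example: [[3, -2], [5, 7]] @ [1, 2] -> [-1, 19]
print(cmvm([[3, -2], [5, 7]], [1, 2]))
```

In hardware, the cost of such a network is the number of adders it contains; DA-based optimizers reduce this by factoring out subexpressions (e.g. `x1 + 2*x2`) that appear in several output rows.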
We release da4ml, a free and open-source package that enables end-to-end, bit-exact conversion of neural networks to Verilog or HLS designs, optimized with the proposed algorithm. For easy adoption into existing workflows, we also integrate da4ml into the hls4ml library. Our results show that da4ml can reduce on-chip resources by up to a third for realistic, highly quantized neural networks while simultaneously reducing latency compared to hls4ml's native implementation, enabling networks that were previously infeasible to deploy.