Description
Pruning improves the hardware efficiency of neural networks by zeroing out low-magnitude weights. To take full advantage of pruning, efficient implementations of sparse matrix multiplication are required. The current hls4ml implementations of sparse matrix multiplication rely either on the built-in zero-suppression operations of high-level synthesis tools or on a coordinate-list representation, which faces scalability issues as the model size and reuse factor grow. These implementations, particularly the coordinate-list representation, are limited by the large fan-out they require within an FPGA or ASIC to remain fully flexible. We introduce a new implementation that preserves coordinate information but avoids the large dedicated fan-out logic through the use of a crossbar. We present results for FPGA implementations, scanning the model sparsity and initiation interval for multiple benchmark models from the MLPerf Inference Benchmark for anomaly detection and image classification.
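To illustrate the coordinate-list representation discussed above, the following is a minimal C++ sketch of a COO sparse matrix-vector product, in which only the surviving (unpruned) weights are stored together with their row and column indices. This is an illustrative sketch, not the hls4ml implementation; the dimensions, type names, and function name are assumptions.

```cpp
#include <array>
#include <cstddef>

constexpr std::size_t N_IN = 16;       // assumed input dimension
constexpr std::size_t N_OUT = 8;       // assumed output dimension
constexpr std::size_t N_NONZERO = 24;  // assumed number of surviving weights

// One surviving weight with its coordinates (COO entry).
struct CooEntry {
    std::size_t row;  // output index
    std::size_t col;  // input index
    float value;      // nonzero weight
};

// y = W x computed from only the nonzero weights. In an HLS flow, iterations
// of the accumulation loop are typically unrolled or pipelined, and every
// parallel multiplier must be able to read any element of x, which is where
// the large fan-out of the fully flexible implementation arises.
void coo_matvec(const std::array<CooEntry, N_NONZERO>& weights,
                const std::array<float, N_IN>& x,
                std::array<float, N_OUT>& y) {
    for (std::size_t i = 0; i < N_OUT; ++i) y[i] = 0.0f;
    for (const auto& w : weights) {
        y[w.row] += w.value * x[w.col];
    }
}
```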