Recently, compelling evidence for the emission of high-energy neutrinos from our home Galaxy - the Milky Way - was reported by IceCube, a neutrino detector instrumenting a cubic kilometer of glacial ice at the South Pole. This breakthrough observation was enabled by advances in AI, including a physics-driven deep learning method capable of exploiting available symmetries and domain knowledge....
This R&D project, initiated by the DOE Nuclear Physics AI-Machine Learning initiative in 2022, explores advanced AI technologies to address data processing challenges at RHIC and future EIC experiments. The main objective is to develop a demonstrator capable of efficient online identification of heavy-flavor events in proton-proton collisions (~1 MHz) based on their decay topologies, while...
Attention-based transformers are ubiquitous in machine learning applications from natural language processing to computer vision. In high energy physics, one central application is to classify collimated particle showers in colliders based on the particle of origin, known as jet tagging. In this work, we study the interpretability and prospects for acceleration of Particle Transformer (ParT),...
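As a minimal illustration of the kind of interpretability probe such a study involves (this is not the ParT implementation; the toy attention block, dimensions, and random inputs are assumptions), one can inspect the attention weights of a multi-head attention layer to see which particles in a jet attend to which others:

    # Illustrative attention probe in PyTorch; NOT the ParT model.
    import torch
    import torch.nn as nn

    n_particles, embed_dim, n_heads = 20, 32, 4          # made-up sizes
    attn = nn.MultiheadAttention(embed_dim, n_heads, batch_first=True)

    features = torch.randn(1, n_particles, embed_dim)    # stand-in particle embeddings
    _, attn_weights = attn(features, features, features,
                           need_weights=True, average_attn_weights=True)
    # attn_weights[0, i, j]: how strongly particle i attends to particle j,
    # averaged over heads; such maps are one starting point for interpretability.
    print(attn_weights.shape)  # torch.Size([1, 20, 20])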
In this work, we present the Scalable QUantization-Aware Real-time Keras (S-QUARK), an advanced quantization-aware training (QAT) framework for efficient FPGA inference built on top of Keras v3, supporting the TensorFlow, JAX, and PyTorch backends. The framework inherits all features of the High Granularity Quantization (HGQ) library and extends it to support fixed-point numbers with...
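As a rough sketch of what fixed-point quantization-aware training looks like in Keras 3 (this is not the S-QUARK or HGQ API; the FixedPointQuantizer layer and its bit widths are hypothetical stand-ins), activations can be rounded onto a fixed-point grid in the forward pass while gradients pass through a straight-through estimator:

    # Generic fixed-point fake-quantization layer for QAT; illustrative only.
    import keras
    from keras import ops

    class FixedPointQuantizer(keras.layers.Layer):
        def __init__(self, integer_bits=3, fractional_bits=4, **kwargs):
            super().__init__(**kwargs)
            self.scale = 2.0 ** fractional_bits
            self.max_val = 2.0 ** integer_bits - 2.0 ** (-fractional_bits)

        def call(self, x):
            clipped = ops.clip(x, -self.max_val, self.max_val)
            quantized = ops.round(clipped * self.scale) / self.scale
            # Straight-through estimator: quantized forward, identity backward.
            return clipped + ops.stop_gradient(quantized - clipped)

    # Toy model whose activations are constrained to fixed point, mimicking
    # what an FPGA-oriented QAT framework does during training.
    model = keras.Sequential([
        keras.layers.Dense(16, activation="relu"),
        FixedPointQuantizer(integer_bits=3, fractional_bits=4),
        keras.layers.Dense(1),
    ])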
The next phase of high energy particle physics research at CERN will involve the High-Luminosity Large Hadron Collider (HL-LHC). In preparation for this phase, the ATLAS Trigger and Data AcQuisition (TDAQ) system will undergo upgrades to its online software tracking capabilities. Studies are underway to assess a heterogeneous computing farm deploying GPUs and/or FPGAs, together with the...
An Artificial Intelligence (AI) model will spend ~90% of its lifetime in inference. Fully utilizing coprocessors, such as FPGAs or GPUs, for AI inference requires O(10) CPU cores to feed work to the coprocessors. Traditional data analysis pipelines will not be able to effectively and efficiently use the coprocessors to their full potential. To allow for distributed access to...
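A minimal sketch of this inference-as-a-service pattern, assuming a hypothetical HTTP inference endpoint (the URL, payload format, and feature shapes are placeholders, not any specific server's API): a pool of roughly ten CPU workers preprocesses events and keeps the coprocessor-backed server busy:

    # Illustrative client-side pattern; server URL and payload are placeholders.
    import concurrent.futures
    import numpy as np
    import requests

    SERVER_URL = "http://inference-server:8000/infer"  # hypothetical endpoint

    def preprocess(event):
        # Stand-in for CPU-side feature extraction.
        return np.asarray(event, dtype=np.float32).reshape(1, -1)

    def run_inference(event):
        features = preprocess(event)
        # The GPU/FPGA behind the server does the heavy lifting.
        response = requests.post(SERVER_URL, json={"inputs": features.tolist()})
        response.raise_for_status()
        return response.json()

    events = [np.random.rand(64) for _ in range(1000)]
    # ~10 CPU threads feeding work keep the coprocessor near full utilization.
    with concurrent.futures.ThreadPoolExecutor(max_workers=10) as pool:
        results = list(pool.map(run_inference, events))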
Processing large volumes of sparse neutrino interaction data is essential to the success of liquid argon time projection chamber (LArTPC) experiments such as DUNE. High rates of radiological background must be eliminated to extract critical information for track reconstruction and downstream analysis. Given the computational load of this rejection, and potential real time constraints of...
Detector simulation is a key component of physics analysis and related activities in particle physics. In the upcoming High-Luminosity LHC era, simulation will be required to use a smaller fraction of computing in order to satisfy resource constraints, at the same time as experiments are upgraded with new, higher-granularity detectors that require significantly more resources to...
The demand for machine learning algorithms on edge devices, such as Field-Programmable Gate Arrays (FPGAs), arises from the need to process and intelligently reduce vast amounts of data in real-time, especially in large-scale experiments like the Deep Underground Neutrino Experiment (DUNE). Traditional methods, such as thresholding, clustering, multiplicity checks, or coincidence checks,...
Detecting quenches in superconducting (SC) magnets by non-invasive means is a challenging real-time process that involves capturing and sorting through physical events that occur at different frequencies and appear as various signal features. These events may be correlated across instrumentation type, thermal cycle, and ramp, and together they build a more complete picture of continuous...
Reinforcement Learning (RL) is a promising approach for the autonomous AI-based control of particle accelerators. The real-time requirements of these algorithms often cannot be satisfied with conventional hardware platforms. In this contribution, the unique KINGFISHER platform being developed at KIT will be presented. Based on the novel AMD-Xilinx Versal platform, this system provides...
AI Red Teaming, an offshoot of traditional cybersecurity practices, has emerged as a critical tool for ensuring the integrity of AI systems. An underexplored area has been the application of AI Red Teaming methodologies to scientific applications, which increasingly use machine learning models in their workflows. I'll explain why this is important and how AI Red Teaming can highlight...
Neural networks with a latency requirement on the order of microseconds, like the ones used at the CERN Large Hadron Collider, are typically deployed fully unrolled on FPGAs. A bottleneck for the deployment of such neural networks is area utilization, which is directly related to the number of Multiply-Accumulate (MAC) operations in matrix-vector multiplications.
In this work, we present...
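For a sense of scale of the MAC bottleneck described above, a back-of-the-envelope count for a small, hypothetical fully connected network (the layer sizes are made up) looks as follows; in a fully unrolled design each of these MACs consumes dedicated DSP/LUT resources:

    # MAC count of the matrix-vector multiplications in a toy MLP.
    layer_sizes = [16, 64, 32, 5]   # hypothetical architecture

    macs_per_layer = [n_in * n_out
                      for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:])]
    print(macs_per_layer)       # [1024, 2048, 160]
    print(sum(macs_per_layer))  # 3232 MACs to place on the FPGA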
In the search for new physics, real-time detection of anomalous events is critical for maximizing the discovery potential of the LHC. CICADA (Calorimeter Image Convolutional Anomaly Detection Algorithm) is a novel CMS trigger algorithm operating at the 40 MHz collision rate. By leveraging unsupervised deep learning techniques, CICADA aims to enable physics-model independent trigger decisions,...
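A minimal sketch of the underlying idea, not the actual CICADA algorithm (the image shape, network size, and training details are assumptions): a small convolutional autoencoder is trained on ordinary calorimeter images, and its reconstruction error serves as the anomaly score used for the trigger decision:

    # Illustrative calorimeter-image autoencoder; NOT the CICADA model.
    import numpy as np
    import keras
    from keras import layers, ops

    def build_autoencoder(shape=(18, 14, 1)):   # hypothetical image shape
        inputs = keras.Input(shape=shape)
        x = layers.Conv2D(8, 3, padding="same", activation="relu")(inputs)
        x = layers.Conv2D(4, 3, padding="same", activation="relu")(x)
        outputs = layers.Conv2D(1, 3, padding="same")(x)
        return keras.Model(inputs, outputs)

    autoencoder = build_autoencoder()   # would be trained on background-dominated data

    def anomaly_score(image):
        x = ops.convert_to_tensor(image[None, ...])
        reconstruction = autoencoder(x)
        # Large reconstruction error -> event looks unlike the training data.
        return float(ops.mean((reconstruction - x) ** 2))

    score = anomaly_score(np.random.rand(18, 14, 1).astype("float32"))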
Unsupervised learning algorithms enable insights from large, unlabeled datasets, allowing for feature extraction and anomaly detection that can reveal latent patterns and relationships often not found by supervised or classical algorithms. Modern particle detectors, including liquid argon time projection chambers (LArTPCs), collect a vast amount of data, making it impractical to save...
Low latency machine learning inference is vital for many high-speed imaging applications across various scientific domains. From analyzing fusion plasma [1] to rapid cell-sorting [2], there is a need for in-situ fast inference in experiments operating in the kHz to MHz range. External PCIe accelerators are often unsuitable for these experiments due to the associated data transfer overhead,...
Recent advancements in generative artificial intelligence (AI), including transformers, adversarial networks, and diffusion models, have demonstrated significant potential across various fields, from creative art to drug discovery. Leveraging these models in engineering applications, particularly in nanophotonics, is an emerging frontier. Nanophotonic metasurfaces, which manipulate light at...
Applications like high-energy physics and cybersecurity require extremely high throughput and low latency neural network (NN) inference. Lookup-table-based NNs address these constraints by implementing NNs purely as lookup tables (LUTs), achieving inference latency on the order of nanoseconds. Since LUTs are a fundamental FPGA building block, LUT-based NNs map to FPGAs easily. LogicNets (and...
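To make the LUT-based idea concrete (the bit widths, weights, and activation are illustrative choices, not taken from LogicNets), a neuron with few-bit inputs can be exhaustively enumerated into a lookup table that is simply read by index at inference time:

    # Enumerating a tiny quantized neuron into a lookup table.
    import itertools
    import numpy as np

    IN_BITS, N_INPUTS = 2, 3                            # three 2-bit unsigned inputs
    weights, bias = np.array([0.5, -1.0, 0.25]), 0.1    # arbitrary example values

    def neuron(x):
        # ReLU neuron followed by 2-bit output quantization.
        y = max(0.0, float(np.dot(weights, x)) + bias)
        return min(int(round(y)), 2 ** IN_BITS - 1)

    # 2**(IN_BITS * N_INPUTS) = 64 entries cover every possible input, so
    # inference is a single table lookup that maps naturally onto FPGA LUTs.
    lut = {x: neuron(np.array(x))
           for x in itertools.product(range(2 ** IN_BITS), repeat=N_INPUTS)}
    print(len(lut), lut[(3, 0, 2)])   # 64 entries; this input maps to 2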
Recent advancements in Vision-Language Models (VLMs) have enabled complex multimodal tasks by processing text and image data simultaneously, significantly enhancing the field of artificial intelligence. However, these models often exhibit biases that can skew outputs towards societal stereotypes, thus necessitating debiasing strategies. Existing debiasing methods focus narrowly on specific...
As machine learning (ML) increasingly serves as a tool for addressing real-time challenges in scientific applications, the development of advanced tooling has significantly reduced the time required to iterate on various designs. Despite these advancements in areas that once posed major obstacles, newer challenges have emerged. For example, processes that were not previously considered...
We develop an automated pipeline to streamline neural architecture codesign for physics applications, reducing the need for ML expertise when designing models for a novel task. Our method employs a two-stage neural architecture search (NAS) that accounts for hardware costs while enhancing these models, leading to the discovery of more hardware-efficient neural architectures. The global search...
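A toy sketch of hardware-aware search in this spirit (the search space, cost proxy, and accuracy placeholder are illustrative assumptions, not the pipeline itself): each candidate architecture is scored by task performance penalized by a hardware-cost proxy:

    # Illustrative hardware-aware architecture scoring; not the actual pipeline.
    import itertools
    import random

    SEARCH_SPACE = {"width": [8, 16, 32, 64], "depth": [1, 2, 3]}

    def hardware_cost(arch):
        # Proxy: total MACs of an MLP with `depth` hidden layers of `width`.
        sizes = [16] + [arch["width"]] * arch["depth"] + [5]
        return sum(a * b for a, b in zip(sizes[:-1], sizes[1:]))

    def validation_accuracy(arch):
        return random.random()   # placeholder for training/evaluating the candidate

    def objective(arch, lam=1e-4):
        return validation_accuracy(arch) - lam * hardware_cost(arch)

    candidates = [dict(zip(SEARCH_SPACE, v))
                  for v in itertools.product(*SEARCH_SPACE.values())]
    print(max(candidates, key=objective))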
Deep learning, particularly employing the U-Net architecture, has become pivotal in cardiology, facilitating detailed analysis of heart anatomy and function. The segmentation of cardiac images enables the quantification of essential parameters such as myocardial viability, ejection fraction, cardiac chamber volumes, and morphological features. These segmentation methods operate autonomously...
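As a small worked example of one such parameter (the masks and voxel size below are made up), ejection fraction can be computed directly from segmented left-ventricle volumes at end-diastole and end-systole:

    # Ejection fraction from toy segmentation masks; values are illustrative.
    import numpy as np

    VOXEL_VOLUME_ML = 0.001   # hypothetical voxel volume in millilitres

    def volume_ml(mask):
        # Chamber volume from a boolean segmentation mask of one frame.
        return float(np.count_nonzero(mask)) * VOXEL_VOLUME_ML

    ed_mask = np.zeros((128, 128, 10), dtype=bool); ed_mask[40:90, 40:90, :] = True
    es_mask = np.zeros((128, 128, 10), dtype=bool); es_mask[50:80, 50:80, :] = True

    edv, esv = volume_ml(ed_mask), volume_ml(es_mask)   # 25.0 ml, 9.0 ml
    ef = 100.0 * (edv - esv) / edv                      # ejection fraction = 64%
    print(f"EDV={edv:.1f} ml  ESV={esv:.1f} ml  EF={ef:.1f}%")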
The number of CubeSats launched for data-intensive applications is increasing due to the modularity and reduced cost these platforms provide. Consequently, there is a growing need for efficient data processing and compression. Tailoring onboard processing with Machine Learning to specific mission tasks can optimise downlink usage by focusing only on relevant data, ultimately reducing the...
This presentation showcases the Intel FPGA AI Suite alongside the revolutionary AI Tensor Blocks recently incorporated by Intel into its latest FPGA device families for deep learning inference. These innovative FPGA components bring real-time, low-latency, and energy-efficient processing to the forefront. They are supported by the inherent advantages of Intel FPGAs,...