10–14 Jul 2023
University of Washington
US/Pacific timezone

Real-time Fitting and Materials Characterization in Band-Excitation Piezoresponse Force Microscopy

10 Jul 2023, 19:00
2h
Oak Hall Denny Room

Oak Hall Denny Room

Speakers

Veronica Obute (Drexel University) Yael Passy

Description

Increased development and utilization of multimodal scanning probe microscopy (SPM) and spectroscopy techniques have led to an orders-of-magnitude increase in the volume, velocity, and variety of collected data. While larger datasets have certain advantages, practical challenges arise from their increased complexity including the extraction and analysis of actionable scientific information. In recent years, there has been an increase in the application of machine and deep learning techniques that use batching and stochastic methods to regularize statistical models to execute functions or aid in scientific discovery and interpretation. While this powerful method has been applied in a variety of imaging systems (e.g., SPM, electron microscopy, etc.), simplistic analysis alone takes on the order of weeks to months due to scheduling and IO overhead imposed by GPU and CPU based systems which limits streaming inference rates to speeds above 50ms. This latency precludes the possibility of real-time analysis in SPM techniques such as band-excitation piezoresponse force spectroscopy (BE PFM), where typical measurements of cantilever resonance occur at 64Hz.
One method to accelerate machine learning inference is to bring computational resources as close to the data acquisition source as possible to minimize latencies associated with I/O and scheduling. Therefore, we leverage the National Instruments PXI platform to establish a direct, peer-to-peer channel over PCIe between an analog-to-digital converter and a Xilinx field programmable gate array (FPGA). Through the LabVIEW FPGA design suite, we develop this FPGA-based pipeline using cantilever resonances acquired in BE PFM to conduct real-time prediction of the simple harmonic oscillator (SHO) fit. To accomplish this, we use hls4ml to compile a high-level synthesis (HLS) representation of the neural network. Once this HLS model is synthesized to a register transfer level description (RTL), we implement the design on the FPGAs programmable logic. The parallelizable nature of FPGAs allows for heavily pipelined neural network implementations to achieve latencies on the order of microseconds. We currently benchmark our implementation at 36us per inference with a fourier transformation accounting for an additional 330us. At the expense of FPGA resources, we overlap data acquisition with computation to enable continuous acquisition and processing of response data. Overall, this work provides a foundation for deploying on-sensor neural networks using specialty hardware for real-time analysis and control of materials imaging systems.

Authors

Alibek Kaliyev (Lehigh University) Amir Gholami Joshua Agar Martin Takáč Michael Mahoney Nhan Tran (Fermi National Accelerator Lab. (US)) Pedro Sales (Massachusetts Institute of Technology) Philip Coleman Harris (Massachusetts Inst. of Technology (US)) Rama Vasudevan (Oak Ridge National Laboratory) Ryan Forelli (Lehigh University) Seda Memik (Northwestern University) Shuyu Qin Stephen Jesse Veronica Obute (Drexel University) Yael Passy Yichen Guo

Presentation materials