Speaker
Sitong An
(CERN, Carnegie Mellon University (US))
Description
We report the latest development in ROOT/TMVA, a new system that takes trained ONNX deep learning models and emits C++ code that can be easily included and invoked for fast inference of the model, with minimal dependency. We present an overview of the current solutions for conducting inference in C++ production environment, discuss the technical details and examples of the generated code, and demonstrates its development status with a preliminary benchmark against popular tools.
Authors
Sitong An
(CERN, Carnegie Mellon University (US))
Lorenzo Moneta
(CERN)