18–22 Jan 2016
UTFSM, Valparaíso (Chile)
Chile/Continental timezone

Density Estimation Trees as fast non-parametric modelling tools

19 Jan 2016, 15:45
25m

Avenida España 1680, Valparaíso Chile
Oral Data Analysis - Algorithms and Tools Track 2

Speaker

Lucio Anderlini (Universita e INFN, Firenze (IT))

Description

Density Estimation Trees (DETs) are decision trees trained on a multivariate dataset to estimate its probability density function. While not competitive with kernel techniques in terms of accuracy, they are very fast, embarrassingly parallel, and relatively small when stored to disk. These properties make DETs appealing in the resource-constrained context of LHC data analysis. Possible applications include selection optimization, fast simulation, and fast detector calibration. In this contribution I describe the basics of the algorithm and a hybrid, multi-threaded implementation relying on RooFit for the training and on plain C++ for the evaluation of the density estimate. A set of applications under discussion within the LHCb Collaboration is also briefly illustrated.
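
To make the idea concrete, here is a minimal sketch of a density estimation tree in plain Python. It is not the RooFit/C++ implementation presented in the talk. It assumes the commonly used greedy criterion in which a leaf holding n of the N training points in a cell of volume V contributes an error of -(n/N)^2 / V, and it omits pruning and cross-validation. All names (`build`, `evaluate`, `leaves`, `min_leaf`) are hypothetical and chosen for illustration only.

```python
import random


def leaf_error(n, n_total, volume):
    # Assumed split criterion: integrated-squared-error surrogate of a leaf.
    return -((n / n_total) ** 2) / volume


def box_volume(lo, hi):
    v = 1.0
    for a, b in zip(lo, hi):
        v *= b - a
    return v


def build(points, lo, hi, n_total, min_leaf=50):
    """Greedily grow a tree over the box [lo, hi], which must contain all points."""
    n = len(points)
    vol = box_volume(lo, hi)
    best_err = leaf_error(n, n_total, vol)
    best = None
    for d in range(len(lo)):
        values = sorted(p[d] for p in points)
        # Try every cut that leaves at least min_leaf points on each side.
        for i in range(min_leaf, n - min_leaf + 1):
            if values[i - 1] == values[i]:
                continue  # a cut here could not separate tied values
            cut = 0.5 * (values[i - 1] + values[i])
            frac = (cut - lo[d]) / (hi[d] - lo[d])
            err = (leaf_error(i, n_total, vol * frac)
                   + leaf_error(n - i, n_total, vol * (1.0 - frac)))
            if err < best_err:
                best_err, best = err, (d, cut)
    if best is None:
        # Leaf: constant density = fraction of points / cell volume.
        return {"density": n / (n_total * vol), "volume": vol}
    d, cut = best
    left_hi = tuple(cut if k == d else h for k, h in enumerate(hi))
    right_lo = tuple(cut if k == d else l for k, l in enumerate(lo))
    left_pts = [p for p in points if p[d] < cut]
    right_pts = [p for p in points if p[d] >= cut]
    return {
        "dim": d,
        "cut": cut,
        "left": build(left_pts, lo, left_hi, n_total, min_leaf),
        "right": build(right_pts, right_lo, hi, n_total, min_leaf),
    }


def evaluate(node, x):
    """Walk from the root to the leaf containing x and return its density."""
    while "density" not in node:
        node = node["left"] if x[node["dim"]] < node["cut"] else node["right"]
    return node["density"]


def leaves(node):
    if "density" in node:
        yield node
    else:
        yield from leaves(node["left"])
        yield from leaves(node["right"])


# Toy usage: 2000 points drawn from a one-dimensional standard normal.
random.seed(0)
data = [(random.gauss(0.0, 1.0),) for _ in range(2000)]
lo = (min(p[0] for p in data),)
hi = (max(p[0] for p in data),)
tree = build(data, lo, hi, n_total=len(data))
```

Evaluation only walks from the root to a single leaf, and independent query points can be evaluated in parallel. Because every split allowed by `min_leaf` lowers the assumed training error (or leaves it unchanged), a practical implementation would add pruning or cross-validation to limit over-training.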

Author

Lucio Anderlini (Universita e INFN, Firenze (IT))
