29 July 2015 to 6 August 2015
World Forum
Europe/Amsterdam timezone

Pull-validation: A resampling method to improve the usage of low-statistics datasets

4 Aug 2015, 16:00
1h
Amazon Foyer Terrace (World Forum)

Amazon Foyer Terrace

World Forum

Churchillplein 10 2517 JW Den Haag The Netherlands
Board: 263
Poster contribution DM-EX Poster 3 DM and NU

Speaker

Jan Luenemann (Vrije Universiteit Brussel)

Description

In high energy physics many background dominated analyses suffer from limited statistics in simulation: With increasing efficiency of the event selection the simulated samples are reduced so that in many cases the event number at final analysis level is very low. Due to limited computational resources the production of more simulation is not always feasible. In this cases it is helpful to extract more information from the available simulated data sets. One way to deal with this issue in multivariate analyses (MVA) can be achieved by using resampling methods: The MVA is trained many times on small subsets that are randomly resampled from the complete dataset. The variation of the MVA output between the trainings can be interpreted as probability density function (PDF) for each event. This PDF can be used to calculate a weight that is applied to each event instead of making a binary cut decision. With this procedure events that were normaly removed by the event selection can still contribute to the final dataset with a small weight. Another advantage is that pull-validation also provides an estimator for the uncertainty of the multivariate method. As an example of how the method can be used, we present a case-scenario from searches for physics beyond the Standard Model with IceCube.
Registration number following "ICRC2015-I/" 351

Authors

Anna Obertacke (Universität Wuppertal) Florian Scheriau (Technische Universität Dortmund) Jan Kunnen (Vrije Universiteit Brussel) Jan Luenemann (Vrije Universiteit Brussel)

Presentation materials

There are no materials yet.