Computing in High Energy and Nuclear Physics (CHEP) 2012

Name: Computing in High Energy and Nuclear Physics (CHEP) 2012
Start: 2012-05-21T06:00:00-04:00
End: 2012-05-25T18:00:00-04:00
Location: New York City, NY, USA

21–25 May 2012

US/Eastern timezone

Support

chep2012@bnl.gov

A new development cycle of the Statistical Toolkit

24 May 2012, 13:30

4h 45m

Rosenthal Pavilion (10th floor) (Kimmel Center)

Poster

Software Engineering, Data Stores and Databases (track 5)

Poster Session

Mr Matej Batic (Jozef Stefan Institute)

The Statistical Toolkit is an open-source system specialized in the statistical comparison of distributions. It addresses requirements common to different experimental domains, such as simulation validation (e.g. comparison of experimental and simulated distributions), regression testing in software development and detector performance monitoring. The first development cycles provided a wide set of non-parametric goodness-of-fit tests for the so-called two-sample problem, i.e. the comparison of two distributions. The active use of the Statistical Toolkit in real-life applications, documented in the literature, has highlighted new requirements, which are addressed by a new development cycle. The new product includes extensions of the toolkit's functionality, refinements of existing algorithms and tools, and improved usability of the system. Various sets of statistical tests have been added to the existing collection to deal with the one-sample problem (i.e. the comparison of a data distribution to a function, including tests for normality), the comparison of two-dimensional distributions, categorical analysis and the estimation of randomness. Improved algorithms and software design contribute to the robustness of the results. A simple user layer dealing with primitive data types and an improved ROOT user layer facilitate the use of the toolkit both in standalone analyses and in large-scale experiments. An interface to the R package extends the native functionality of the toolkit. An overview of the new developments is presented, along with applications to concrete experimental scenarios.
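
As a rough illustration of the two-sample problem mentioned above, the sketch below computes the Kolmogorov-Smirnov distance between two samples using only the Python standard library. It is not the Statistical Toolkit's API, and the abstract does not say which tests the toolkit implements; the function name `ks_two_sample_statistic` and the sample values are hypothetical, chosen only to show what a non-parametric comparison of two distributions can look like.

```python
from bisect import bisect_right


def ks_two_sample_statistic(sample_a, sample_b):
    """Return the two-sample Kolmogorov-Smirnov distance D.

    D is the largest absolute difference between the empirical
    cumulative distribution functions (ECDFs) of the two samples.
    """
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")
    a = sorted(sample_a)
    b = sorted(sample_b)
    n_a, n_b = len(a), len(b)
    d_max = 0.0
    # The ECDFs only change at observed values, so it is enough to
    # evaluate their difference at every point of the pooled sample.
    for x in a + b:
        cdf_a = bisect_right(a, x) / n_a  # fraction of a that is <= x
        cdf_b = bisect_right(b, x) / n_b  # fraction of b that is <= x
        d_max = max(d_max, abs(cdf_a - cdf_b))
    return d_max


# Hypothetical "measured" and "simulated" values, invented for illustration.
measured = [0.1, 0.4, 0.7, 1.2, 1.9]
simulated = [0.2, 0.5, 0.6, 1.1, 2.5]
distance = ks_two_sample_statistic(measured, simulated)
```

A small distance suggests the two samples are compatible with a common parent distribution. Turning the distance into a p-value requires the Kolmogorov distribution or a permutation procedure, which this sketch leaves out.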

Dr Alberto Ribon (CERN), Dr Andreas Pfeiffer (CERN), Dr Maria Grazia Pia (Universita e INFN (IT)), Mr Matej Batic (Jozef Stefan Institute)

Poster

st_chep_2012_poster.pdf
