July 28, 2020 to August 6, 2020
virtual conference
Europe/Prague timezone

Conclusions from TrackML the HEP Tracking Machine Learning challenge

Jul 31, 2020, 9:40 AM

Talk 14. Computing and Data Handling


Andreas Salzburger (CERN)


At the HL-LHC, ATLAS and CMS will see proton bunch collisions with track multiplicities of up to 10,000 charged tracks per event. To engage the computer science community in contributing new algorithmic ideas, we organized a Tracking Machine Learning challenge (TrackML). Participants were provided with events of 100k 3D points and asked to group the points into tracks; they were also given a 100 GB training dataset including the ground truth. The challenge ran in two phases. The first, "Accuracy", phase ran on the Kaggle platform from May to August 2018; algorithms were judged only on a score related to the fraction of correctly assigned hits. The second, "Throughput", phase ran from September 2018 to March 2019 on Codalab and required code submission; algorithms were then ranked by combining accuracy and speed. The first phase drew 653 participants, and the top performers used innovative approaches (see arXiv:1904.06778). The second phase has recently finished and featured some astonishingly fast solutions, which have subsequently been analysed. The talk will report on the lessons from the TrackML challenge and on perspectives.
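
To make the idea of "a score related to the fraction of correctly assigned hits" concrete, the following is a minimal, unweighted sketch. It is not the official challenge metric: the function name `simple_hit_score`, the input layout (one true particle id and one predicted track label per hit), and the matching rule are all assumptions chosen for illustration.

```python
from collections import Counter, defaultdict


def simple_hit_score(truth_ids, predicted_ids):
    """Return the fraction of hits counted as correctly assigned.

    truth_ids[i]     -- true particle id of hit i (from the ground truth)
    predicted_ids[i] -- track label a participant assigned to hit i

    Hypothetical matching rule (an assumption, not the challenge definition):
    a predicted track is matched to its most frequent true particle only if
    the shared hits are more than half of the track AND more than half of
    the particle. Only the shared hits of matched tracks count as correct.
    """
    if len(truth_ids) != len(predicted_ids):
        raise ValueError("need one truth id and one predicted id per hit")
    if not truth_ids:
        return 0.0

    # Collect the true particle ids of the hits inside each predicted track.
    tracks = defaultdict(list)
    for particle, track in zip(truth_ids, predicted_ids):
        tracks[track].append(particle)

    # Number of hits each true particle left in the event.
    particle_sizes = Counter(truth_ids)

    correct = 0
    for particles in tracks.values():
        majority, n_shared = Counter(particles).most_common(1)[0]
        covers_track = 2 * n_shared > len(particles)
        covers_particle = 2 * n_shared > particle_sizes[majority]
        if covers_track and covers_particle:
            correct += n_shared
    return correct / len(truth_ids)


if __name__ == "__main__":
    truth = [1, 1, 1, 2, 2, 2]
    predicted = [10, 10, 10, 20, 20, 30]
    print(simple_hit_score(truth, predicted))
```

In this toy event, particle 2 is split across two predicted tracks, so the stray hit in track 30 is not counted; merging all hits into a single track would score zero under this rule.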

Primary authors

David Rousseau (IJCLab-Orsay)
Jean-Roch Vlimant (California Institute of Technology (US))
Vincenzo Innocente (CERN)
Andreas Salzburger (CERN)
Isabelle Guyon
Dr Laurent Basara (LAL/LRI, Université Paris Saclay)
Sabrina Amrouche (Université de Geneve (CH))
Tobias Golling (Universite de Geneve (CH))
Moritz Kiehn (Universite de Geneve (CH))
Yetkin Yilmaz (LAL-Orsay (FR))
Paolo Calafiura (Lawrence Berkeley National Lab. (US))
Steven Farrell (Lawrence Berkeley National Lab (US))
Heather Gray (UC Berkeley/LBNL)
Vladimir Gligorov (Centre National de la Recherche Scientifique (FR))
Mr Victor Estrade (LRI)
Cecile Germain (Universite Paris Sud)
Edward Moyse (University of Massachusetts (US))
Mikhail Hushchyn (Yandex School of Data Analysis (RU))
Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
