Sixth Machine Learning in High Energy Physics Summer School 2020

Europe/Berlin
Zoom

Andrey Ustyuzhanin (Yandex School of Data Analysis (RU)), Lesya Shchutska (EPFL - Ecole Polytechnique Federale Lausanne (CH))
Description

The Sixth Machine Learning summer school, organised by Yandex School of Data Analysis, the Laboratory of Methods for Big Data Analysis of the National Research University Higher School of Economics, and the High Energy Physics Laboratory (LPHE) at EPFL, will be held at EPFL, Lausanne, Switzerland from the 16th to the 30th of July 2020.

The school will cover the relatively young area of data analysis and computational research that has started to emerge in High Energy Physics (HEP). It is known by several names, including “Multivariate Analysis”, “Neural Networks”, and “Classification/Clustering techniques”. In more generic terms, these techniques belong to the field of “Machine Learning”, which builds on research in statistics and has received a lot of attention from the data science community.

There are plenty of essential problems in high energy physics that can be solved using Machine Learning methods. These vary from online data filtering and reconstruction to offline data analysis.

Students of the school will receive a theoretical and practical introduction to this new field and will be able to apply the acquired knowledge to solve their own problems. Topics ranging from decision trees to deep learning and hyperparameter optimisation will be covered with concrete examples and hands-on tutorials. A special data-science competition will be organised within the school to give participants a better feel for real-life ML application scenarios.

The expected number of students for the school is about 60. The school is aimed at PhD students and postdoctoral researchers, but is also open to master's students.

Pre-requisites for participation

Upon completion of the school, participants will be able to:

  • formulate a HEP-related problem in ML-friendly terms;
  • select quality criteria for a given problem;
  • understand and apply principles of widely used classification models (e.g. boosting, bagging, BDT, neural networks, etc.) to practical cases;
  • optimise features and parameters of a given model in an efficient way under given restrictions;
  • select the best classifier implementation amongst a variety of ML libraries (scikit-learn, xgboost, deep learning libraries, etc.);
  • understand and apply principles of generative model design;
  • define & conduct reproducible data-driven experiments.

School materials

School materials are available as an online course.

Organisers

High Energy Physics Laboratory LPHE at EPFL

Sponsors

Thanks to our sponsors (CHIPP: Swiss Institute of Particle Physics, the Swiss Academy of Sciences, and the EPFL Doctoral School), we can provide a small subsidy for students who are not able to afford the full registration fee out of their own funds. Make sure you apply before the early registration deadline. See the registration fee page for details.

    • 09:00 09:30
      Welcome, opening words 30m

      Welcome by organizers, the structure and gist of the school.

      Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
    • 09:30 10:00
      Practice. Getting familiar with school infrastructure 30m
      Speakers: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU)), Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 10:00 10:15
      Break 15m
    • 10:15 11:30
      Intro into Machine Intelligence 1h 15m

      Overview of ML history, applications and outlook from a HEP perspective.

      Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
    • 11:30 12:45
      Section 1. Introduction into ML: Basics
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 11:30
        Introduction into supervised learning 30m

        Practical session

        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 12:00
        Seminar: Data Handling in Python 45m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
    • 12:45 13:30
      Lunch 45m
    • 13:30 14:30
      Section 1. Introduction into ML: Linear Regression
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 13:30
        Linear Regression 30m

        Linear regression. Analytical solution. Gradient descent. Numerical solution. Polynomial features.

        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 14:00
        Seminar 30m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
    • 14:30 15:00
      Break 30m
    • 15:00 16:00
      Section 1. Introduction into ML: Logistic regression-1
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 15:00
        Logistic regression-1 30m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 15:30
        Seminar 30m
    • 16:00 16:30
      Break 30m
    • 16:30 17:30
      Section 1. Introduction into ML: Logistic regression-2
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 16:30
        Logistic regression-2 30m

        Regularization of linear models. Probabilistic interpretation of linear models (regression and classification).

        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 17:00
        Seminar 30m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
    • 09:00 10:00
      Section 1. Introduction into ML: Quality Metrics-1
      Convener: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 09:00
        Quality Metrics-1 30m

        Quality metrics for classification and regression

        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 09:30
        Seminar: Quality Metrics-1 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 1. Introduction into ML: Quality Metrics-2
      Convener: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 10:30
        Quality Metrics-2 30m

        How to test your model. Cross-validation. Statistical model comparison.

        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 11:00
        Seminar: Quality Metrics-2 30m
        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 1. Introduction into ML: Decision trees-1
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 12:00
        Decision trees-1 30m

        Splitting rule. Classification and regression decision trees

        Speaker: Mr Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 12:30
        Seminar: Decision trees-1 30m
        Speaker: Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Section 1. Introduction into ML: Ensembles-1
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 14:00
        Ensembles-1 30m

        Bagging and Random Forest. Stacking and blending.

        Speaker: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 14:30
        Seminar: Ensembles-1 30m
        Speaker: Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 15:00 15:30
      Break 30m
    • 15:30 16:30
      Section 1. Introduction into ML: Ensembles-2
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 15:30
        Ensembles-2 30m

        Gradient boosting.

        Speaker: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 16:00
        Seminar: Ensembles-2 30m
        Speaker: Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 16:30 17:00
      Coopetition introduction (1) 30m
      Speaker: Mr Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 17:00 17:30
      Break 30m
    • 17:30 18:30
      Socialization: Open flow discussion
    • 09:00 10:00
      Section 1. Introduction into ML: Useful hacks.
      Convener: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 09:00
        Useful hacks 30m

        Feature engineering, importance and selection.

        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 09:30
        Seminar: Useful hacks. 30m
        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 1. Introduction into ML: Clustering-1
      Convener: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 10:30
        Clustering-1 30m

        Clustering. K-Means. Quality metrics for clustering

        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 11:00
        Seminar: Clustering-1 30m
        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 1. Introduction into ML: Clustering-2
      Convener: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 12:00
        Clustering-2 30m

        Hierarchical clustering and DBSCAN.

        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
      • 12:30
        Seminar: Clustering-2 30m
        Speaker: Mikhail Hushchyn (Yandex School of Data Analysis (RU))
    • 09:00 17:00
      Sunday 8h
    • 09:00 10:00
      Section 2. Introduction into Neural Networks: Intro to NN
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 09:00
        Intro to NN 20m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 09:20
        Seminar: Intro to NN 40m
        Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 2. Introduction into Neural Networks: Intro to Pytorch
      Convener: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
      • 10:30
        Intro to Pytorch 30m
        Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
      • 11:00
        Seminar: Pytorch practice 30m
        Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 2. Introduction into Neural Networks: CNN
      Convener: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
      • 12:00
        CNN 30m
        Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
      • 12:30
        Seminar: CNN 30m
        Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Section 2. Introduction into Neural Networks: Network regularization
      Convener: Artem Maevskiy (National Research University Higher School of Economics (RU))
      • 14:00
        Network regularization 30m
      • 14:30
        Seminar: Network regularization 30m
    • 15:00 15:30
      Break 30m
    • 15:30 16:30
      Section 2. Introduction to Neural Nets: Autoregressive networks
      Convener: Artem Ryzhikov (National Research University Higher School of Economics (RU))
      • 15:30
        Autoregressive networks 30m
      • 16:00
        Seminar: Autoregressive networks 30m
    • 16:30 17:00
      Break 30m
    • 17:00 18:30
      Guest lectures
      Convener: Tommaso Dorigo (Universita e INFN, Padova (IT))
    • 09:00 10:00
      Section 2. Introduction to Neural Nets: Autoregressive networks-2
      Convener: Artem Ryzhikov (National Research University Higher School of Economics (RU))
      • 09:00
        Autoregressive networks-2 30m
      • 09:30
        Seminar: Autoregressive networks-2 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 2. Introduction to Neural Nets: Network architectures: tips and tricks
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 10:30
        Network architectures: tips and tricks 40m
      • 11:10
        Seminar: Practice 20m
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 3. Bayesian Deep Learning: Intro
      Convener: Ekaterina Lobacheva
      • 12:00
        Intro 30m
      • 12:30
        Seminar: Intro 30m
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Section 3. Bayesian Deep Learning: Full Bayesian Inference
      Convener: Ekaterina Lobacheva
      • 14:00
        Full Bayesian Inference 30m
      • 14:30
        Seminar: Full Bayesian Inference 30m
    • 15:00 15:30
      Break 30m
    • 15:30 16:00
      Section 3. Bayesian Deep Learning: Bayesian linear regression
      Convener: Nadya Chirkova
    • 16:00 16:30
      Break 30m
    • 16:30 18:00
      Guest lectures
    • 09:00 10:00
      Section 3. Bayesian Deep Learning: Variational Inference
      Convener: Ekaterina Lobacheva
      • 09:00
        Variational Inference 30m
      • 09:30
        Seminar: Variational Inference 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 3. Bayesian Deep Learning: Gaussian Processes
      Convener: Nadya Chirkova
    • 11:30 12:00
      Break 30m
    • 12:00 13:30
      Section 3. Bayesian Deep Learning: Bayesian Neural Networks
      Convener: Nadya Chirkova
      • 12:00
        Bayesian Neural Networks - Introduction 30m
      • 12:30
        Bayesian Neural Networks - Training 30m
    • 13:30 14:30
      Lunch 1h
    • 14:30 15:30
      Section 3. Bayesian Deep Learning: Bayesian Neural Networks
      Convener: Nadya Chirkova
      • 14:30
        Seminar: Bayesian Neural Networks - Training 30m
      • 15:00
        Bayesian Sparsification of Neural Networks 30m
    • 15:30 16:00
      Break 30m
    • 16:00 16:30
      Introduction into the 2nd coopetition 30m
      Speaker: Artem Maevskiy (National Research University Higher School of Economics (RU))
    • 16:30 18:00
      Guest lectures
    • 09:00 10:00
      Section 3. Bayesian Deep Learning: VAE
      • 09:00
        VAE 30m
      • 09:30
        Seminar: VAE 30m
    • 10:00 10:30
      Section 3. Bayesian Deep Learning: Seminar: VAE
      Convener: Alexei Struminsky (Space Research Institute)
    • 10:30 11:00
      Break 30m
    • 11:00 11:30
      Section 4. Generative models and networks: Generative models
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 11:30 12:00
      Section 4. Generative models and networks: Seminar: Practice on basic generative models
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
    • 12:00 12:30
      Break 30m
    • 12:30 13:00
      Section 4. Generative models and networks: Introduction to distances Pt. 1
      Convener: Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 13:00 13:30
      Section 4. Generative models and networks: Seminar: Distances Pt. 1
      Convener: Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 13:30 14:30
      Lunch 1h
    • 14:30 15:00
      Section 4. Generative models and networks: Introduction to distances Pt. 2
      Convener: Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 15:00 15:30
      Section 4. Generative models and networks: Seminar: Distances Pt. 2
      Convener: Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 15:30 16:00
      Break 30m
    • 16:00 17:30
      Guest lectures
    • 09:00 10:00
      Section 4. Generative models and networks: Autoencoders
      Convener: Artem Ryzhikov (National Research University Higher School of Economics (RU))
      • 09:00
        Autoencoders 30m
      • 09:30
        Seminar: Practice on AE 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 4. Generative models and networks: GANs
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 10:30
        GANs 30m
      • 11:00
        Seminar: GANs 30m
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 4. Generative models and networks: Advanced GANs
      Convener: Nikita Kazeev (Yandex School of Data Analysis (RU))
      • 12:00
        Advanced GANs 30m
      • 12:30
        Seminar: Advanced GANs 30m
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Section 4. Generative models and networks: Flow models
      Convener: Artem Ryzhikov (National Research University Higher School of Economics (RU))
      • 14:00
        Flow models 30m
      • 14:30
        Seminar: Flow models 30m
    • 15:00 15:30
      Break 30m
    • 15:30 16:30
      Section 4. Generative models and networks: Invertible Generative Models
      • 15:30
        Invertible Generative Models 30m
      • 16:00
        Seminar: Invertible Generative Practice 30m
    • 16:30 17:00
      Break 30m
    • 17:00 18:30
      Guest lectures
    • 09:00 10:00
      Section 5. Advanced Optimization Methods: Introduction to black-box optimization
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 09:00
        Introduction to black-box optimization 30m
      • 09:30
        Seminar: Introduction to black-box optimization 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 5. Advanced Optimization Methods: Variational Optimization
      Convener: Mr Vladislav Belavin (Yandex School of Data Analysis (RU))
    • 11:30 12:00
      Break 30m
    • 12:00 13:30
      Guest lectures
    • 09:00 10:00
      Section 5. Advanced Optimization Methods: Bayesian Optimization
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 09:00
        Bayesian Optimization 30m
      • 09:30
        Seminar: Bayesian Optimization 30m
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section 5. Advanced Optimization Methods: BO-GP and friends-1
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 10:30
        BO-GP and friends 30m
      • 11:00
        Seminar: BO-GP and friends 30m
    • 11:30 12:00
      Break 30m
    • 12:00 13:00
      Section 5. Advanced Optimization Methods: BO-GP and friends-2
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 12:00
        BO-GP and friends 30m
      • 12:30
        Seminar: BO-GP and friends 30m
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:30
      Guest lectures
    • 15:30 16:00
      Break 30m
    • 16:00 17:00
      Socialization: Open flow discussion
    • 09:00 10:00
      Section X: Learning to pivot
      Convener: Maxim Borisyak (Yandex School of Data Analysis (RU))
      • 09:00
        Learning to pivot 20m
      • 09:20
        Seminar: Learning to pivot 40m
    • 10:00 10:30
      Break 30m
    • 10:30 12:30
      Section X: Deep Learning at scale
    • 12:30 13:30
      Lunch 1h
    • 13:30 14:30
      Section X: Interpretability
    • 14:30 15:00
      Break 30m
    • 15:00 16:30
      Guest lectures
      Convener: Michael Aaron Kagan (SLAC National Accelerator Laboratory (US))
    • 16:30 17:00
      Break 30m
    • 17:00 18:30
      Guest Lecture 1h 30m
      Speaker: Dr Michela Paganini (Facebook AI Research)
    • 09:00 10:00
      Section X: Anomaly Detection-1
      Convener: Denis Derkach (National Research University Higher School of Economics (RU))
    • 10:00 10:30
      Break 30m
    • 10:30 11:30
      Section X: Anomaly Detection-2
      Convener: Denis Derkach (National Research University Higher School of Economics (RU))
    • 11:30 12:00
      Break 30m
    • 12:00 13:30
      Guest lectures
    • 13:30 14:30
      Lunch 1h
    • 14:30 16:00
      Break 1h 30m
    • 16:00 17:30
      Guest lectures
      • 16:00
        Lunch 1h
    • 09:00 10:30
      Guest lectures
    • 10:30 11:30
      Break 1h
    • 11:30 13:00
      Socialization: Students' project presentations
    • 13:00 13:30
      Closing words 30m
      Speaker: Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))
    • 13:30 14:30
      Lunch 1h
    • 14:30 16:30
      Socialization: Follow-up (VR?) party