Techniques for statistical learning including variable transformation, variable selection, and classification are reviewed, with emphasis on low-dimensional datasets typical for physics analysis. The reviewed techniques cover principal component analysis (PCA), kernel PCA, multidimensional scaling, classification by decision trees and their ensembles, classification by kernel methods such as support vector machines, and supervised feature selection by sequential algorithms, ensembles of decision trees and nearest neighbors. Application of these techniques to real-world datasets is illustrated in MATLAB. A MATLAB primer is included in the course, and MATLAB trial licenses will be available to participants.
Attention: The agenda of the course has been changed! Please refer to the timetable to see the updated agenda! The course takes place in the afternoons of Monday, Tuesday, Thursday and Friday. On Wednesday afternoon there is no lecture!
We are happy to announce that the course will be accessible also via vidyo connection. This is possible also for participants that are not registered for the event. Please follow the dedicated link in the menu.