23–24 Feb 2015
CERN
Europe/Zurich timezone
There is a live webcast for this event.

Session

Exploring EDA, Clustering and Data Preprocessing

23 Feb 2015, 13:30
31/3-004 - IT Amphitheatre (CERN)

Description

A short introduction, motivation and demonstration of how to move from tables full of numbers to a real description of data, identifying patterns and relations between variables to reach an optimal understanding. These lectures demonstrate the principles of exploratory data analysis, data preparation and visualisation.

  1. Vincent Alexander Croft (NIKHEF (NL))
    23/02/2015, 13:30
    An introduction to thinking in plots and searching for patterns. Understanding distributions, the interplay between variables of a single distribution and how to look at multiple variables. Principles are explained with an example data set and visualisation in the R programming language. Targeted audience: Anyone wishing to learn more about how to look at data. Data scientists in this context can be...
  2. Vincent Alexander Croft (NIKHEF (NL))
    24/02/2015, 14:30
    Building on the understanding of multivariate interactions from lecture one, this lecture aims to demonstrate the descriptive power of various clustering algorithms in Python, with a brief introduction to large-scale data preprocessing software such as Hadoop. Targeted audience: Anyone wishing to learn more about how to look at data. Data scientists in this context can be either physicists or...