Data mining, data analytics and big data

Europe/Zurich
31/3-004 - IT Amphitheatre (CERN)

31/3-004 - IT Amphitheatre

CERN

105
Show room on map
Antonio Romero Marin (Universidad de Oviedo (ES)), Maaike Limper (CERN), Manuel Martin Marquez (CERN)
Description
Various databases and storage systems are needed to store the large amounts of control, operation and monitoring data in order to run the LHC accelerator and its experiments. In this presentation we will show the Openlab projects within the IT-DB group that look at how to improve the use of database and data analytic technologies at CERN. The Openlab Data Analytics project aims to profit from that big amount of data to obtain valuable insights and knowledge that can be used to improve the exploitation and operation of the accelerators chain and systems at CERN. We will show some data analytic techniques and tools like data discovery and R, as well as how they can be used for several real CERN use cases. In addition, we’ll also present another Openlab project that has investigated the use of database technology for analysing the physics events produced by the LHC experiments. This project looked how physics analysis can be written in SQL and looked at Hadoop, scalable Postgres and RDBMS systems.
Slides
    • 16:00 17:00
      Data mining, data analytics and big data 1h
      Speakers: Antonio Romero Marin (Universidad de Oviedo (ES)), Maaike Limper (CERN), Manuel Martin Marquez (CERN)
      Video in CDS
    • 17:00 17:30
      Questions and discussion 30m