The issues with file uploads should now be solved. We're sorry for the inconvenience. More details here.

CERN School of Computing 2017

Europe/Zurich
Universidad Politécnica de Madrid

Universidad Politécnica de Madrid

Madrid, Spain
Catharine Noble (CERN) , Nikos Kasioumis (CERN) , Sebastian Lopienski (CERN)
Description

Applications are now closed.

Welcome to the CERN School of Computing, which this year takes us to the world-class city of Madrid in Spain.

The programme

We have an indepth programme of advanced, interesting and challenging computing topics planned, which will provide ECTS university credits upon successful completion of the CSC exam.

CSC 2017 will provide around 52 hours of lectures and hands-on exercises. The hands-on component includes projects and mini-challenges carried out in small groups and as individuals.

We will be covering the broad domain of data storage and management technologies, fundamental concepts of Physics Computing, and cover many complementary topics such as software engineering, computer architecture, computing security and networking. See the Programme for more details.

Other activities

However it's not all study; the social and sporting programme is also a vital part of the School. Madrid is famous for its festivals, culture and cuisine, and along with your new friends and colleagues at the CSC 2017, there will ample opportunities to explore and experience the local area.

Who can apply?

The CSC 2017 school is aimed at postgraduate (ie. minimum of Bachelor degree or equivalent) engineers or scientists, working at CERN or at other research institutes, with experience in particle physics, in computing or in related fields. We welcome applications from all nationalities.

CSC 2017 is organised in collaboration with the Universidad Politécnica de Madrid.

Important dates

  • Wednesday 10 May (midnight CET/GMT+1) - deadline for applications
  • By Friday 2 June - you will be informed of outcome of selections
  • Monday 19 June - latest date for School fees payment if you are accepted.
  • Sunday 27 August - afternoon/evening - student arrivals at Residencia Galdos, Madrid.
  • Saturday 9 September - morning - departure.

       

    • 11:00 19:00
      Registration - welcome 8h

      For all participants of the CSC:2016

    • 19:00 20:30
      Dinner 1h 30m Residencia Galdos

      Residencia Galdos

      Buffet in Restaurant

    • 08:00 08:05
      DAY TBC 5m
    • 08:15 08:45
      Bus to UPM 30m
    • 08:45 10:30
      Opening Ceremony

      Presentations from:

      • Deputy Director of SCK•CEN
      • VUB
      • Eckhard Elsen, CERN Director of Research & Computing
      • Frederic Hemmer, CERN IT Department Head
      • Alberto Pace, Director of CERN School of Computing
      • 08:45
        Introduction 5m
        Speaker: Sebastian Lopienski (CERN)
      • 08:50
        OPENING SPECIAL LECTURE 20m
      • 09:10
        OPENING SPECIAL LECTURE 20m
      • 09:30
        Physics at the LHC and its future 20m
        Speaker: Eckhard Elsen (CERN)
      • 09:50
        Some computing challenges for next generation LHC physics 20m
        Speaker: Frederic Hemmer (CERN)
      • 10:10
        CERN School of Computing 20m
        Speaker: Sebastian Lopienski (CERN)
    • 10:30 11:00
      Welcome Coffee 30m
    • 11:00 12:00
      Introduction to Physics Computing L1: Hadron Collider Physics 1h

      Here we will focus on the physics of particle collisions, theoretical aspects of the standard model of particle physics, its predictive power as well as its shortcomings. Experimental aspects such as collider facilities and modern particle physics experiments as well as example physics questions and corresponding data analyses will be discussed. Furthermore, the compute models with the resulting amount of recorded data and simulated Monte Carlo events will be described.

      Speaker: Arnulf Quadt (Georg-August-Universitaet Goettingen (DE))
    • 12:00 13:00
      Tools and Techniques L1: Introduction 1h

      To start, we discuss some of the characteristics of software projects for high energy physics, and some of the issues that arise when people want to contribute to them. This forms the framework for the Software Technologies Track. We then continue with a brief introduction to software engineering from the perspective of the individual contributor, both as a formal process and how it actually affects what you do. The lecture discusses several categories of tools & techniques you can use to make yourself more productive and effective. Continuous testing and documentation has proven to be important in producing high quality work, but it's often difficult to do; we discuss some available approaches. Many problems require specific tools and techniques to solve them effectively: We discuss the examples of performance tuning and memory access problems.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Tools and Techniques L2: Tools for Collaboration, Software Engineering Across the Project 1h

      HEP software is built by huge teams. How can this be done effectively, while still giving people satisfying tasks to perform? Large systems are a hard problem, and this lecture focuses on the current techniques for dealing with it. Use of central source control (e.g. SVN) has become routine, so we build on that understanding to discuss distributed systems like Git. We then discuss the larger area of release & distributing via a package versioning system (e.g. SVN and CMT) vs. a distributed build system (e.g. Git and CMake). Now that we've covered both individual and group work, we go back to the software engineering topics of the first lecture to see how these fit together. How does our individual work affect the ability of the entire project to proceed? What are tools and techniques that will improve both our individual work, and out contributions to the whole? We close with a summary of observations.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 15:00 16:00
      Tools and Techniques E1 1h

      The first three exercises provide some direct experience with the tools and techniques described in Lectures 1 and 2. Teams of two students will work together on through examples designed to show the strengths and weaknesses of various tools and approaches. This will be followed by small projects for additional development experience.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 16:00 16:15
      Updates 15m
    • 16:15 16:45
      Afternoon break 30m
    • 16:45 17:45
      Tools and Techniques E2 1h

      The first three exercises provide some direct experience with the tools and techniques described in Lectures 1 and 2. Teams of two students will work together on through examples designed to show the strengths and weaknesses of various tools and approaches. This will be followed by small projects for additional development experience.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 17:45 18:45
      Tools and Techniques E3 1h

      The first three exercises provide some direct experience with the tools and techniques described in Lectures 1 and 2. Teams of two students will work together on through examples designed to show the strengths and weaknesses of various tools and approaches. This will be followed by small projects for additional development experience.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 20:15 22:15
      Welcome Cocktail & Dinner 2h
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Introduction to Physics Computing L2: Digital Data, Simulation and Reconstruction in Modern Particle Physics Experiments 1h

      Here, a focus will be placed on specific detector sub-components and their data readout concepts as well as data reconstruction techniques, simulation techniques and analysis techniques.

      Speaker: Arnulf Quadt (Georg-August-Universitaet Goettingen (DE))
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Computer Architecture & Performance Tuning L1: Hardware vs. software vs. you – who’s the boss? 1h

      Considering the rise of complex many-core processors, a sufficient understanding of their architecture and of the applicable performance tuning opportunities has become an indispensable element of software development.

      In this lecture, the anatomy of a modern PC (x86) server is discussed, with a particular focus on the CPU. Multiple resources and strategies to achieve optimal performance exist, both in hardware and in software – but how to take control of them?

      Non-x86 architectures of growing importance, such as ARM and NVIDIA, will also be briefly covered. The lecture will be supported by relevant examples from the physics domain.

      Speaker: Andrzej Nowak
    • 11:45 12:45
      Computer Architecture & Performance Tuning L2: Architectural Details and Performance Studies 1h

      Although by using various tools we are often able to get a generous peek both inside hardware and software, drawing meaningful high-level conclusions is not always straightforward. More challenges come from accelerators and co-processors, where different computing paradigms take precedence, such as extreme data parallelism, different math capabilities, the importance of special languages as well as memory size and topology changes. The objective of this lecture is to bring the audience closer to "where it matters" in modern computing systems, and to identify important paths for high performance.

      Speaker: Andrzej Nowak
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Tools and Techniques E4 1h

      After the two-person teams acquire some experience with the development and release tools, we will group projects to demonstrate some of the real-world issues discussed in the lecture. Groups of two teams will first work together to create a functional release from individual sub-projects at various stages of completion to show the strengths and weaknesses of test and release tools. This is followed by a larger scale exercise with groups of teams.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 15:00 16:00
      Tools and Techniques E5 1h

      After the two-person teams acquire some experience with the development and release tools, we will group projects to demonstrate some of the real-world issues discussed in the lecture. Groups of two teams will first work together to create a functional release from individual sub-projects at various stages of completion to show the strengths and weaknesses of test and release tools. This is followed by a larger scale exercise with groups of teams.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 16:00 16:15
      Afternoon break 15m
    • 16:15 17:15
      Tools and Techniques E6 1h

      After the two-person teams acquire some experience with the development and release tools, we will group projects to demonstrate some of the real-world issues discussed in the lecture. Groups of two teams will first work together to create a functional release from individual sub-projects at various stages of completion to show the strengths and weaknesses of test and release tools. This is followed by a larger scale exercise with groups of teams.

      Speaker: Robert Jacobsen (Lawrence Berkeley National Lab. (US))
    • 17:30 17:45
      Bus to Boadilla 15m
    • 17:45 19:15
      Study time and/or daily sports program 1h 30m Boadilla

      Boadilla

    • 19:30 20:00
      Bus to Galdos 30m
    • 20:15 21:30
      Dinner 1h 15m
    • 21:30 22:45
      TUESDAY ICE-BREAKER 1h 15m
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Software Security L1: Introduction 1h

      First lecture starts with a definition of computer security and an explanation of why it is so difficult to achieve. The lecture highlights the importance of proper threat modelling and risk assessment. It then presents three complementary methods of mitigating threats: protection, detection, reaction; and tries to prove that security through obscurity is not a good choice.

      Speaker: Sebastian Lopienski (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Computer Architecture & Performance Tuning E1 1h

      The aim of the exercises in this series is to give the attendees a practical introduction to performance oriented programming on Linux. Advanced tools will be used during the course, enabling the participants to discover how the interaction of the code and the hardware influences performance. The participants will also be given the task of correlating performance figures with certain programming decisions. In addition, the participants will understand the limits of performance optimization and the ways to establish at which point inside those limits their workload is placed. The exercises will be supported by demonstrating real world problems in production environments, including multi-threaded examples.

      Speaker: Andrzej Nowak
    • 11:45 12:45
      Software Security L2: Security in different phases of software development 1h

      The second lecture addresses the following question: how to create secure software? It introduces the main security principles (like least-privilege, or defense-in-depth) and discusses security in different phases of the software development cycle. The emphasis is put on the implementation part: most common pitfalls and security bugs are listed, followed by advice on best practice for security development.

      Speaker: Sebastian Lopienski (CERN)
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Software Security E1 1h

      In the practice session, a range of typical security vulnerabilities will be presented. The goal is to learn how they can be exploited (for privilege escalation, data confidentiality compromise etc.), how to correct them, and how to avoid them in the first place! Students will be given small pieces of source code in different programming languages, and will be asked to find vulnerabilities and fix them. The online course documentation will gradually reveal more and more information to help students in this task. Additionally, students will have a chance to try several source code analysis tools, and see how such tools can help them find functionality bugs and security vulnerabilities.

      Speaker: Sebastian Lopienski (CERN)
    • 15:00 16:00
      Software Security E2 1h

      In the practice session, a range of typical security vulnerabilities will be presented. The goal is to learn how they can be exploited (for privilege escalation, data confidentiality compromise etc.), how to correct them, and how to avoid them in the first place! Students will be given small pieces of source code in different programming languages, and will be asked to find vulnerabilities and fix them. The online course documentation will gradually reveal more and more information to help students in this task. Additionally, students will have a chance to try several source code analysis tools, and see how such tools can help them find functionality bugs and security vulnerabilities.

      Speaker: Sebastian Lopienski (CERN)
    • 16:00 16:15
      Afternoon break 15m
    • 16:15 17:15
      Software Security E3 1h

      In the practice session, a range of typical security vulnerabilities will be presented. The goal is to learn how they can be exploited (for privilege escalation, data confidentiality compromise etc.), how to correct them, and how to avoid them in the first place! Students will be given small pieces of source code in different programming languages, and will be asked to find vulnerabilities and fix them. The online course documentation will gradually reveal more and more information to help students in this task. Additionally, students will have a chance to try several source code analysis tools, and see how such tools can help them find functionality bugs and security vulnerabilities.

      Speaker: Sebastian Lopienski (CERN)
    • 17:30 19:00
      Study time and/or daily sports program 1h 30m
    • 19:15 19:45
      Bus to Galdos 30m
    • 20:15 21:15
      Dinner 1h Residencia Galdos

      Residencia Galdos

    • 21:15 22:15
      Special Francois evening talk? 1h
    • 07:15 07:20
      DAY TBC 5m
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Software Design L1: Physics and Computing Challenges to Experiment Software 1h

      Even though the miniaturization of transistors on chips continues like predicted by Moore's law, computer hardware starts to face scaling issues, so-called performance 'walls'. Probably, the best known is the 'power wall', which limits clock frequencies. Amongst others, a way of increasing processor performance remains now to integrate many cores in the same chip. At the same time, the upcoming LHC upgrade will increase the required CPU power drastically. Both problems challenge the current way of software design in high energy physics (HEP). Developers in high energy physics are forced to re-think their ways of software design and need to move to massively parallel applications. This lecture will explain the current HEP software design, the hardware and physics issues that need to be tackled, and possible approaches to achieve the required level of parallelization.

      Speaker: Enric Tejedor Saavedra (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Software Design L2: Concurrent Programming in Action 1 1h

      This and the following lecture will explain the concepts behind various parallelization methodologies.
      First, a theoretical introduction into threads, thread-safety and concurrent data access will be given. As the new C++ standard (C++11) now provides build-in support for parallel programming, the new features of this standard will be shown. Finally, concrete solutions for the theoretical problems will be discussed.

      Speaker: Danilo Piparo (CERN)
    • 11:45 12:45
      Software Security L3: Web application security, exercise debriefing 1h

      This third hour consists of a debriefing of the exercises, and in particular those web-related. Various vulnerabilities typical to web applications (such as Cross-site scripting, SQL injection, cross-site request forgery etc.) are introduced and discussed.

      Speaker: Sebastian Lopienski (CERN)
    • 13:00 14:00
      Lunch 1h
    • 14:00 20:30
      Special Sports afternoon 6h 30m Outside Madrid

      Outside Madrid

    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Software Design L3: Concurrent Programming in Action 2 1h

      The focus of this lecture lies in concurrent programming based on the 'task model', using TBB as implementation library. There will be a deeper look into concurrent data access and lock and lock-free data formats. Using the learned concepts, we will look again at the data challenges from lecture 1 and see, how a future-proof software design might look like.

      Speaker: Danilo Piparo (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Software Design L4: Patterns for Parallel Software Development 1h

      This lecture will present a set of common patterns in parallel programming. The sequential origin of these patterns will be discussed, as well as the restrictions that they impose. A particularly successful combination of patterns, Map-Reduce, will be described in detail and examples of its everyday use at large scale will be given. On the other hand, it will be shown how high-level features like C++ lambdas, the TBB library or the Spark framework can help get started with the aforementioned parallel patterns.

      Speaker: Enric Tejedor Saavedra (CERN)
    • 11:45 12:45
      Data Technologies L1: Setting the scene: Storage technologies 1h

      The lecture presents the various Storage Models, and the supporting management techniques including Name Servers and interfaces for Data Management.

      Storage Reliability and performance
      The lecture will also discuss the various solution to ensure long data preservation and reliability with the consequences on performance, including when using Peer to Peer Storage and data transfers.

      Speaker: Alberto Pace (CERN)
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Data Technologies L2: Cryptography, authentication authorization and accounting 1 1h

      These lectures give elements of computer security that are relevant to data management. The lectures address the various technologies used in data storage systems to ensure data encryption, integrity, confidentiality and access control

      Speaker: Alberto Pace (CERN)
    • 15:00 16:00
      Software Design E1 1h

      The exercises will cover the topics of lectures 1 to 4 at a hands-on basis, based on C++11, TBB and Spark. It covers examples for the new C++11 functionality related to threads and thread safety. In addition, there will be examples for concurrent access to data, lock and lock-free data structures, and task based programming.  Finally, there will be an exercise to practise the Map-Reduce pattern by using the Spark parallel data processing framework.

      Speakers: Danilo Piparo (CERN) , Enric Tejedor Saavedra (CERN)
    • 16:00 16:15
      Afternoon break 15m
    • 16:15 17:15
      Software Design E2 1h

      The exercises will cover the topics of lectures 1 to 4 at a hands-on basis, based on C++11, TBB and Spark. It covers examples for the new C++11 functionality related to threads and thread safety. In addition, there will be examples for concurrent access to data, lock and lock-free data structures, and task based programming.  Finally, there will be an exercise to practise the Map-Reduce pattern by using the Spark parallel data processing framework.

      Speakers: Danilo Piparo (CERN) , Enric Tejedor Saavedra (CERN)
    • 17:30 17:45
      Bus to Boadilla 15m
    • 17:45 19:15
      Study time and/or daily sports program 1h 30m
    • 19:30 20:00
      Bus to Galdos 30m
    • 20:15 21:15
      Dinner 1h
    • 21:15 00:15
      Disco-time with DJ Andrzej 3h Residencia Galdos

      Residencia Galdos

    • 08:00 08:05
      DAY TBC 5m
    • 08:30 08:35
      Ciudad Universitaria 5m
    • 09:00 10:00
      Data Technologies L3: Cryptography, authentication authorization and accounting 2 1h

      These lectures give elements of computer security that are relevant to data management. The lectures address the various technologies used in data storage systems to ensure data encryption, integrity, confidentiality and access control.

      Speaker: Alberto Pace (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Computer Architecture & Performance Tuning E2 1h

      The aim of the exercises in this series is to give the attendees a practical introduction to performance oriented programming on Linux. Advanced tools will be used during the course, enabling the participants to discover how the interaction of the code and the hardware influences performance. The participants will also be given the task of correlating performance figures with certain programming decisions. In addition, the participants will understand the limits of performance optimization and the ways to establish at which point inside those limits their workload is placed. The exercises will be supported by demonstrating real world problems in production environments, including multi-threaded examples.

      Speaker: Andrzej Nowak
    • 11:45 12:45
      Computer Architecture & Performance Tuning E3 1h
      Speaker: Andrzej Nowak
    • 13:00 14:00
      Lunch (where?) 1h
    • 14:00 20:00
      Free time 6h
    • 20:15 21:15
      Dinner 1h Residencia Galdos

      Residencia Galdos

    • 09:00 21:30
      Day trip excursion 12h 30m Toledo

      Toledo

    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Data Analysis L1: Introduction to data analysis and Monte Carlo method 1h

      First lecture in Data Analysis series discuss graphical techniques used in exploratory data analysis, gives an introduction to concept of probability, and descriptive statistics summarizing the basic features of the data gathered from experiments. Monte Carlo method is introduced and explained with examples from engineering and high energy physics.

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Data Analysis L2: Distributions and estimators 1h

      In this lecture commonly used probability distributions are introduced with basic properties and few examples. Parameter estimation with maximum likelihood and least-squared methods is explained.

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 11:45 12:45
      Data Analysis L3: Confidence intervals 1h

      Determining the errors on the parameters, which is equivalent to the confidence interval estimation is shown with specific examples on maximum likelihood and least-squared methods in one and more than one dimension. Uncertainties in physics and error propagation are also discussed.

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Networking Performance L1: Internet Quality of Service options 1h

      This first topic discusses the status and the option to master and improve on the Quality of Service of the Internet. Indeed, modern scientific applications require fast transfers high-bit rate connections, as well as network predictability and high availability. On the other hand, the Internet historical technology is not naturally best suited to deterministic behaviour. This lecture explains the technical challenges and the range of options available to improve QoS guarantees in Internet-based networks.

      Speaker: Francois Fluckiger
    • 15:00 16:00
      Software Design E3 1h

      The exercises will cover the topics of lectures 1 to 4 at a hands-on basis, based on C++11, TBB and Spark. It covers examples for the new C++11 functionality related to threads and thread safety. In addition, there will be examples for concurrent access to data, lock and lock-free data structures, and task based programming.  Finally, there will be an exercise to practise the Map-Reduce pattern by using the Spark parallel data processing framework.

      Speakers: Danilo Piparo (CERN) , Enric Tejedor Saavedra (CERN)
    • 16:00 16:15
      Afternoon break 15m
    • 16:15 17:15
      Data Analysis E1: Monte Carlo method and Fitting with Root 1h

      • Generating random numbers
      • Monte-Carlo toy experiments
      • Modeling signal and background. 
      • Fitting with ROOT packages (finding peaks).

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 17:30 17:45
      Bus to Boadilla 15m
    • 17:45 19:15
      Study time and/or daily sports program 1h 30m Boadilla

      Boadilla

    • 19:30 20:00
      Bus to Galdos 30m
    • 20:15 21:15
      Dinner 1h
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Data Analysis L4: Statistical tests 1h

      Hypothesis testing is introduced with examples of goodness-of-fit tests and the most recent examples from high energy physics. Particular emphasis is given on the p-values and when we claim the discoveries.

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Data Technologies L4: Additional component for Data Replication, Caching, Monitoring, Alarms and Quota 1 1h

      This lecture describes the various possible technologies used to implement data workflows and complex data transfer processes. It also discusses problems with data caching and Garbage Collection to conclude on monitoring and quota enforcement.

      Speaker: Alberto Pace (CERN)
    • 11:45 12:45
      Data Technologies L6: Distributed Hash Table in Cloud Storage 1h

      This last lecture addresses scalability of requirements of cloud storage in very large data sets and explains the techniques used to guarantee high efficiency in accessing data.

      Speaker: Alberto Pace (CERN)
    • 12:45 12:50
      CSC 2017 School Photo 5m
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Networking Performance L2: Multimedia over the Internet 1h

      The Internet is not only a network of computer resources but also a network of people cooperating to use these resources, in particular in professional scientific environment. Part of the collaborative tools scientists are increasingly using include audio and video systems. They place new challenging requirements on the networking systems. The class discusses these requirements and their consequences on the end-systems as well as within the underlying network.

      Speaker: Francois Fluckiger
    • 15:00 16:00
      Data Analysis E2: Confidence interval 1h

      • Finding errors on fit parameters 
      • Extracting confidence intervals

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 16:00 16:15
      Afternoon break 15m
    • 16:15 17:15
      Data Analysis E3: Hypothesis testing 1h

      • Finding p-value
      • Converting p-values to significance
      • Low count experiments and hypothesis testing

      Speaker: Ivica Puljak (Technical University of Split FESB)
    • 17:30 19:00
      Study time and/or daily sports program 1h 30m
    • 19:15 19:45
      Bus to Galdos 30m
    • 20:15 21:15
      Dinner 1h
    • 07:15 07:20
      DAY TBC 5m
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Data Technologies E1 1h

      The first part of hands-on exercises aims to improve understanding of basic parameters in IO systems:
      • network and media latency
      • access patterns
      • OS caching
      • bottlenecks and optimization strategies for local and remote data access.
      Few essential Linux tools will be introduced to monitor and measure IO performance avoiding bias introduced by OS caching. Students will experience and measure the impact of latency and access patterns on IO performance.
      The second part covers the concept of parallelism and redundancy in storage system. We will apply the technology of Cloud storage systems to store and retrieve files in our local desktop cluster using a distributed hash table to locate files or file fragments and a REST interface to do GET, PUT or DELETE operations on these.
      The exercises conclude with the implementation and performance tuning of a RAID verification algorithm.

      Speaker: Andreas Joachim Peters (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Multivariate Classification L1: Traditional Techniques 1h

      The aim of this lecture is to make the audience aware of multivariate classification (MVC) methods. Commonly used classification methods are introduced and the fundamental concepts behind them are explained. In particular boosted decision trees and artificial neural networks are discussed in detail.

      Speaker: Thomas Keck (KIT)
    • 11:45 12:45
      Multivariate Classification L2: Deep Learning 1h

      The aim of this lecture is to make the audience aware of multivariate classification (MVC) methods. Commonly used classification methods are introduced and the fundamental concepts behind them are explained. In particular boosted decision trees and artificial neural networks are discussed in detail.

      Speaker: Thomas Keck (KIT)
    • 13:00 14:00
      Lunch 1h
    • 14:00 15:00
      Data Technologies L5: Additional component for Data Replication, Caching, Monitoring, Alarms and Quota 2 1h

      This lecture describes the various possible technologies used to implement data workflows and complex data transfer processes. It also discusses problems with data caching and Garbage Collection to conclude on monitoring and quota enforcement.

      Speaker: Alberto Pace (CERN)
    • 15:00 16:00
      Data Technologies E2 1h

      The first part of hands-on exercises aims to improve understanding of basic parameters in IO systems:
      • network and media latency
      • access patterns
      • OS caching
      • bottlenecks and optimization strategies for local and remote data access.
      Few essential Linux tools will be introduced to monitor and measure IO performance avoiding bias introduced by OS caching. Students will experience and measure the impact of latency and access patterns on IO performance.
      The second part covers the concept of parallelism and redundancy in storage system. We will apply the technology of Cloud storage systems to store and retrieve files in our local desktop cluster using a distributed hash table to locate files or file fragments and a REST interface to do GET, PUT or DELETE operations on these.
      The exercises conclude with the implementation and performance tuning of a RAID verification algorithm.

      Speaker: Andreas Joachim Peters (CERN)
    • 16:00 16:15
      Afternoon break 15m
    • 16:30 17:00
      Bus to Galdos 30m
    • 17:00 20:15
      Study/free time 3h 15m
    • 20:15 21:15
      Dinner 1h
    • 07:15 07:20
      DAY TBC 5m
    • 08:15 08:45
      Bus to UPM 30m
    • 09:00 10:00
      Data Technologies E3 1h

      The first part of hands-on exercises aims to improve understanding of basic parameters in IO systems:
      • network and media latency
      • access patterns
      • OS caching
      • bottlenecks and optimization strategies for local and remote data access.
      Few essential Linux tools will be introduced to monitor and measure IO performance avoiding bias introduced by OS caching. Students will experience and measure the impact of latency and access patterns on IO performance.
      The second part covers the concept of parallelism and redundancy in storage system. We will apply the technology of Cloud storage systems to store and retrieve files in our local desktop cluster using a distributed hash table to locate files or file fragments and a REST interface to do GET, PUT or DELETE operations on these.
      The exercises conclude with the implementation and performance tuning of a RAID verification algorithm.

      Speaker: Andreas Joachim Peters (CERN)
    • 10:00 10:15
      Updates 15m
    • 10:15 10:45
      Morning break 30m
    • 10:45 11:45
      Data Technologies E4 1h

      The first part of hands-on exercises aims to improve understanding of basic parameters in IO systems:
      • network and media latency
      • access patterns
      • OS caching
      • bottlenecks and optimization strategies for local and remote data access.
      Few essential Linux tools will be introduced to monitor and measure IO performance avoiding bias introduced by OS caching. Students will experience and measure the impact of latency and access patterns on IO performance.
      The second part covers the concept of parallelism and redundancy in storage system. We will apply the technology of Cloud storage systems to store and retrieve files in our local desktop cluster using a distributed hash table to locate files or file fragments and a REST interface to do GET, PUT or DELETE operations on these.
      The exercises conclude with the implementation and performance tuning of a RAID verification algorithm.

      Speaker: Andreas Joachim Peters (CERN)
    • 11:45 12:45
      Data Technologies E5 1h

      The first part of hands-on exercises aims to improve understanding of basic parameters in IO systems:
      • network and media latency
      • access patterns
      • OS caching
      • bottlenecks and optimization strategies for local and remote data access.
      Few essential Linux tools will be introduced to monitor and measure IO performance avoiding bias introduced by OS caching. Students will experience and measure the impact of latency and access patterns on IO performance.
      The second part covers the concept of parallelism and redundancy in storage system. We will apply the technology of Cloud storage systems to store and retrieve files in our local desktop cluster using a distributed hash table to locate files or file fragments and a REST interface to do GET, PUT or DELETE operations on these.
      The exercises conclude with the implementation and performance tuning of a RAID verification algorithm.

      Speaker: Andreas Joachim Peters (CERN)
    • 12:45 13:30
      Lunch 45m
    • 13:30 14:30
      Multivariate Classification E1 1h
      Speaker: Thomas Keck (KIT)
    • 14:30 15:30
      Multivariate Classification E2 1h
      Speaker: Thomas Keck (KIT)
    • 15:30 16:00
      Afternoon break 30m
    • 16:00 17:30
      CSC Examination 1h 30m
    • 17:30 19:00
      Traditional CSC football match 1h 30m
    • 19:00 19:30
      Bus to Galdos 30m
    • 20:15 21:15
      Dinner 1h
    • 07:00 07:05
      DAY TBC 5m
    • 08:00 08:30
      Bus to UPM 30m
    • 09:30 10:30
      Presentations by Students 1h
      • From neutron to data, what's inside a detector? 10m

        The European Spallation Source (ESS) will be the world's brightest neutron source when it's finished in Lund 2025. The neutrons used for experiments needs detectors to make them useful. Boron-10 is a good candidate to detect the otherwise hard to see neutrons.

        Speaker: Emil Rofors (Lund University / ESS)
      • Tools for Reproducible Computational Science 10m

        One of the prerequisites of reproducible results in computational science is the accessibility of research code. I will give a brief and opinionated survey of tools that can help modern researchers, in particular jupyter notebooks, cookiecutter-data-science and Zenodo.

        Speaker: Valentin Volkl (University of Innsbruck (AT))
      • Introduction to cloud modelling languages 10m

        What are cloud modelling languages and what are they used for. What cloud modelling languages exist and what are the differences between them.

        Speaker: Mr. Merlijn Sebrechts
      • Cognitive image processing through dedicated Hardware 10m

        The challenge of track reconstruction at trigger level in high energy physics experiments led to the design of cognitive image process algorithms. These algorithm can be implemented using a custom hardware able to perform parallel pattern recognition. The application of this approach on tracking and on images will be presented.

        Speaker: Nicolo Vladi Biesuz (Universita di Pisa & INFN (IT))
    • 10:30 10:45
      Updates 15m
    • 10:45 11:15
      Morning break 30m
    • 11:15 12:15
      Presentations by Students 2 1h
      • Creating effective data visualizations 10m

        I would like to suggest the addition of a CSC section on data visualization principles (along with human perception), what makes an effective data visualization, different types of data visualizations and approaches to visualizing multivariate data, and how to build your own visualizations using D3.

        Speaker: Eamonn James Maguire (CERN)
      • Unfolding - How to remove detector effects 10m

        When performing a measurement, it is sometimes crucial to compare our results with complementary measurements from independent experiments. However, different experiments use different instruments/detectors, meaning that detector effects can smear our results. In order to compare two different measurements, they have to be rid of detector effects. I give a short overview how this is done (e.g. in HEP).

        Speaker: Matic Lubej (Jozef Stefan Institute)
      • The Basis of the Kalman Filter 10m

        The Kalman filter is a tool to combine information in the presence of uncertainty. The talk will give a quick introduction to the most basic principles used.

        Speaker: Markus Prim (CERN)
      • Thoughts On Data Preservation 10m

        Some of our lecturers have mentioned data preservation. What does that mean and is this a good idea? If we can agree that we should preserve our data, how do we do it? What challenges are involved? What are we already doing?

        Speaker: Frank Olaf Berghaus (University of Victoria (CA))
    • 12:15 13:00
      Lunch 45m
    • 13:00 15:00
      SPECIAL GUEST LECTURES 2h
    • 15:00 15:30
      Afternoon break 30m
    • 15:30 16:30
      Graduation ceremony 1h
      Speaker: Sebastian Lopienski (CERN)
    • 16:30 17:00
      Closing Session 30m
    • 17:00 18:30
      Free time 1h 30m
    • 18:30 19:00
      Bus to somewhere 30m
    • 19:00 20:30
      Closing Dinner 1h 30m
    • 20:30 00:00
      After-dinner PARTY 3h 30m
    • 08:15 11:30
      Departure 3h 15m
Your browser is out of date!

Update your browser to view this website correctly. Update my browser now

×