CERN School of Computing on IT Services 2024
from
Monday 4 November 2024 (08:30)
to
Friday 8 November 2024 (18:00)
Monday 4 November 2024
08:30
Welcome coffee
Welcome coffee
08:30 - 08:45
08:45
Welcome address from the IT department head
-
Enrica Maria Porcari
(
CERN
)
Welcome address from the IT department head
Enrica Maria Porcari
(
CERN
)
08:45 - 09:00
09:00
Opening Session
-
Alberto Pace
(
CERN
)
Opening Session
Alberto Pace
(
CERN
)
09:00 - 10:00
10:00
Opening Lecture: The need for IT Service in accelerator and particle physics
-
Sebastien Ponce
(
CERN
)
Opening Lecture: The need for IT Service in accelerator and particle physics
Sebastien Ponce
(
CERN
)
10:00 - 11:00
Starting from accelerator and particle physics, we'll try to see what are the needs of the experiments and accelerator people in terms of IT services.
11:00
Break
Break
11:00 - 11:30
11:30
Student self presentation
Student self presentation
11:30 - 12:30
12:30
Lunch
Lunch
12:30 - 13:30
13:30
Storage (part 1 of 2)
-
Abhishek Lekshmanan
(
CERN
)
Storage (part 1 of 2)
Abhishek Lekshmanan
(
CERN
)
13:30 - 14:30
This two-part lecture series provides an overview of the various storage services at CERN. We will look into the motivation behind our large scale storage systems, cover some fundamentals and the design principles of the storage systems we've developed and use. We will also look into some practical use cases covering the ecosystem of the many storage systems we run in the IT department. This should serve as a basis for choosing the correct storage services for the applications you would develop and practical considerations into utilizing storage effectively.
14:30
Creation and maintenance of a website
-
Vasvi Sharma
Creation and maintenance of a website
Vasvi Sharma
14:30 - 15:30
Overview of the Web Services Portal, and how it can be used for website creation and management at CERN. Website hosting and management services - Drupal/WordPress, WebEOS, GitLab pages etc - will be highlighted and their specific use cases outlined. The use of Matomo for web analytics will be demonstrated.
15:30
Break
Break
15:30 - 16:00
16:00
Storage (part 2 of 2)
-
Abhishek Lekshmanan
(
CERN
)
Storage (part 2 of 2)
Abhishek Lekshmanan
(
CERN
)
16:00 - 17:00
17:00
Database Services (part 1 of 4) - Introduction to DBoD
-
Andrzej Nowicki
(
CERN
)
Database Services (part 1 of 4) - Introduction to DBoD
Andrzej Nowicki
(
CERN
)
17:00 - 18:00
- Short intro about databases - Presentation of the DBoD service - How to create a database in DBoD - How to connect to my DBoD
Tuesday 5 November 2024
09:00
Modern Application Development & Deployment (Part 1 of 2)
-
Francisco Borges Aurindo Barros
(
CERN
)
Modern Application Development & Deployment (Part 1 of 2)
Francisco Borges Aurindo Barros
(
CERN
)
09:00 - 10:00
“I need to develop an application X for user community Y, which will need to be run and maintained over time” 2. Application development a. This part of the session will explore the multiple types of applications, how to leverage version control system Gitlab and it's CI to have modern application deployment. This is organized as a workshop and will include a hands-on experience covering the best practices to develop containerized applications and strategies for deploying them. ------ Participants will begin by exploring various application types and learning how to leverage GitLab's version control and CI pipelines for efficient deployment. Through hands-on exercises, attendees will develop a simple application, while learning key concepts such as: - Best practices for developing containerized applications. - Writing DockerFiles for application deployment. - Setting up continuous integration (CI) workflows to automate testing and build and publish Docker images.
10:00
Modern Application Development & Deployment (part 2 of 2)
-
Francisco Borges Aurindo Barros
(
CERN
)
Modern Application Development & Deployment (part 2 of 2)
Francisco Borges Aurindo Barros
(
CERN
)
10:00 - 11:00
“I need to develop an application X for user community Y, which will need to be run and maintained over time” 2. Application development a. This part of the session will explore the multiple types of applications, how to leverage version control system Gitlab and it's CI to have modern application deployment. This is organized as a workshop and will include a hands-on experience covering the best practices to develop containerized applications and strategies for deploying them.
11:00
Break
Break
11:00 - 11:30
11:30
Project Management and documentation
-
Francisco Borges Aurindo Barros
(
CERN
)
Project Management and documentation
Francisco Borges Aurindo Barros
(
CERN
)
11:30 - 12:30
In this session we will explore the solutions for project management and effective software development. We'll start by demonstrating how to plan and track project progress using Jira, available at https://its.cern.ch. Next, we'll highlight how GitLab Pages can be utilized to store and share information, whether for internal use or a broader audience, showcasing instances of technical documentation delivered to end-users. Finally, we'll explore Confluence for documentation storage, providing a walkthrough of its features and real-world examples of its use.
12:30
Lunch
Lunch
12:30 - 13:30
13:30
Core compute services (part 1 of 4)
-
Giacomo Tenaglia
(
CERN
)
Core compute services (part 1 of 4)
Giacomo Tenaglia
(
CERN
)
13:30 - 14:30
An in depth set of use case where IT services are heavily used for physics, analysis and engineering applications. In part I of the series we will show use cases for Openstack, Linux and virtual-machine based configuration management.
14:30
Transport to UN
Transport to UN
14:30 - 15:30
15:30
Social Activity - Visit to the United Nations in Geneva
Social Activity - Visit to the United Nations in Geneva
15:30 - 18:30
18:30
Social dinner
Social dinner
18:30 - 20:30
Wednesday 6 November 2024
09:00
Database Services (part 2 of 4) - DBoD maintenance exercises
-
Andrzej Nowicki
(
CERN
)
Database Services (part 2 of 4) - DBoD maintenance exercises
Andrzej Nowicki
(
CERN
)
09:00 - 10:00
- What are the typical tasks to be performed as DBoD owner Exercises on: - Cloning mechanism - Upgrades - TLS certificates
10:00
Core compute services (part 2 of 4)
-
Ben Jones
(
CERN
)
Core compute services (part 2 of 4)
Ben Jones
(
CERN
)
10:00 - 11:00
An in depth set of use case where IT services are heavily used for physics, analysis and engineering applications. In part 2 of the series we will explore HTCondor, the high-throughput compute platform used for batch computing.
11:00
Break
Break
11:00 - 11:30
11:30
Database Services (part 3 of 4) - DBoD maintenance exercises
-
Andrzej Nowicki
(
CERN
)
Database Services (part 3 of 4) - DBoD maintenance exercises
Andrzej Nowicki
(
CERN
)
11:30 - 12:30
- What are the typical tasks to be performed as DBoD owner Exercises on: - Cloning mechanism - Upgrades - TLS certificates
12:30
Lunch
Lunch
12:30 - 13:30
13:30
Application security
-
Sebastian Lopienski
(
CERN
)
Application security
Sebastian Lopienski
(
CERN
)
13:30 - 14:30
Short introduction to best practices for secure development, testing and deployment - Three golden rules for system security - Software security, typical vulnerability types - How security analysis tools can help - Introduction to penetration testing - Deployment security best practices
14:30
Data Analysis Techniques using SWAN and REANA (part 1 of 3)
-
Enric Tejedor Saavedra
(
CERN
)
Pedro Miguel Esteves Maximino
Diogo Castro
(
CERN
)
Data Analysis Techniques using SWAN and REANA (part 1 of 3)
Enric Tejedor Saavedra
(
CERN
)
Pedro Miguel Esteves Maximino
Diogo Castro
(
CERN
)
14:30 - 15:30
In this first session, we will give an overview of the SWAN service. This will include the following points: - Interface: classic and JupyterLab - Creation of projects, notebooks and terminals - Integration with CVMFS for software provisioning - Integration with EOS for storage and CERNBox for sharing - Use of GPUs - Connection to Spark clusters Moreover, we will give a live demo that participants will be able to follow along and get familiar with the basic features of SWAN.
15:30
Break
Break
15:30 - 16:00
16:00
Deploying applications (part 1 of 2)
-
Alberto Pimpo
Deploying applications (part 1 of 2)
Alberto Pimpo
16:00 - 17:00
In this lecture, we will understand the difference between IaaS, PaaS and SaaS. Then, we will learn how to deploy custom and off-shelf applications to OKD PaaS.
17:00
Deploying applications (part 2 of 2)
-
Alberto Pimpo
Deploying applications (part 2 of 2)
Alberto Pimpo
17:00 - 18:00
Exercises regarding how to deploy custom and off-shelf applications to OKD PaaS.
Thursday 7 November 2024
09:00
Data Analysis Techniques using SWAN and REANA (part 2 of 3)
-
Tibor Simko
(
CERN
)
Marco Donadoni
(
CERN
)
Data Analysis Techniques using SWAN and REANA (part 2 of 3)
Tibor Simko
(
CERN
)
Marco Donadoni
(
CERN
)
09:00 - 10:00
In the second session of this series, we shall present REANA reusable and reproducible analysis platform. REANA allows researchers to structure their data analyses by means of declarative workflow languages (CWL, Snakemake, Yadage) and run containerised data analysis pipelines on remote compute clouds (Kubernetes, HTCondor, Slurm). In the first part of this session, we shall discuss the notions of computational reproducibility and reusability, underlying the importance of encapsulating the original computing environments by means of containers and documenting the steps necessary to arrive at results. We shall provide a brief introduction to declarative workflow languages and discuss its pros and cons when compared to imperative analysis code programming. In the second part of this session, the participants will familiarise themselves with the REANA platform by means of running a simple analysis example. We shall use the [https://reana.cern.ch](https://reana.cern.ch/) instance at CERN to run a [RooFit demo example](https://github.com/reanahub/reana-demo-root6-roofit/).
10:00
Services for Machine Learning applications (part 1 of 3)
-
Ricardo Rocha
(
CERN
)
Services for Machine Learning applications (part 1 of 3)
Ricardo Rocha
(
CERN
)
10:00 - 11:00
This session will introduce the different phases in a ML lifecycle, and how IT services can help in each of the parts. In particular, it will: * Overview of ML and use cases, containerization and how it helps out in defining single units of computation, isolate custom software environments, and ensure sustainability for reproducible results * Demo how cloud native environments (Kubernetes and its ecosystem) can help manage those units of computation and scale them out to large amounts of resources * Provide an example of how to scale out using both on-premises and public cloud resources, and when this might be useful and cost effective
11:00
Break
Break
11:00 - 11:30
11:30
Core compute services (part 3 of 4)
-
Nils Høimyr
(
CERN
)
Core compute services (part 3 of 4)
Nils Høimyr
(
CERN
)
11:30 - 12:30
An in depth set of use case where IT services are heavily used for physics, analysis and engineering applications. In part 3 of the series we will explore the Slurm, the technology underlying the HPC platform at CERN.
12:30
Lunch
Lunch
12:30 - 13:30
13:30
Services for Machine Learning applications (part 2 of 3)
-
Diana Gaponcic
(
IT-PW-PI
)
Services for Machine Learning applications (part 2 of 3)
Diana Gaponcic
(
IT-PW-PI
)
13:30 - 14:30
This session will focus on the infrastructure and low level tools required to efficiently deploy machine learning applications. In particular, it will cover: * The different data types and how they can impact ML workloads, as well as support in different types of hardware and software libraries * Key differences between CPUs and GPUs and how they impact ML workloads (training and serving) * The available techniques in IT services for GPU sharing and partitioning. In particular, it will cover how applications can build on the existing Kubernetes service to simplify these operations * Hands-on exercises on using GPUs for different types of workloads
14:30
Services for Machine Learning applications (part 3 of 3)
-
Raulian-Ionut Chiorescu
Services for Machine Learning applications (part 3 of 3)
Raulian-Ionut Chiorescu
14:30 - 15:30
This session will focus on available ML techniques for distributed training of models, hyperparameter optimization and model service. In particular, starting from a well known use case it will demonstrate: * How to go from a script, to a docker image training on a single node, to a distributed training setup with multiple nodes * How to do hyperparameter optimization, which kind of optimizers are available, how to monitor the workloads and how to publish the models * How to serve models in production, at scale, with a simple http entrypoint or embedding the model in an application
15:30
Break
Break
15:30 - 16:00
16:00
Authentication and authorization
-
Hannah Short
(
CERN
)
Authentication and authorization
Hannah Short
(
CERN
)
16:00 - 17:00
(Part of "Software development and hosting" track) In this class, we will see how to: - Enable authentication with CERN SSO - Define role-based authorization for our applications - Get tokens, and use them access APIs
17:00
Authentication and authorization (Excercises)
-
Hannah Short
(
CERN
)
Authentication and authorization (Excercises)
Hannah Short
(
CERN
)
17:00 - 18:00
(Part of "Software development and hosting" track) In this class, we will see how to: - Enable authentication with CERN SSO - Define role-based authorization for our applications - Get tokens, and use them access APIs
Friday 8 November 2024
09:00
Lightning talks
-
Hannes Jakob Hansen
Jonathan Samuel
(
CERN - IT-CD-DPP
)
Nayana Bangaru
(
Universita di Napoli Federico II (IT)
)
Joao Ramiro
Elena De La Fuente Garcia
(
Universidad Politecnica de Madrid (ES)
)
Gabor Biro
(
HUN-REN Wigner Research Centre for Physics (HU)
)
Jesse Geens
Lightning talks
Hannes Jakob Hansen
Jonathan Samuel
(
CERN - IT-CD-DPP
)
Nayana Bangaru
(
Universita di Napoli Federico II (IT)
)
Joao Ramiro
Elena De La Fuente Garcia
(
Universidad Politecnica de Madrid (ES)
)
Gabor Biro
(
HUN-REN Wigner Research Centre for Physics (HU)
)
Jesse Geens
09:00 - 10:00
| ID | Name | Title of my talk | | --- | ------------------------- | ----------------------------------------------------------------------------------------------- | | 1 | Nayana Bangaru | Simulating the response of a silicon detector | | 2 | Gábor Bíró | Computational Challenges in Image Reconstruction for Proton Computed Tomography | | 3 | Elena De la Fuente Garcia | A new Open-Source 3D Time-Domain Electromagnetic Solver for Beam-Coupling Impedance Calculation | | 4 | Jesse Geens | Solid: an open standard for structuring data, digital identities, and applications on the Web. | | 5 | Hannes Jakob Hansen | How to Manage the Your ML Model Artifacts? | | 6 | João Ramiro | How we use airflow | | 7 | Jonathan Samuel | Improving Education within Computer Science |
10:00
Data Analysis Techniques using SWAN and REANA (part 3 of 3)
-
Diogo Castro
(
CERN
)
Tibor Simko
(
CERN
)
Enric Tejedor Saavedra
(
CERN
)
Marco Donadoni
(
CERN
)
Pedro Miguel Esteves Maximino
Data Analysis Techniques using SWAN and REANA (part 3 of 3)
Diogo Castro
(
CERN
)
Tibor Simko
(
CERN
)
Enric Tejedor Saavedra
(
CERN
)
Marco Donadoni
(
CERN
)
Pedro Miguel Esteves Maximino
10:00 - 11:00
In the third session of this series, we will propose short exercises using SWAN and REANA to cover more data analysis examples and use cases. The session will be split in two parts, one for each tool, where participants will be able to work on the exercises and get assistance from the lecturers.
11:00
Break and school photo
Break and school photo
11:00 - 11:30
11:30
Core compute services (part 4 of 4)
-
Giacomo Tenaglia
(
CERN
)
Core compute services (part 4 of 4)
Giacomo Tenaglia
(
CERN
)
11:30 - 12:30
An in depth set of use case where IT services are heavily used for physics, analysis and engineering applications.
12:30
Lunch
Lunch
12:30 - 13:30
13:30
Database Services (part 4 of 4) - Oracle Database
-
Andrzej Nowicki
(
CERN
)
Database Services (part 4 of 4) - Oracle Database
Andrzej Nowicki
(
CERN
)
13:30 - 14:30
- Introduction of the Oracle Database service - Resource portal as a way to manage Oracle database users - needed e-groups - Other tooling provided by the team - Session Manager - How to connect to the Oracle database - Where to get the client? - What is the tnsnames.ora file?
14:30
Self Assessment
Self Assessment
14:30 - 16:00
16:00
Break
Break
16:00 - 16:30
16:30
Closing Session
-
Alberto Pace
(
CERN
)
Closing Session
Alberto Pace
(
CERN
)
16:30 - 17:30