4–8 Nov 2024
Europe/Zurich timezone

Services for Machine Learning applications (part 2 of 3)

7 Nov 2024, 13:30
1h

Speaker

Diana Gaponcic (IT-PW-PI)

Description

This session will focus on the infrastructure and low level tools required to efficiently deploy machine learning applications. In particular, it will cover:

  • The different data types and how they can impact ML workloads, as well as support in different types of hardware and software libraries
  • Key differences between CPUs and GPUs and how they impact ML workloads (training and serving)
  • The available techniques in IT services for GPU sharing and partitioning. In particular, it will cover how applications can build on the existing Kubernetes service to simplify these operations
  • Hands-on exercises on using GPUs for different types of workloads

Presentation materials