PyHEP 2022 (virtual) Workshop

Name: PyHEP 2022 (virtual) Workshop
Start: 2022-09-12T14:00:00+02:00
End: 2022-09-16T23:00:00+02:00
Location: No location set

12–16 Sept 2022

Europe/Zurich timezone

Overview
Call for Abstracts
Timetable
Registration
Participant List
Proceedings
Code of conduct
EDI statement

Contact us

pyhep2022-organisation@cern.ch

Dask Tutorial

16 Sept 2022, 14:00

Tutorial Plenary Session Friday

Doug Davis

Dask provides a foundation to natively scale Python libraries and applications. Dask collection libraries like dask.array and dask.dataframe mimic the ubiquitous APIs of NumPy and Pandas to parallelize and/or distribute NumPy-like and Pandas-like workflows. The dask.delayed collection supports parallalization of custom algorithms. In this tutorial we will introduce the core Dask collections, the concepts behind them (partitioned objects represented by task graphs), and Dask's distributed execution engine that is compatible with common HEP batch compute systems. Finally, we will introduce recently developed Dask collections that support partitioned and distributed representations of awkward arrays and boost-histogram objects.

Doug Davis

Binder

GitHub

YouTube

PyHEP 2022 (virtual) Workshop

Contact us

Dask Tutorial

Speaker

Description

Author

Presentation materials

Choose timezone

PyHEP 2022 (virtual) Workshop

Contact us

Speaker

Description

Author

Presentation materials