28–30 Jan 2019
CNR
Europe/Zurich timezone

SWAN and its analysis ecosystem

29 Jan 2019, 11:35
20m
CNR

CNR

National Research Council - Piazzale Aldo Moro 7, 00185 Roma, Italy
Presentation Cloud infrastructure and software stacks for data science Data science: applications and infrastructure

Speaker

Diogo Castro (CERN)

Description

CERN, and High Energy Physics (HEP) in general, face unprecedented challenges in data storage, processing and analysis. With the planned improvements to the Large Hadron Collider (LHC), including the High-Luminosity LHC, there is an expected increase of data in one order of magnitude. After processing and filtering these data, new tools and solutions, capable of dealing with such large datasets, are particularly important for the last phases of analysis.

SWAN (Service for Web-based ANalysis) is a service that provides an interactive interface to access data analysis tools from the web, allowing users to perform their work in a simpler way and with much larger datasets. Its integration with CERN’s infrastructure, more precisely with users synchronized storage and syncing capabilities (via CERNBox), computing resources, experiments data (via EOS) and software, allows a seamless experience across all of our user scenarios and devices.

But even more data means even more resources and different approaches. SWAN has recently been integrated with CERN's Spark Clusters and we are already working to provide access to our Worldwide LHC Computing Grid (WLCG) batch service. We're also experimenting with the integration outside of CERN's infrastructure, using external cloud vendors to install and deploy our Science Box service (which bundles SWAN, CERNBox, EOS and CVMFS). These last experiments are also important in our pursuit of bringing SWAN to new education scenarios, namely via the UP2University European Project.

Authors

Diogo Castro (CERN) Jakub Moscicki (CERN) Massimo Lamanna (CERN)

Co-authors

Enrico Bocchi (CERN) Enric Tejedor Saavedra (CERN) Danilo Piparo (CERN) Prasanth Kothuri (CERN)

Presentation materials