CHEP 2016 Conference, San Francisco, October 8-14, 2016

Name: CHEP 2016 Conference, San Francisco, October 8-14, 2016
Start: 2016-10-10T08:00:00-07:00
End: 2016-10-14T18:00:00-07:00
Location: San Francisco Marriott Marquis

10–14 Oct 2016

San Francisco Marriott Marquis

America/Los_Angeles timezone

SWAN: a Service for Web-Based Data Analysis in the Cloud

12 Oct 2016, 11:30

15m

Sierra B (San Francisco Mariott Marquis)

Sierra B

San Francisco Mariott Marquis

Oral Track 6: Infrastructures Track 6: Infrastructures

Enric Tejedor Saavedra (CERN)

SWAN is a novel service to perform interactive data analysis in the cloud. SWAN allows users to write and run their data analyses with only a web browser, leveraging the widely-adopted Jupyter notebook interface. The user code, executions and data live entirely in the cloud. SWAN makes it easier to produce and share results and scientific code, access scientific software, produce tutorials and demonstrations as well as preserve analyses. Furthermore, it is also a powerful tool for non-scientific data analytics.

The SWAN backend combines state-of-the-art software technologies, like Docker containers, with a set of existing IT services such as user authentication, virtual computing infrastructure, mass storage, file synchronisation and sharing, specialised clusters and batch systems. In this contribution, the architecture of the service and its integration with the aforementioned CERN services is described. SWAN acts as a "federator of services" and the reasons why this feature boosts the existing CERN IT infrastructure are reviewed.

Furthermore, the main characteristics of SWAN are compared to similar products offered by commercial and free providers. Use-cases extracted from workflows at CERN are outlined. Finally, the experience and feedback acquired during the first months of its operation are discussed.

Primary Keyword (Mandatory)	Cloud technologies
Secondary Keyword (Optional)	Analysis tools and techniques
Tertiary Keyword (Optional)	Data processing workflows and frameworks/pipelines

Danilo Piparo (CERN) Enric Tejedor Saavedra (CERN)

Jakub Moscicki (CERN) Luca Mascetti (CERN) Massimo Lamanna (CERN) Pere Mato Vila (CERN)

Highlights-SWAN_CHEP_1016.pdf

Oral-SWAN_CHEP_1016.pdf

SWAN and ATLAS Open Data

SWAN Notebook Gallery

CHEP 2016 Conference, San Francisco, October 8-14, 2016

SWAN: a Service for Web-Based Data Analysis in the Cloud

Sierra B

San Francisco Mariott Marquis

Speaker

Description

Authors

Co-authors

Presentation materials

Choose timezone

CHEP 2016 Conference, San Francisco, October 8-14, 2016

Speaker

Description

Authors

Co-authors

Presentation materials