10-14 October 2016
San Francisco Marriott Marquis
America/Los_Angeles timezone

Everware toolkit - supporting reproducible science and challenge driven education

13 Oct 2016, 12:15
GG A+B (San Francisco Mariott Marquis)


San Francisco Mariott Marquis

Oral Track 5: Software Development Track 5: Software Development


Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))


Reproducibility is a fundamental piece of the scientific method and increasingly complex problems demand ever wider collaboration between scientists. To make research fully reproducible and accessible to collaborators a researcher has to take care of several aspects: research protocol description, data access, preservation of the execution environment, workflow pipeline, and analysis script preservation.

Version control systems like git help with the workflow and analysis scripts part. Virtualization techniques like containers or virtual machines help with sharing execution environments. Jupyter notebooks are a powerful tool to capture the computational narrative of a data analysis project.

We present project Everware that seamlessly integrates github/gitlab, Docker and Jupyter helping with a) sharing results of real research [] and b) boosts education activities. With the help of everware one can not only share the final artifacts of research but all the depth of the research process. This been shown to be extremely helpful during organization of several data analysis hackathons and machine learning schools. Using everware participants could start from an existing solution instead of starting from scratch. They could start contributing immediately.

Everware allows its users to make use of their own computational resources to run the workflows they are interested in, which enables Everware to scale to large numbers of users.

Everware is supported by the Mozilla science lab and Yandex. It is being evaluated as an option for analysis preservation at LHCb. It is an open-source project that welcomes contributions of all kinds at: https://github.com/everware/everware.

Primary Keyword (Mandatory) Analysis tools and techniques
Tertiary Keyword (Optional) Outreach
Secondary Keyword (Optional) Preservation of analysis and data

Primary authors

Alexander Tiunov (Higher School of Economics) Andrey Ustyuzhanin (Yandex School of Data Analysis (RU)) Igor Babuschkin (University of Manchester (GB)) Tim Head (Ecole Polytechnique Federale de Lausanne (CH))

Presentation Materials