Speaker
Marco Donadoni
(CERN)
Description
REANA is a platform for reusable and reproducible data analyses. REANA allows researchers to use declarative analysis workflows (CWL, Snakemake, Yadage) and run them on containerised compute clouds (Kubernetes, HTCondor, Slurm). In this talk we present a status update on REANA, covering the latest developments, with a particular focus on data-preservation oriented use cases. We demonstrate how REANA can offer an "analysis engine" to complement data preservation activities in view of verifying data provenance information or to ensure the validity of data usage examples for future data reuse by means of actionable "continuous reuse" workflows.
Authors
Marco Donadoni
(CERN)
Tibor Simko
(CERN)