Speaker
Description
We will present our recent experiences with integrating EOS into Invenio digital library framework and how EOS allows data repository services such as Zenodo to handle large files in an efficient and scalable manner. Invenio v3, the underlying framework for a number of data preservation repositories such as CERN OpenData, CERN Document Server and Zenodo, was completely rebuilt from ground-up during 2016. In particular, Invenio's file handling layer was completely revamped in order to support multiple storage backends via PyFilesytem library as well as handling of large files. We will present both the Invenio layer and the benefits and obstacles we encounterd using EOS, as well as the XRootDPyFS plugin for PyFilesystem which provides access to EOS over XRootD for any Python application.