Speaker
Description
The Science Data Centre (SDC) is a new strategic infrastructure at the Leibniz-Institut für Sonnenphysik (KIS) in Freiburg, Germany, for archiving and disseminating raw and calibrated ground-based high-resolution multiwavelength spectropolarimetric and imaging data obtained primarily at the German Solar Telescopes in Tenerife, Spain, and soon from other observatories. Additionally, SDC develops data analysis tools and generates high-level science-ready products, e.g. multi-dimensional sets of physical and statistical parameters characterising the solar atmosphere, such as vector magnetic fields, doppler velocities, temperature, and their evolution. Being produced by a wide variety of different instruments, solar data is very diverse, inhomogeneous, and metadata-heavy.
We are transitioning our original single-site in-house solution based on MongoDB and GridFS into a multi-site solution based on Rucio and dCache; the challenges include:
- Implementing a suitable naming scheme for existing and future data (including versioning) within Rucio's flat namespace,
- mapping our tailored instrument-dependent data grouping into observations to Rucio containers,
- tying MongoDB, which we still use for organising and storing metadata and volatile data, to Rucio, and
- implementing site- and access-method independent embargoes using x509 authentication and dCache.
The SDC concept based on Rucio and dCache also serves as a prototype and testbed for the upcoming ESFRI 4m-class European Solar Telescope (EST).