25–28 Jan 2021
Europe/Zurich timezone

Session

Scalable Storage

27 Jan 2021, 13:00

Conveners

Scalable Storage

  • Hugo Gonzalez Labrador (CERN)
  • Jakub Moscicki (CERN)

  1. Fabrizio Furano (CERN)
    27/01/2021, 13:00
    Scalable Storage Backends for Cloud, HPC and Global Science
    Lightning talk

    The Reva component, at the heart of the CERNBox project at CERN,
    will soon get new plugins that build on the experience
    accumulated with the current production deployment,
    where its data is stored centrally in a system called EOS. For
    10 years, EOS has represented the ultimate development effort
    toward providing an extremely scalable data storage system that
    supports the demanding requirements...

  2. Weiser Olaf (IBM)
    27/01/2021, 13:10
    Scalable Storage Backends for Cloud, HPC and Global Science
    Presentation

    IBM's high-performance file system solution scales out over multiple nodes and storage technologies. Common data access over multiple paths and protocols such as direct-POSIX, NFS, SMB, HDFS, and OBJ, together with full integration into Kubernetes containers, can help you manage your data more efficiently.
    SpectrumScale's so-called watchfolder function can be used to take advantage of central file system...

  3. Gregor Molan (Comtrade)
    27/01/2021, 13:20
    Scalable Storage Backends for Cloud, HPC and Global Science
    Presentation

    Most CERN experiments use Linux systems for data collection, so EOS was designed to work primarily with Linux. However, some highly demanding users use Windows systems for data collection. A possible solution for them is to use Samba to mount a Linux file system as a Windows drive. Most Windows users are used to getting instant applications and instant file systems, and...

  4. Terrell Russell (iRODS Consortium / RENCI)
    27/01/2021, 13:30
    Scalable Storage Backends for Cloud, HPC and Global Science
    Presentation

    With a twenty-five-year history, iRODS open source technology has been used to automate data management across many scientific and business domains. The scale and value of data across these domains drive the need for automation. This variety also demands flexibility in data management policies over time. Organizations have satisfied their own needs by investing in the development of...

  5. Theofilos Mouratidis (CERN)
    27/01/2021, 13:40
    Technology & Research
    Lightning talk

    CephFS is a distributed file system based on the popular storage system Ceph. Its scalability and POSIX features make it a compelling candidate for a Sync&Share backend. Sync&Share applications have many users interacting with them and performing constant I/O tasks. A filesystem that can handle this I/O load is therefore necessary. Even if the performance...

  6. Jörn Dreyer (ownCloud GmbH)
    27/01/2021, 13:50
    Scalable Storage Backends for Cloud, HPC and Global Science
    Presentation

    Looking up files by a stable id is an inefficient operation in most filesystems.
    While an efficient lookup by file id can be cached inside an OCIS storage provider, this cache needs to be kept up to date. By deconstructing a filesystem and storing every node by its uuid, we can avoid the cache invalidation problem at the cost of more stat requests. The ocis storage driver allows an efficient...

  7. Aritz Brosa Iartza (CERN)
    27/01/2021, 14:00
    Technology & Research
    Presentation

    This contribution presents the experience of providing CERN users with direct online access to their CERNBox storage from Windows. In production for about 15 months, a highly available Samba cluster is regularly used by a significant fraction of the CERNBox user base, following the migration of their central home folders from Microsoft DFS in the context of CERN's strategy to move to open source...

  8. Lukasz Dutka
    27/01/2021, 14:10
    Scalable Storage Backends for Cloud, HPC and Global Science
    Presentation

    Onedata [1] is a global, high-performance, transparent data management system that unifies data access across globally distributed infrastructures and multiple types of underlying storage, such as NFS, Amazon S3, Ceph, OpenStack Swift, WebDAV, XRootD, and HTTP and HTTPS servers, as well as other POSIX-compliant file systems.
    Onedata allows users to collaborate, share, and perform computations...
