CernVM Users Workshop

Name: CernVM Users Workshop
Start: 2016-06-06T09:00:00+01:00
End: 2016-06-08T16:00:00+01:00

6–8 Jun 2016

Europe/London timezone

CVMFS for Data Federations

7 Jun 2016, 14:20

20m

CernVM Related Activities II

Brian Paul Bockelman (University of Nebraska (US))

A data federation is a cooperating set of storage resources transparently accessible across a wide area network via a common namespace. These are often implemented through a redirector hierarchy - clients query a centralized endpoint for a given file; this redirector locates an available storage resource, then redirects the client to the remote resource.
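
The redirect step described above can be sketched as a toy, in-memory model. This is only an illustration under stated assumptions: the `StorageServer` and `Redirector` classes, the host names, and the file path are all hypothetical, and a real federation performs these steps over the network rather than in one process.

```python
from dataclasses import dataclass, field


@dataclass
class StorageServer:
    """Hypothetical storage resource holding a set of logical file names."""
    url: str
    files: set = field(default_factory=set)
    online: bool = True


class Redirector:
    """Hypothetical central endpoint that locates a server for a file."""

    def __init__(self, servers):
        self.servers = list(servers)

    def locate(self, path):
        # Pick the first server that is online and holds the file,
        # and return the address the client should be redirected to.
        for server in self.servers:
            if server.online and path in server.files:
                return f"{server.url}{path}"
        raise FileNotFoundError(path)


# Two made-up sites hold the same file; the first one is unavailable.
servers = [
    StorageServer("root://site-a.example", {"/store/data/run1.root"}, online=False),
    StorageServer("root://site-b.example", {"/store/data/run1.root"}),
]
redirector = Redirector(servers)

# The client asks only the redirector and is sent to an available replica.
target = redirector.locate("/store/data/run1.root")
print(target)
```

The client never needs to know which site holds the file; it only needs the common namespace path and the redirector's address.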

Data federations are increasingly used to distribute large volumes of physics data. For example, the Compact Muon Solenoid (CMS) experiment has approximately 20 PB of analysis data available through its "Any Data, Any Time, Anywhere" (AAA) federation.

However, the namespace of AAA is extremely limited - it is equivalent to just an HTTP GET. There are no directory listings and no authoritative size or checksum information, even though this information is known to CMS, held in the underlying storage systems, and spread across several services. This makes AAA user-hostile for data discovery.
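
To make the contrast concrete, the following sketch compares GET-only access with a metadata catalog that supports listing and stat-like queries. It is a toy model and not the actual catalog format of CVMFS or AAA: the `BLOBS` contents, the paths, and the `get`, `listdir`, and `stat` helpers are all hypothetical.

```python
import hashlib
import posixpath

# Hypothetical file contents standing in for data held by the federation.
BLOBS = {
    "/store/data/run1.root": b"event data 1",
    "/store/data/run2.root": b"event data 22",
}


def get(path):
    """GET-only access: the caller must already know the exact path."""
    return BLOBS[path]


# A catalog of metadata only, so it can be browsed without fetching data.
CATALOG = {
    path: {"size": len(blob), "sha1": hashlib.sha1(blob).hexdigest()}
    for path, blob in BLOBS.items()
}


def listdir(directory):
    """List entries directly under a directory, using the catalog only."""
    return sorted(
        posixpath.basename(p) for p in CATALOG if posixpath.dirname(p) == directory
    )


def stat(path):
    """Return the recorded size and checksum for a path."""
    return CATALOG[path]


print(listdir("/store/data"))
print(stat("/store/data/run1.root")["size"])
```

With only `get`, a user cannot discover that `run2.root` exists or check a download against a known checksum; the catalog answers both questions without moving any file data.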

In this presentation, we will discuss a series of improvements made to the CVMFS core to marry a user-friendly, CVMFS-based POSIX namespace with data federation. We will demonstrate a set of CVMFS repositories of increasing complexity that utilize these new CVMFS features. These repositories serve as frontends for data federations for OSG, LIGO, and CMS.

Finally, we will discuss plans to grow this work - in terms of scale (data volume), efficiency, and features used in production.

Summary

An effort to utilize CVMFS's scalable namespace features to provide a POSIX interface for data federations.

Brian Paul Bockelman (University of Nebraska (US))

Dave Dykstra (Fermi National Accelerator Lab. (US))

Derek John Weitzel (University of Nebraska (US))

CVMFS-DataFeds.pdf

Video
