Indico celebrates its 20th anniversary! Check our blog post for more information!

1–5 Sept 2014
Faculty of Civil Engineering
Europe/Prague timezone

A Survey on Distributed File System Technology

2 Sept 2014, 09:00
35m
B280 (Faculty of Civil Engineering)

B280

Faculty of Civil Engineering

Faculty of Civil Engineering, Czech Technical University in Prague Thakurova 7/2077 Prague 166 29 Czech Republic
Plenary Computing Technology for Physics Research Plenary

Speaker

Jakob Blomer (CERN)

Description

Distributed file systems provide a fundamental abstraction to location-transparent, permanent storage. They allow distributed processes to co-operate on hierarchically organized data beyond the life-time of each individual process. The great power of the file system interface lies in the fact that applications do not need to be modified in order to use distributed storage. On the other hand, the general and simple file system interface makes it notoriously difficult for a distributed file system to perform well under a variety of different workloads. This has lead to today’s landscape with a number of popular distributed file systems, each tailored to a specific use case. This contribution provides a survey of distributed file systems and key ideas of their internal mechanics. Early implementations merely execute file system calls on a remote server, which limits scalability and resilience to failures. Such limitations have been greatly reduced by modern techniques such as distributed hash tables, content-addressable storage, distributed consensus algorithms, or erasure codes. In the light of upcoming scientific data volumes at the exabyte scale, two trends are emerging. First, the previously monolithic design of distributed file systems is decomposed into services that independently provide a hierarchical namespace, data access, and distributed coordination. Secondly, the segregation of storage and computing resources yields to a storage architecture in which every compute node also participates in providing persistent storage.

Primary author

Jakob Blomer (CERN)

Presentation materials

Peer reviewing

Paper