Apache Hadoop spans two domains: data computation (engines such as Spark, MapReduce, and Flink) and data storage (HDFS). HDFS is a distributed file system that, by default, uses 3x replication for data redundancy and availability; this incurs a 200% storage overhead, since two extra copies of every block are kept. Hadoop 3 introduces a major improvement over plain replication: Erasure Coding (EC).
Erasure Coding provides a comparable level of fault tolerance to 3x replication but with much less storage space. A Reed-Solomon RS(6,3) policy, for instance, tolerates the loss of any 3 of its 9 blocks while adding only 50% storage overhead.
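The overhead comparison above can be sketched as a quick calculation. This is an illustrative snippet, not HDFS code; the function names are my own:

```python
def replication_overhead(replicas: int) -> float:
    """Extra storage as a fraction of raw data size for n-way replication.

    Each block is stored `replicas` times, so the extra copies
    amount to (replicas - 1) times the raw data size.
    """
    return float(replicas - 1)


def ec_overhead(data_blocks: int, parity_blocks: int) -> float:
    """Extra storage as a fraction of raw data size for RS(k, m) erasure coding.

    For every `data_blocks` data blocks, `parity_blocks` parity
    blocks are added, so the overhead is m / k.
    """
    return parity_blocks / data_blocks


# 3x replication: two extra copies -> 200% overhead
print(replication_overhead(3))  # 2.0

# RS(6,3): 3 parity blocks per 6 data blocks -> 50% overhead
print(ec_overhead(6, 3))  # 0.5
```

Both schemes here survive the loss of any 3 block replicas/blocks, which is what makes the 200% vs. 50% overhead comparison meaningful.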
This project aims to evaluate the performance of Erasure Coding in HDFS.