HEPiX Spring 2016 Workshop

Name: HEPiX Spring 2016 Workshop
Start: 2016-04-18T08:00:00+02:00
End: 2016-04-22T14:30:00+02:00
Location: DESY Zeuthen

Europe/Berlin timezone

Local organisers

hepix-org-2015@desy.de

Running virtualized Hadoop, does it make sense?

18 Apr 2016, 16:10

25m

Seminar room 3 (DESY Zeuthen)

Platanenallee 6, 15738 Zeuthen (near Berlin), Germany

Storage & Filesystems

Speaker: Kacper Surdy (CERN)

Public and private clouds based on virtual machines (VMs) are a modern approach to deploying computing resources. Virtualisation of computer hardware allows additional optimisations in the utilisation of computing resources compared to the traditional hardware deployment model. The price of running virtual machines on physical hypervisors is additional overhead. This is a concern in high-throughput computing and big data analytics, where distributed data processing frameworks typically push hardware capabilities to their limit. This presentation reports on our tests and experience with Hadoop components running on fully virtualised hardware using the CERN OpenStack infrastructure. The pros and cons of running Hadoop on VMs versus physical machines will be discussed, as well as performance aspects of running CERN data analytics workloads on a virtual stack.

Length of presentation (minutes, max. 20): 15

Authors: Kacper Surdy (CERN), Zbigniew Baranowski (CERN)

hepix2016_hadoop.pdf

hepix2016_hadoop.pptx
