21st International Conference on Computing in High Energy and Nuclear Physics (CHEP2015)

Name: 21st International Conference on Computing in High Energy and Nuclear Physics (CHEP2015)
Start: 2015-04-13T09:00:00+09:00
End: 2015-04-17T16:00:00+09:00
Location: OIST

13–17 Apr 2015

OIST

Asia/Tokyo timezone

Scale Out Databases for CERN Use Cases

Not scheduled

15m

OIST

1919-1 Tancha, Onna-son, Kunigami-gun Okinawa, Japan 904-0495

poster presentation Track3: Data store and access

Zbigniew Baranowski (CERN)

Data generation rates are expected to grow very fast for some database workloads going into LHC run 2 and beyond. In particular this is expected for data coming from controls, logging and monitoring systems. Storing, administering and accessing big data sets in a relational database system is in certain cases very demanding on the technology and therefore on costs. Notably one of the critical parts in the architecture of Oracle database clusters is the use of shared storage. Therefore there is a high interest in the CERN database community to look for alternative solutions for storing and querying big data volumes with fast and scalable data access time. Scale out database engines are an emerging and rapidly developing area. Recently a technical solution that has attracted attention is Cloudera Impala with columnar storage provided by Parquet on top of Hadoop Distributed File System. This solution has the additional benefit of offering SQL as the main data access interface which makes it easy to integrate with existing client application. In this paper we will describe the architecture of database systems based on Impala Hadoop clusters and we will discuss the results of our tests, including tests of data loading and integration with existing data sources, notably Oracle databases. We will report on query performance tests done with various data sets of interest at CERN, notably the accelerator log database.

Zbigniew Baranowski (CERN)

Daniel Lanza Garcia (Univ. Extremadura, Cen. Uni. Merida (ES)) Luca Canali (CERN) Maciej Grzybek (Warsaw University of Technology (PL))

Slides

chep2015-scaleout-databases.pdf

21st International Conference on Computing in High Energy and Nuclear Physics (CHEP2015)

Scale Out Databases for CERN Use Cases

OIST

Speaker

Description

Primary author

Co-authors

Presentation materials

Choose timezone

21st International Conference on Computing in High Energy and Nuclear Physics (CHEP2015)

Speaker

Description

Primary author

Co-authors

Presentation materials

Share this page

Direct link

Social networks

Calendaring