10–14 Oct 2016
San Francisco Marriott Marquis
America/Los_Angeles timezone

Integration of Oracle and Hadoop: hybrid databases affordable at scale

10 Oct 2016, 15:45
15m
GG C1 (San Francisco Marriott Marquis)

Oral Track 2: Offline Computing

Speaker

Luca Canali (CERN)

Description

This work reports on activities to integrate Oracle and Hadoop technologies for CERN database services, in particular the development of solutions for offloading data and queries from Oracle databases into Hadoop-based systems. The aim is to increase scalability and reduce cost for some of our largest Oracle databases. These concepts have been applied, among others, to build offline copies of controls and logging databases, which allow reports to be run without affecting critical production systems and also reduce storage cost. Other use cases include making data stored in Hadoop/Hive available from Oracle SQL, which opens the possibility of building applications that integrate data from both sources.

Primary Keyword: Databases
Secondary Keyword: Data processing workflows and frameworks/pipelines
