CERN Accelerating science

Talk
Title Real-Time processing of Big Data with ScyllaDB
Video
If you experience any problem watching the video, click the download button below
Download Embed
Mp4:Medium
(1000 kbps)
High
(4000 kbps)
More..
Copy-paste this code into your page:
Copy-paste this code into your page to include both slides and lecture:
Author(s) Costa, Glauber (speaker) (ScyllaDB) ; Martinez Pedreira, Miguel (speaker) (Johann-Wolfgang-Goethe Univ. (DE))
Corporate author(s) CERN. Geneva
Imprint 2018-06-29. - 1:38:30.
Series (CERN Computing Seminar)
Lecture note on 2018-06-29T14:00:00
Subject category CERN Computing Seminar
Abstract

ScyllaDB: achieving 1 million operations/sec with stable and consistent real time latencies

This talk will present ScyllaDB, a highly available Real-time Big Data Database that can achieve high throughput without compromising latencies or availability. ScyllaDB is API-compatible with Apache Cassandra but employs a different internal architecture to make sure that operational capacity is increased while the maintenance burden is reduced. It provides everything that a new-world database must provide: horizontal (infinite) scaling, no single point of failure, high availability and excellent performance, while keeping a sensible amount of operational efforts. Some of the key points that make ScyllaDB very efficient are its fully asynchronous operations and the smart integration with the kernel and hardware. You will learn about what makes ScyllaDB special in the crowded space of NoSQL solutions and how it can be used to power a wide variety of workloads: from real time bidding to the experiment data from the ALICE Collaboration.

About the speakers

Glauber Costa (Lord Glauber I of Sealand) is a Principal Architect at ScyllaDB. He shares his time between the engineering department working on upcoming Scylla features and helping customers succeed. Before ScyllaDB, Glauber worked with Virtualization in the Linux Kernel for 10 years, with contributions ranging from the Xen Hypervisor to all sorts of guest functionality and containers.

Miguel Martinez Pedreira is a core developer in the ALICE Offline team at CERN, working on the GRID middleware called AliEn and involved in the operation and maintenance of its multiple backends. The main one is the File Catalogue, that holds all the ALICE experiment data, and Miguel is facing its redesign in order to be ready for LHC Run3 scalability challenges.

Copyright/License © 2018-2024 CERN
Submitted by miguel.marquina@cern.ch

 


 Record created 2018-07-02, last modified 2022-11-02


External link:
Download fulltext
Event details