Presentation and demo of Delta Lake (storage layer with transactions for Apache Spark)

Europe/Zurich
513/1-024 (CERN)

513/1-024

CERN

50
Show room on map
Description

Delta Lake https://delta.io/ has recently been open sourced by Databricks. Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads.
This call with one of the main developers of Delta aims at informing CERN users about Delta's architecture and goals, main features, and roadmap of the open source release.

 

Speaker: Burak Yavuz is a Software Engineer at Databricks. He has been contributing to Spark since Spark 1.1, and is the maintainer of Spark Packages. Burak received his BS in Mechanical Engineering at Bogazici University, Istanbul, and his MS in Management Science & Engineering at Stanford.

 

    • 16:30 16:35
      Introduction of the participants, topics and goals for this meeting 5m
      Speaker: Luca Canali (CERN)
    • 16:35 17:20
      Presentation and demo of Delta Lake (speaker connecting remotely) 45m
      • Intro to Delta (motivations and goals)
      • Demo/tutorial of features we have today
      • Roadmap
      Speaker: Burak Yavuz
    • 17:20 17:30
      Q&A 10m