Intro

Three phases of Delta Lake (abridged)

What is in Delta 2.0.0?

Data skipping via column stats

Change Data Feed: Motivation

Change Data Feed: Problem

Change Data Feed: Solution

Column Mapping: Problem

Column Mapping: Solution

Multi-cluster writes on S3: Problem

Multi-cluster writes on S3: Solution

Flink: Delta Source

Trino / Presto: Delta connector

Delta Standalone

Multiple Delta projects and repositories

Description:

Explore the latest features and integrations of Delta Lake 2.0 in this 38-minute video presentation by Databricks. The talk covers the Delta community collaboration behind the release, including integrations with Apache Spark™, Apache Flink, Apache Pulsar, Presto, and Trino. Learn about features such as OPTIMIZE ZORDER, data skipping using column stats, S3 multi-cluster writes, and Change Data Feed, along with expanded language support through APIs for Rust, Python, Ruby, Go, Scala, and Java. The presentation outlines the three phases of Delta Lake's development, explains the motivation behind Change Data Feed and Column Mapping, and walks through the problem and solution for multi-cluster writes on S3. It closes with the Delta Source for Flink, the Delta connector for Trino/Presto, Delta Standalone, and an overview of the multiple Delta projects and repositories.

Delta Lake 2.0 Overview - New Features and Community Collaborations

Databricks

#Data Science #Big Data #Apache Spark #Delta Lake #Business #Business Intelligence #Data Warehousing #Apache Flink #Presto #Computer Science #Distributed Systems #Apache Pulsar #Programming #Domain-Specific Languages (DSL) #SQL #Trino
