Главная
Study mode:
on
1
Intro
2
Spark's Scale-Out World
3
Scale-Out Sum
4
Spark Aggregators
5
Data Sketching: T-Digest
6
Is T-Digest an Aggregator?
7
Romantic Chemistry
8
Romantic Montage
9
UDAF Anatomy
10
What Could Go Wrong?
11
Wait What?
12
SPARK-27296
13
Aggregator Anatomy
14
Intuitive Serialization
15
Custom Aggregation in Spark 3.0
16
Performance
17
Don't Give Up
18
Patience
19
Respect
Description:
Explore the evolution and power of User Defined Aggregate Functions (UDAFs) in Apache Spark through this 21-minute Databricks conference talk. Delve into the journey of customized scalable aggregation logic, from its initial flaws to the improved design in Spark 3.0. Learn how to create your own UDAF library, understand the inner workings of User Defined Aggregation, and discover how the latest UDAF features enhance both usability and performance. Gain insights into the Apache Spark code review process and pick up valuable tips for successfully integrating large features into the upstream community. Follow the speaker's personal experience with UDAFs, from initial challenges to ultimate triumph, while acquiring practical knowledge about this powerful feature in Apache Spark's data processing capabilities.

User Defined Aggregation in Apache Spark - From Challenges to Improvements

Databricks
Add to list
0:00 / 0:00