Главная
Study mode:
on
1
Intro
2
How Spark Works
3
What is Broadcast Join
4
How Broadcast Joins Work
5
Improving Broadcast Joins
6
Single Joint
7
Executors
8
Results
9
Production case study
10
Conclusion
Description:
Explore the intricacies of broadcast joins in Apache Spark SQL through this 28-minute Databricks conference talk. Delve into the mechanics of Spark's execution engine, focusing on broadcast joins and their performance implications. Learn about Workday's improvements to increase the threshold for effective broadcast joins, including executor-side broadcasting and modifications to Spark's whole-stage code generator. Discover techniques for limiting memory usage in executors while increasing broadcasting thresholds. Gain insights from real-world production case studies involving large-scale ETL pipelines. Acquire valuable knowledge to optimize your own Spark workloads and enhance your understanding of Spark's join infrastructure.

Improving Broadcast Joins in Apache Spark SQL

Databricks
Add to list