Главная
Study mode:
on
1
DATA AI
2
Our Team
3
Basic Operator Perf Stopped Grow
4
CPU Is The Bottleneck
5
Spark Is a Proved & Great Framework to Scale Out
6
SQL Engine Developed Years
7
An Evolution Is on The Way
8
Gluten is
9
Gluten Layout
10
Gluten Components
11
Plan Conversion
12
Buffer Passing & Sharing
13
Fallback Processing
14
Gluten's Shuffle
15
Gluten Memory Management
16
Gluten + Velox Performance
17
Gluten + ClickHouse Performance
18
Next Step
19
Take Gluten for a Spin
20
Call to Action
21
Performance Metrics (Velox)
Description:
Explore Gazelle-Jni, a middle layer designed to offload Spark SQL to native engines for execution acceleration, in this 41-minute conference talk from Databricks. Learn how Gazelle-Jni implements a shared JVM and JNI middle layer to better integrate various native SQL engines as Spark SQL's backend. Discover the process of passing Substrait transformed physical plans to native engines for improved performance. Gain insights into integrating native engines with Spark SQL through practical examples. Delve into topics such as basic operator performance, CPU bottlenecks, Spark's scalability, SQL engine development, and the evolution of data processing frameworks. Examine Gluten's layout, components, plan conversion, buffer passing and sharing, fallback processing, shuffle mechanism, and memory management. Compare performance metrics for Gluten with Velox and ClickHouse. Explore future steps, learn how to take Gluten for a spin, and understand the call to action for leveraging this technology in your data processing workflows. Read more

Gazelle-JNI: Offloading Spark SQL to Native Engines for Execution Acceleration

Databricks
Add to list
0:00 / 0:00