Главная
Study mode:
on
1
Intro
2
SOL Use Cases @ Facebook
3
Towards an Unified SOL Experience
4
Presto and Spark Architecture
5
Why Presto (or Other MPPs) Doesn't Scale?
6
Presto Unlimited
7
Why Presto-on-Spark
8
Presto-on-Spark Design Principles
9
Planning
10
Translating to RDD
11
Columnar Format to Row Format Conversion
12
Broadcast Join
13
Spark DAG
14
Execution
15
Threading Model
16
Classloader Isolation
17
Current Status
Description:
Explore the architectural tradeoffs between map/reduce and parallel databases in this 25-minute conference talk from Databricks. Dive deep into the architectures of Presto and Apache Spark, focusing on key differentiators like disaggregated shuffle. Learn about the Presto-on-Spark project, a specialized Data Frame application that combines Presto's low-latency evaluation with Spark's robust execution engine. Discover the motivation, design, and current status of this initiative aimed at enabling a unified SQL experience for both interactive and batch use cases. Gain insights into Facebook's experience scaling both Presto and Spark for large-scale batch workloads, and understand the potential for greater collaboration between the Spark and Presto communities.

Presto on Apache Spark - A Tale of Two Computation Engines

Databricks
Add to list