Главная
Study mode:
on
1
Intro
2
About BSC
3
TPC-DS Benchmark Work
4
Context and motivation
5
Systems Under Test (SUTs)
6
Hardware configuration
7
Software configuration System Runtime 5.5
8
Benchmark execution time (base)
9
Cost-Based Optimizer (CBO) stats
10
Benchmark execution time (stats)
11
Speedup with table and column stats
12
Additional configuration for Presto
13
TPC-DS Power Test - Query 72
14
Dynamic data partitioning
15
Benchmark exec. time (part + stats)
16
Speedup with partitioning and stats
17
TPC Benchmark total execution time
18
TPC Benchmark DS metric
19
System costs
20
TPC Benchmark DS cost
21
TPC-DS price-performance
22
Usability and developer productivity
23
Conclusions
Description:
Explore an in-depth performance analysis of Apache Spark and Presto in cloud environments through this 37-minute conference talk. Gain valuable insights into the performance and cost considerations of these big data analytics systems running on Amazon EMR, with a special focus on Apache Spark's performance on the Databricks Unified Analytics Platform. Learn about the TPC-DS benchmark results, SQL performance comparisons, and the advantages and disadvantages of each solution. Discover quantitative data and expert analysis to help inform your decision-making process when deploying data analytics at scale, avoiding common pitfalls, and optimizing your cloud-based big data infrastructure.

Performance Analysis of Apache Spark and Presto in Cloud Environments

Databricks
Add to list