Главная
Study mode:
on
1
This is a Special Year for Apache Spark
2
2008: Datacenter-scale computing
3
2009: Back to Berkeley
4
2010: Open Source Spark
5
2012-15: Expand Access to Spark
6
Apache Spark Today: Python
7
Apache Spark Today: SOL
8
Major Lessons
9
Apache Spark 3.0
10
Spark 3.0: SOL Engine
11
Spark 3.0: Python Usability Python type hints for Pandas UDFs
12
Spark 3.0: Python and R Performance
13
Spark 3.0: Other Features
14
Other Apache Spark Ecosystem Projects
15
Announcing Koalas 1.0!
16
Learning Spark 2nd Edition
17
OSS Spark Development Initiatives at Databricks
Description:
Explore the evolution and future of Apache Spark in this keynote from Spark + AI Summit 2020 featuring Matei Zaharia, the original creator of Apache Spark, and Brooke Wenig. Delve into the major community developments with the release of Apache Spark 3.0, designed to enhance usability, speed, and compatibility with various data sources and runtime environments. Discover how Spark 3.0 advances the project's goal of making data processing more accessible through improvements to SQL and Python APIs, as well as automatic tuning and optimization features. Reflect on Spark's 10-year journey since its initial open source release, examining the project's growth, user base expansion, and the evolving ecosystem around it, including Koalas, Delta Lake, and visualization tools. Gain insights into the latest developments in the open-source community, including Apache Spark 3.0 and DBR 7.0, and learn about Databricks' unified data analytics platform powered by Apache Spark.

Introducing Apache Spark 3.0 - A Decade of Progress and Future Outlook

Databricks
Add to list
0:00 / 0:00