Главная
Study mode:
on
1
Intro
2
Data Platform
3
Elastic Self Service Spark
4
Code to Deployment
5
Security
6
Monitoring
7
Orchestration Architecture
8
Varying Workload Pattern
9
One Interface over Multi-Cloud
10
Optimize Kubernetes for Spark Workload
11
Granular Concurrency Check at Orchestration
12
Avoid Partially Running Applications
13
Timeout Partially Running Applications
14
Mitigate Cluster Storage Stress
15
Utilization-based Allocation Recommendation
16
Dynamic Allocation
17
Push-button Cloud Management
18
Scale up Spark on Kubernetes
Description:
Explore lessons learned from launching millions of Spark executors on Kubernetes in this 35-minute conference talk by Databricks. Dive into Apple's approach to supporting enormous Spark workloads for cloud services, covering orchestration techniques across Mesos and Kubernetes, private and on-premise infrastructure. Learn about effective monitoring systems, resource requirement tuning, and execution analysis. Gain insights on optimizing Kubernetes for Spark workloads, implementing granular concurrency checks, mitigating cluster storage stress, and utilizing dynamic allocation. Discover strategies for push-button cloud management and scaling up Spark on Kubernetes to support varying workload patterns across multi-cloud environments.

Apache Spark on Kubernetes - Lessons Learned from Launching Millions of Spark Executors

Databricks
Add to list
0:00 / 0:00