1. Introduction
2. Why this talk
3. Data Science Libraries
4. Task
5. Dask
6. Environment Management
7. Uniform software environments
8. Data science vs IT
9. Resource sharing
10. Access
11. IT Professional
12. Credentials
13. Security
14. Costs
15. Avoid, Track, Optimize
16. Cost
17. Conclusion
18. Friendly Resource Managers
19. Managed Solutions
20. Opinionated Solutions
21. Coiled Computing
22. Managed Services
23. Summary
Description:
Explore the challenges of scaling Python with Dask on distributed hardware, and the solutions to them, in this PyCon US talk. Dive into deployment strategies for Dask on cluster resource managers such as Kubernetes and YARN, as well as on cloud platforms. Learn how the Dask library extends popular Python data science tools to handle 100+ TB datasets on multi-core workstations and distributed clusters. Discover approaches to balancing load, sharing resources, controlling access, and ensuring security when deploying Dask within an organization. Examine real-world examples of Dask's positive social impact in large-scale data processing. Gain insights for IT professionals into uniform software environments, resource sharing, credentials management, and cost optimization. Understand the landscape of friendly resource managers, managed solutions, and opinionated approaches for deploying Python efficiently at scale.

Deploying Python at Scale with Dask

PyCon US