Главная
Study mode:
on
1
Introduction
2
Challenges with Big Data
3
Common Big Data Queries
4
Difficulty
5
Parallelization
6
Last 30 Days
7
The Sketch
8
Properties
9
Major Properties
10
Query Space
11
Partitioning
12
Query Speed
13
Time Windowing
14
Example
15
Lower System Cost
16
Team
17
Mission
18
Family Groups
19
The Future
Description:
Explore the world of sketching algorithms for big data analysis in this 29-minute talk from Databricks. Dive into the challenges of processing massive datasets and learn how specialized algorithms called 'sketches' can provide accurate approximate answers to problem queries. Discover how this technology has helped Yahoo reduce data processing times from days to minutes and enabled subsecond queries on real-time platforms. Get an introduction to DataSketches, an open-source library of core sketching algorithms designed for large production analysis and AI systems. Understand the properties of sketches, including query space partitioning, speed, and time windowing. Learn about the benefits of sketching, such as lower system costs and improved scalability. Gain insights into the future of sketching algorithms and their potential impact on big data analysis.

DataSketches: A Production Quality Sketching Library for Big Data Analysis

Databricks
Add to list