Главная
Study mode:
on
1
Intro
2
Data Quality
3
ETL Process
4
Quality Checks
5
Data Quality Approaches
6
Data Quality Tools
7
Deku
8
Code Generation
9
Great Expectations
10
Pandas Profiling
11
Apache Griffin
12
Apache Griffin Limitations
13
Examples
14
Uniqueness checks
15
Advanced checks
16
Timely data
17
Other frameworks
Description:
Explore open-source solutions for ensuring data quality in continuous import scenarios in this 28-minute presentation from Databricks. Compare popular options like Apache Griffin, Deequ, DDQ, and Great Expectations across dimensions such as maturity, documentation, extensibility, and features including data profiling and anomaly detection. Learn about various data quality approaches, tools, and frameworks, including ETL processes, quality checks, code generation, and advanced uniqueness checks. Gain insights into the limitations of Apache Griffin and discover how to implement timely data quality assurance in your organization's data pipeline.

Data Quality Tools Comparison for Continuous Data Imports

Databricks
Add to list
0:00 / 0:00