Главная
Study mode:
on
1
Intro
2
Data-Driven Decisions
3
Data Discovery Not Productive
4
What is Amundsen
5
Dataset detail page
6
Lineage between dashboards and dataset
7
Search for existing dashboards/reports
8
Dashboard detail page
9
Search for co-workers
10
Central data quality issue portal
11
Data Preview
12
Databricks Lakehouse
13
Deployment detailed
14
Development
15
Metadata surfaced in amunden
16
Lineage information
17
What is table lineage
18
How is the lineage table generated?
19
Statistics information
20
Delta table extended metadata
21
Notebook structure
22
Redash dashboards
23
Sample data
24
Amundsen Open Source
Description:
Explore how Databricks leverages Amundsen, an open-source data discovery tool, to enhance productivity and trust in internal data exploration. Learn about the integration of Amundsen with Databricks' infrastructure to surface metadata, including popular tables, fuzzy and facet search capabilities, and rich dataset information such as lineage, ownership, and usage statistics. Discover how the tool provides insights on ETL jobs, column statistics, and associated dashboards. Gain knowledge about the implementation of user feedback and plans to extend these discovery improvements to Databricks customers. Delve into the deployment details, development process, and specific metadata surfaced in Amundsen, including table lineage generation, Delta table extended metadata, and Redash dashboard integration. Understand the benefits of this data discovery solution compared to the previous static wiki approach and its potential impact on data-driven decision-making within the organization.

Data Discovery at Databricks with Amundsen - Improving Productivity and Trust

Databricks
Add to list
0:00 / 0:00