Play all

Intro

Data-Driven Decisions

Data Discovery Not Productive

What is Amundsen

Dataset detail page

Lineage between dashboards and dataset

Search for existing dashboards/reports

Dashboard detail page

Search for co-workers

Central data quality issue portal

Data Preview

Databricks Lakehouse

Deployment detailed

Development

Metadata surfaced in amunden

Lineage information

What is table lineage

How is the lineage table generated?

Statistics information

Delta table extended metadata

Notebook structure

Redash dashboards

Sample data

Amundsen Open Source

Description:

Explore how Databricks leverages Amundsen, an open-source data discovery tool, to enhance productivity and trust in internal data exploration. Learn about the integration of Amundsen with Databricks' infrastructure to surface metadata, including popular tables, fuzzy and facet search capabilities, and rich dataset information such as lineage, ownership, and usage statistics. Discover how the tool provides insights on ETL jobs, column statistics, and associated dashboards. Gain knowledge about the implementation of user feedback and plans to extend these discovery improvements to Databricks customers. Delve into the deployment details, development process, and specific metadata surfaced in Amundsen, including table lineage generation, Delta table extended metadata, and Redash dashboard integration. Understand the benefits of this data discovery solution compared to the previous static wiki approach and its potential impact on data-driven decision-making within the organization.

Data Discovery at Databricks with Amundsen - Improving Productivity and Trust

Databricks

Add to list

#Computer Science #Information Technology #IT Governance #Data Governance #Data Science #Big Data #Databricks #Data Management #Data Lineage

0:00 / 0:00