Главная
Study mode:
on
1
Intro
2
The Compass Model
3
Babylon Health
4
AI Doctor
5
News Removed
6
Google Translate
7
Adverse adversarial attacks
8
The numbers
9
Costing money
10
Why this happens
11
Translation
12
Noise
13
Biases
14
Visualization
15
Data Availability
16
Data Consistency
17
Data Leakage
18
Titanic Dataset
19
How to Fix Missing Data
20
Feature Noise
21
Noise Reduction
22
Anomalies
23
Wine Quality
Description:
Explore the critical impact of data quality on machine learning projects in this 44-minute conference talk from GOTO Chicago 2019. Discover why bad data is costing the US economy trillions and learn practical techniques to identify and fix data issues. Delve into high-profile case studies, including Microsoft's Tay.ai chatbot, to understand the consequences of poor data management. Gain insights into the Compass Model, AI in healthcare, and challenges in Google Translate. Examine common data problems such as noise, biases, and leakage, and master strategies for data visualization, consistency, and anomaly detection. Apply these concepts to real-world examples like the Titanic dataset and wine quality analysis. Equip yourself with essential skills to improve data quality and boost the success of your machine learning projects.

Keep it Clean - Why Bad Data Ruins Projects and How to Fix It

GOTO Conferences
Add to list
0:00 / 0:00