Главная
Study mode:
on
1
Intro
2
If data is fuel, then we need to measure its value
3
Data value in the context of ML
4
Ingredients of Data Value in ML
5
Leave One Out Method
6
Desirable properties
7
Data Shapley Value
8
Applications of Data Shapley
9
UK Biobank Lung Cancer prediction
10
Removing low value data improves prediction
11
Adding high value data improves prediction
12
Negative Shapley identifies mislabeled data
13
Domain adaptation: face recognition
14
Dermatology classification
15
Clinical notes NLP
16
Efficiently approximating data Shapley
17
New frontiers of data valuation
18
Discussion
Description:
Explore the concept of equitable data valuation in machine learning through this 44-minute lecture by James Zou from Stanford University. Delve into the importance of measuring data value, focusing on its role in machine learning contexts. Learn about the Leave One Out Method and Data Shapley Value, understanding their desirable properties and applications. Examine real-world case studies, including UK Biobank lung cancer prediction, face recognition domain adaptation, dermatology classification, and clinical notes NLP. Discover how removing low-value data and adding high-value data impacts prediction accuracy, and how negative Shapley values can identify mislabeled data. Gain insights into efficient approximation methods for data Shapley and explore new frontiers in data valuation.

What is Your Data Worth? Equitable Data Valuation in Machine Learning

Simons Institute
Add to list
0:00 / 0:00