1. Intro
2. About Kevin
3. What is dirty data
4. General philosophy
5. Data quality services
6. Bonus
7. Dirty Data
8. Data Consistency
9. Incomplete Data
10. Data Accuracy
11. Duplicate Results
12. Rules Of Thumb
13. Normalization
14. Boyce Codd
15. Data Types
16. Key Constraints
17. Demo
18. Mapping Tables
19. Notebooks
20. Entity Attribute Values
21. Analysis
Description:
Learn data cleansing and preparation techniques using SQL Server and R in this conference talk. Explore the concept of tidy data and see how it simplifies research and analysis of a small but realistic data set. The talk covers the main forms of dirty data: inconsistency, incompleteness, inaccuracy, and duplicate results. It then turns to normalization, Boyce-Codd normal form, data types, and key constraints. Practical demonstrations cover mapping tables, notebooks, and entity-attribute-value tables. The aim is to make data preparation more efficient, a task often said to take up as much as 80% of a data scientist's project time.

Data Cleansing With SQL And R

NDC Conferences