Learn essential data cleansing and preparation techniques using SQL Server and R in this comprehensive conference talk. Explore the concept of tidy data and discover how to simplify research and analysis of a small but realistic data set. Dive into various aspects of dirty data, including consistency, incompleteness, accuracy, and duplicate results. Gain insights into normalization, Boyce Codd normal form, data types, and key constraints. Follow along with practical demonstrations on mapping tables, notebooks, and entity-attribute values. Acquire valuable skills to efficiently handle the time-consuming task of data preparation, which often consumes up to 80% of a data scientist's project time.