Dive into a comprehensive 4.5-hour tutorial on machine learning data pre-processing and data wrangling using Python. Explore essential Python libraries for data science, learn to handle missing values and perform imputation, and master techniques like one-hot encoding for categorical variables. Discover how to split data into training and test sets, apply feature scaling methods, and detect and treat outliers. Gain proficiency in log and square root transformations for normalizing skewed data, and learn to manipulate DataFrames using Pandas, including adding and dropping columns, creating pivot tables, and merging datasets. Master the use of regular expressions for string manipulation and explore advanced Pandas functions like map, apply, and applymap to enhance your data wrangling skills.
Machine Learning Data Pre-processing and Data Wrangling Using Python