Главная
Study mode:
on
1
- Intro & Background Info
2
- What we're creating in this video!
3
- Start writing code generating a simple dataframe & csv
4
- Task: Making our data more realistic, selecting some products with higher probability than others
5
- Task: Generate 12 months worth of data in 12 csvs calendar library, f-strings
6
- Make some months have more purchases than others
7
- Normal distributions in NumPy
8
- Improving speed of our code making testing easier
9
- Task: Generate random addresses for our data
10
- Task: Generate order times for purchases datetime library overview
11
- Using timedelta objects to add & subtract time from dates
12
- Generate a realistic quantity ordered for each product using numpy geometric distribution
13
- Add multiple items being more likely to be sold together and cleaning code a bit
Description:
Learn how to generate realistic mock data for sales analysis using Python in this comprehensive tutorial video. Explore the power of NumPy, Pandas, and Datetime libraries to create a sales dataset from scratch. Master techniques for simulating product purchases, implementing normal and geometric distributions, generating realistic timestamps, and creating random addresses. Discover how to improve code efficiency, add multiple item correlations, and produce 12 months of data across separate CSV files. Gain practical skills in data manipulation and synthetic dataset creation for data science projects.

Generating Mock Data with Python - NumPy, Pandas, and Datetime Libraries

Keith Galli
Add to list
0:00 / 0:00