Главная
Study mode:
on
1
Introduction
2
Setting up Apache airflow with Celery Backend and Postgres
3
Reddit Data Pipeline with airflow
4
Cleaning and Transforming Reddit Data
5
Connecting to AWS from Airflow
6
AWS Glue data transformation
7
Querying Data with Athena
8
Setting up Redshift Data Warehouse
9
Redshift Data Warehouse Query Tool
10
Loading Data into Data Warehouse
11
Charting with Redshift Data Warehouse
Description:
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only! Grab it Embark on a comprehensive end-to-end data engineering journey, focusing on building a Reddit data pipeline using AWS services. Learn to extract data from Reddit's API, orchestrate ETL processes with Apache Airflow and Celery, and efficiently store data in Amazon S3. Discover how to leverage AWS Glue for data cataloging and ETL jobs, query and transform data using Amazon Athena, and set up a Redshift cluster for analytics. Gain insights into best practices for loading data into Amazon Redshift and explore data visualization techniques. Through hands-on demonstrations, master the integration of various tools and technologies to create a seamless ETL process, enhancing your skills in data pipeline engineering and AWS cloud services.

Reddit Data Pipeline Engineering with AWS - End-to-End Data Engineering

CodeWithYu
Add to list
0:00 / 0:00