Play all

Awesome song and introduction

Loading the modules

Creating the training dataset

Coding Position Encoding

Coding Attention

Coding a Decoder-Only Transformer

Running the model untrained

Training and using the model

Description:

Walk through the process of coding a ChatGPT-like Transformer from scratch using PyTorch in this comprehensive 31-minute video tutorial. Learn how to load necessary modules, create a training dataset, implement position encoding, code attention mechanisms, and build a decoder-only Transformer. Observe the model running untrained before diving into the training process and practical application. Gain insights into the step-by-step implementation with clear explanations of every detail, assuming prior knowledge of decoder-only Transformers, essential matrix algebra for neural networks, and matrix math behind Transformers.

Coding a ChatGPT-Like Transformer From Scratch in PyTorch

StatQuest with Josh Starmer

Add to list

#Computer Science #Deep Learning #PyTorch #Machine Learning #Artificial Intelligence #Neural Networks #Natural Language Processing (NLP) #LLM (Large Language Model) #ChatGPT #Transformers #Attention Mechanisms #Model Training

0:00 / 0:00