1. Intro
2. The Architecture of the Transformer
3. Model Training
4. Transformer LM Component 1: FFNN
5. Transformer LM Component 2: Self-Attention
6. Tokenization: Words to Token Ids
7. Embedding: Breathe Meaning into Tokens
8. Projecting the Output: Turning Computation into Language
9. Final Note: Visualizing Probabilities
Description:
Explore the Transformer architecture, the foundation of state-of-the-art AI/ML models such as BERT and GPT, in this 30-minute visual presentation. Delve into the components of Transformer language models, including feed-forward neural networks and self-attention mechanisms. Learn about the tokenization, embedding, and output projection processes, and gain insights into model training and probability visualization. Suitable for viewers with varying levels of machine learning experience, this accessible video provides a comprehensive overview of the Transformer model's structure and its applications in natural language processing.
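The pipeline the video walks through (tokenize → embed → self-attention + FFNN → project to vocabulary probabilities) can be sketched as a toy NumPy program. This is a minimal illustration with random, untrained weights; all names (`Wq`, `next_token_probs`, the four-word vocabulary, etc.) are hypothetical and not taken from the video:

```python
import numpy as np

# Illustrative, untrained toy pipeline: tokenize -> embed -> one
# transformer block (causal self-attention + FFNN) -> output projection.
# Weights are random, so the probabilities are meaningless; only the
# shapes and the flow of data match the components described above.

rng = np.random.default_rng(0)
vocab = {"the": 0, "robot": 1, "must": 2, "obey": 3}  # toy tokenizer table
d = 8  # model (embedding) dimension

E = rng.normal(size=(len(vocab), d))          # token embedding matrix
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
W1 = rng.normal(size=(d, 4 * d))              # FFNN expand
W2 = rng.normal(size=(4 * d, d))              # FFNN contract
Wout = E.T                                    # tied output projection

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def next_token_probs(token_ids):
    x = E[token_ids]                          # (seq, d) embeddings
    q, k, v = x @ Wq, x @ Wk, x @ Wv          # self-attention projections
    scores = q @ k.T / np.sqrt(d)
    mask = np.triu(np.full((len(token_ids),) * 2, -np.inf), k=1)
    h = softmax(scores + mask) @ v + x        # causal attention + residual
    h = h + np.maximum(0.0, h @ W1) @ W2      # FFNN (ReLU) + residual
    logits = h[-1] @ Wout                     # last position -> vocab scores
    return softmax(logits)                    # probabilities over the vocab

ids = [vocab[w] for w in ["the", "robot", "must"]]
probs = next_token_probs(ids)
print(probs.shape, probs.sum())               # one probability per token id
```

The causal mask keeps each position from attending to later tokens, matching how a language model predicts the next token; a real model would stack many such blocks and learn the weights during training.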

The Narrated Transformer Language Model

Jay Alammar