Главная
Study mode:
on
1
- Intro and overview of XLSTM
2
- Problems with LSTMs
3
- Recurrent Neural Networks RNNs
4
- LSTMs overview
5
- Drawbacks of LSTMs
6
- Sigmoid vs Exponential Function
7
- sLSTM block
8
- Normalizer in sLSTM
9
- Stabilizer in sLSTM
10
- mLSTM block
11
- Detailed block of sLSTM
12
- Detailed block of mLSTM
13
- XLSTM
14
- Advantages of XLSTM
15
- Evaluation
Description:
Explore a 14-minute technical video that delves into the evolution and enhancement of Long Short-Term Memory Networks (LSTMs) through the introduction of XLSTM (Extended LSTM). Learn about the historical limitations of traditional LSTMs in parallel processing and GPU utilization, and discover how the new XLSTM architecture addresses these constraints through its two main components: sLSTM and mLSTM. Understand the mathematical foundations, including detailed equations, as the video progresses from basic concepts of Recurrent Neural Networks to advanced implementations. Compare the performance of this parallel-capable LSTM variant with modern transformers, examining the technical specifications of both the normalizer and stabilizer in sLSTM, and the comprehensive structure of mLSTM blocks. Gain insights into the practical advantages and evaluation metrics of XLSTM, making it relevant for professionals working with sequence-related tasks such as text generation and translation.

XLSTM: Understanding Extended LSTMs with sLSTM and mLSTM Architecture

AI Bites
Add to list