Главная
Study mode:
on
1
- Intro & Overview
2
- Object Recognition as Parse Trees
3
- Capsule Networks
4
- GLOM Architecture Overview
5
- Top-Down and Bottom-Up communication
6
- Emergence of Islands
7
- Cross-Column Attention Mechanism
8
- My Improvements for the Attention Mechanism
9
- Some Design Decisions
10
- Training GLOM as a Denoising Autoencoder & Contrastive Learning
11
- Coordinate Transformations & Representing Uncertainty
12
- How GLOM handles Video
13
- Conclusion & Comments
Description:
Explore Geoffrey Hinton's GLOM model for computer vision in this comprehensive video explanation. Dive into the innovative approach that combines transformers, neural fields, contrastive learning, capsule networks, denoising autoencoders, and RNNs to dynamically construct parse trees for object recognition. Learn about the multi-step consensus algorithm, cross-column attention mechanism, and how GLOM handles video input. Discover the potential of this new AI approach for visual scene understanding, including discussions on architecture, training methods, and design decisions.

How to Represent Part-Whole Hierarchies in a Neural Network - Geoff Hinton's Paper Explained

Yannic Kilcher
Add to list
0:00 / 0:00