Главная
Study mode:
on
1
Intro - convergence of transformers and CNNs
2
Main diagram explained
3
Main diagram corrections
4
Swin transformer recap
5
Modernizing ResNets
6
Diving deeper: stage ratio
7
Diving deeper: misc inverted bottleneck, depthwise conv...
8
Results classification, object detection, segmentation
9
RIP DanNet
10
Summary and outro
Description:
Explore a comprehensive analysis of the "A ConvNet for the 2020s" paper in this 40-minute video lecture. Delve into the convergence of transformers and CNNs, understand the main diagram and its corrections, and recap the Swin transformer. Learn about modernizing ResNets, dive deeper into stage ratios and miscellaneous topics like inverted bottlenecks and depthwise convolutions. Examine the results in classification, object detection, and segmentation tasks. Gain insights into how ConvNets outperform vision transformers in big data regimes without attention layers, demonstrating the enduring relevance of convolutional priors in computer vision.

ConvNeXt- A ConvNet for the 2020s - Paper Explained

Aleksa Gordić - The AI Epiphany
Add to list
0:00 / 0:00