Главная
Study mode:
on
1
Intro
2
Action Recognition
3
Temporal Action Localization
4
Outline
5
Knowledge Transfer to Novel Categories
6
Comparison with Semantic Attributes (THUMOS)
7
Experiment Results on UCF101
8
Pipeline Overview
9
Detecting Actions
10
Experimental Setup
11
Detection examples
12
Generalizing Faster R-CNN from 2D to 3D
13
Tube Proposal Network
14
Tube of Interest Max Pooling
15
Experiment results on UCF-Sports
16
Evaluation on YouTube Videos
17
Limitations
18
Video Action Segmentation -- Overview
19
Video Object Segmentation -- Overview
20
Video Object Segmentation -- Encoder
21
Video Object Segmentation - 3D Pyramid Pooling
22
Video Object Segmentation -- Decoder
23
Dilated Convolution
24
Summary
25
Future Work
Description:
Explore action recognition, temporal localization, and detection in trimmed and untrimmed videos through this 36-minute lecture by Rui Hou from the University of Central Florida. Dive into topics such as knowledge transfer to novel categories, comparison with semantic attributes, and experimental results on datasets like UCF101 and UCF-Sports. Learn about the pipeline for detecting actions, including the adaptation of Faster R-CNN from 2D to 3D, Tube Proposal Network, and Tube of Interest Max Pooling. Examine video action and object segmentation techniques, including encoder-decoder architectures, 3D Pyramid Pooling, and dilated convolution. Gain insights into the current limitations and future directions of research in this field.

Action Recognition, Temporal Localization and Detection in Videos

University of Central Florida
Add to list
0:00 / 0:00