1. Intro
2. Vail
3. Agenda
4. Vision and Language
5. Deep Understanding
6. Image Classification
7. Labeled Data
8. Semantic Segmentation
9. Classification Problems
10. Standard Metrics
11. Urban Data
12. Object Detection
13. Universe En
14. Jonathan
15. Demo
16. Results
Description:
Explore cutting-edge research in visual scene understanding and grounded language comprehension in this 1-hour 22-minute talk by Kevin Murphy from Google Research. Delve into topics such as semantic segmentation, object detection, instance segmentation, and person detection/pose estimation, including award-winning systems like DeepLab and entries in the COCO'16 competition. Discover work on visually grounded referring expressions, discriminative image captioning, and generative models of visual imagination. Learn how these components can be integrated to create systems that better comprehend images and words, advancing the field of AI and machine learning. Gain insights from Murphy's extensive experience in computer science, statistics, and machine learning, spanning academia and industry.

Towards Machines that Perceive and Communicate

MITCBMM