Главная
Study mode:
on
1
Intro: ChatGPT, Language Models and the Goals of Generalist Robotics Policies
2
Reading and exploring the data
3
Creating a Dataset
4
Creating a Dataset
5
Creating the transformer encoder
6
Creating image patches to tokenized
7
Putting together the VIT
8
Training the VIT
9
Making the GRP, starting with adding text inputs
10
Modifying the data for training
11
Converting continuous actions to discrete bins
12
Converting continuous actions to discrete bins
13
Standardizing the state inputs
14
Changing to use continuous actions
15
Standizing the action space
16
Adding goal images to the transformer
17
Adding blocked masked attention to use either goal
18
Scaling training
19
Training results across A100s
20
Evaluation using the SimpleEnv robotics simulator
Description:
Dive into a comprehensive video tutorial on building Generalist Robotics Policies from scratch. Learn how to implement the "Octo: An Open-Source Generalist Robot Policy" model step-by-step, starting with basic transformer code and progressing to training the model using data from the open-x embodiment dataset. Explore topics such as data exploration, dataset creation, transformer encoder implementation, image patch tokenization, and Vision Transformer (ViT) construction. Discover techniques for incorporating text inputs, handling continuous and discrete actions, standardizing state inputs and action spaces, and integrating goal images into the transformer architecture. Gain insights into scaling training processes, analyzing results across A100 GPUs, and evaluating the model using the SimpleEnv robotics simulator. Access accompanying code, project details, and additional resources to enhance your understanding of Generalist Robotics Policies and their applications in the field of robotics. Read more

Building Generalist Robotics Policies from Scratch

Montreal Robotics
Add to list
0:00 / 0:00