Главная
Study mode:
on
1
Introduction
2
About Koichi
3
Why Raspberry Pi
4
Demos
5
Hardware
6
Benchmarks
7
Software Tools
8
Intel Software
9
Video 4 Architecture
10
Algorithm Selection
11
HWC Layout
12
Conversion Corners
13
Remove IO Control Overhead
14
CPU bottleneck
15
Activecast
16
Conclusion
Description:
Explore the acceleration of deep learning inference on Raspberry Pi's VideoCore GPU in this 26-minute conference talk from tinyML Asia 2020. Discover how Koichi NAKAMURA from Idein Inc. leverages the underutilized VideoCore IV/VI GPU to achieve significant speedups in machine learning models without compromising accuracy. Learn about the development of specialized device programming tools, libraries, math kernels, and an optimizing graph compiler for VideoCore GPGPU usage. Gain insights into the architectural features of VideoCore, acceleration techniques, and additional research on ARM CPUs, Intel GPUs, and FPGAs. Delve into topics such as hardware benchmarks, software tools, Video 4 Architecture, algorithm selection, HWC layout, conversion corners, and strategies to remove I/O control overhead and address CPU bottlenecks.

Acceleration of Deep Learning Inference on Raspberry Pi's VideoCore GPU

tinyML
Add to list