Главная
Study mode:
on
1
Introduction
2
What are we doing
3
Why are we doing this
4
Scaling
5
User perspective
6
Preparing the environment
7
Downloading the data
8
Code
9
Rust bindings
10
Quality
11
Development
12
Preparation
13
Getting Data
14
Contributing
15
Open Data
16
Import CV
17
Loss
18
Propagation
19
Overfitting
20
Acoustic vs Language
21
Terminology
22
German
23
Corpus
24
CSV
25
Language models
26
Conversion
27
Run
28
German Training
29
Next Steps
30
Contact Us
Description:
Explore Mozilla's DeepSpeech and Common Voice projects in this 27-minute conference talk presented by Tilman Kamp at FOSDEM 2018. Dive into the world of open and offline-capable voice recognition technology, learning about the motivations behind these initiatives and their potential impact. Gain insights into scaling challenges, user perspectives, and the technical aspects of preparing the environment and working with data. Discover the development process, including data preparation, contribution methods, and the importance of open data. Examine specific topics such as loss propagation, overfitting, and the differences between acoustic and language models. Understand the application of these technologies to different languages, with a focus on German corpus development and training. Conclude with next steps and ways to get involved in these groundbreaking projects.

Mozilla's DeepSpeech and Common Voice Projects - Open and Offline-Capable Voice Recognition

Mozilla Hacks
Add to list