Главная
Study mode:
on
1
Introduction
2
Speech Dataset
3
Dataset Overview
4
Preparing the Dataset
5
Prerequisites
6
Data Dictionary
7
Loop Free
8
Magic
9
Labels
10
Storage
11
Review
12
Store
13
Outro
Description:
Learn how to pre-process a voice dataset by extracting Mel-frequency cepstral coefficients (MFCCs) and saving them in a JSON file in this 37-minute tutorial video. Explore the Speech Commands Dataset and follow along with the provided code to prepare your audio data for deep learning applications. Gain insights into dataset overview, prerequisites, data dictionary creation, and efficient storage techniques. Perfect for those interested in audio processing and machine learning for speech recognition tasks.

Preparing the Speech Dataset

Valerio Velardo - The Sound of AI
Add to list
0:00 / 0:00