1. Intro
2. Data Challenges in Low-resource MT
3. Multilingual Training Approaches
4. Data Augmentation 101: Back Translation
5. Back Translation Idea
6. How to Generate Translations
7. Iterative Back-translation
8. Back Translation Issues
9. English - HRL Augmentation
10. Augmentation via Pivoting
11. Data w/ Various Types of Pivoting
12. Monolingual Data Copying
13. Dictionary-based Augmentation
14. An Aside: Word Alignment
15. Word-by-word Data Augmentation
16. Word-by-word Augmentation w/ Reordering
Description:
This 25-minute lecture from CMU's Multilingual Natural Language Processing course covers data augmentation techniques for machine translation (MT). It opens with the data challenges of low-resource MT, then surveys methods that draw on monolingual data and on high-resource languages (HRLs): multilingual training approaches, back translation (including iterative back-translation and its known issues), English-HRL augmentation, pivoting strategies, and monolingual data copying. It closes with dictionary-based techniques, including an aside on word alignment and word-by-word data augmentation with reordering.

CMU Multilingual NLP 2020 - Data Augmentation for Machine Translation

Graham Neubig