Explore data-driven strategies for Neural Machine Translation in this 41-minute lecture by Graham Neubig. Delve into various data augmentation techniques, including back translation, meta-back translation, and pivoting for high-resource languages. Learn about machine translation evaluation methods, such as BLEU scores, BERT Score, and COMET. Examine the challenges of high and low-resource languages, and discover approaches like transfer learning and dictionary-based augmentation. Gain insights into word alignment, word-by-word data augmentation, and reordering techniques to enhance translation quality.
CMU Multilingual NLP 2022 - Data-Driven Strategies for NMT