Главная
Study mode:
on
1
Start
2
Lecture starts
Description:
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only! Grab it Explore a detailed guest lecture that demystifies the complex process of data curation for pretrained language models, delivered by expert Kylo Lo at the University of Utah Data Science department. Gain valuable insights into the methodologies and best practices of preparing and organizing data sets specifically designed for training large language models. Learn about the critical considerations, challenges, and solutions in data curation that directly impact model performance and reliability. Discover practical approaches to data selection, cleaning, and preprocessing through this comprehensive 47-minute presentation that begins with a brief introduction before diving into the core technical content.

Demystifying Data Curation for Pretrained Language Models

UofU Data Science
Add to list