Language Models • Language models are generative models of text
Conditioned Language Models
Calculating the Probability of a Sentence
Conditional Language Models
One Type of Language Model (Mikolov et al. 2011)
How to Pass Hidden State?
The Generation Problem
Ancestral Sampling
Greedy Search
Beam Search
Ensembling • Combine predictions from multiple models
Linear Interpolation • Take a weighted average of the M model probabilities
Log-linear Interpolation • Take a weighted combination of the log probabilities, then renormalize
Linear or Log-linear?
Parameter Averaging
Ensemble Distillation (e.g. Kim et al. 2016)
Stacking
Still a Difficult Problem!
From Speaker/Document Traits (Hoang et al. 2016)
From Lists of Traits (Kiddon et al. 2016)
From Word Embeddings (Noraset et al. 2017)
Basic Evaluation Paradigm
Human Evaluation Shared Tasks
Embedding-based Metrics
Perplexity
Which One to Use?
Description:
Learn about conditioned generation in neural networks for natural language processing in this lecture from CMU's Neural Networks for NLP course. Explore encoder-decoder models, conditional generation techniques, and search algorithms like beam search. Examine methods for ensembling multiple models, including linear interpolation and parameter averaging. Discover various types of data that can be used to condition language models, from speaker traits to word embeddings. Gain insights into evaluation paradigms for generative models, covering human evaluation, embedding-based metrics, and perplexity. Understand the strengths and limitations of different evaluation approaches for assessing language model performance.
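
To make a few of the techniques named in the description concrete, here is a minimal Python sketch, not taken from the lecture itself: the toy distributions and interpolation weights below are made up purely for illustration. It shows ensembling the next-word distributions of two models by linear and by log-linear interpolation, and the standard way perplexity is computed from per-word log probabilities.

import numpy as np

def linear_interpolation(probs, weights):
    # Weighted average of the M per-model probability vectors (weights sum to 1).
    return sum(w * p for w, p in zip(weights, probs))

def log_linear_interpolation(probs, weights):
    # Weighted combination of the log probabilities, renormalized with a softmax.
    combined = sum(w * np.log(p) for w, p in zip(weights, probs))
    shifted = np.exp(combined - combined.max())
    return shifted / shifted.sum()

def perplexity(log_probs):
    # exp of the negative mean per-word log likelihood.
    return float(np.exp(-np.mean(log_probs)))

# Made-up next-word distributions over a 4-word vocabulary from two models.
p1 = np.array([0.5, 0.2, 0.2, 0.1])
p2 = np.array([0.1, 0.6, 0.2, 0.1])
print(linear_interpolation([p1, p2], [0.5, 0.5]))      # smooth average of the two
print(log_linear_interpolation([p1, p2], [0.5, 0.5]))  # favors words both models like
print(perplexity(np.log([0.25, 0.1, 0.5])))            # toy per-word probabilities

With equal weights, log-linear interpolation renormalizes the geometric mean of the models' distributions, so it concentrates probability on words all models agree on, whereas linear interpolation is more forgiving when one model assigns a word low probability; this is the trade-off behind the "Linear or Log-linear?" slide above.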