Studying Generalization in Deep Learning via PAC-Bayes

What might generalization theory offer deep learning?
Barriers to explaining generalization
PAC-Bayes yields risk bounds for Gibbs classifiers
PAC-Bayes generalization bounds
PAC-Bayes bounds on deterministic classifiers
Distribution-dependent approximations of optimal priors via privacy
A question of interpretation
Use SGD to predict SGD
Data and distribution priors for neural networks
MNIST results: coupled data-dependent priors and posteriors
Oracle access to optimal prior covariance
Bounds with oracle covariance + ghost sample
Bounds on 32k samples vs. 64k samples
Recap and Conclusion
Description:
Explore the intersection of generalization theory and deep learning in this 45-minute lecture from the Frontiers of Deep Learning series. Delve into PAC-Bayes theory and its application to risk bounds for Gibbs classifiers and deterministic classifiers. Examine distribution-dependent approximations of optimal priors, the role of privacy, and the use of SGD to predict SGD. Investigate data- and distribution-dependent priors for neural networks, focusing on MNIST results with coupled data-dependent priors and posteriors. Analyze bounds obtained with an oracle prior covariance and a ghost sample, comparing bounds computed from 32k versus 64k samples. Gain insight into what generalization theory might offer deep learning and the barriers to explaining generalization.
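For orientation, one standard form of the PAC-Bayes bound underlying these topics (the McAllester/Maurer version, quoted here as background rather than taken from the slides) states that for any prior $P$ chosen independently of the sample and any $\delta \in (0,1)$, with probability at least $1-\delta$ over an i.i.d. sample $S$ of size $n$, every posterior $Q$ satisfies

$$
\mathbb{E}_{h\sim Q}\!\left[L_{\mathcal{D}}(h)\right]
\;\le\;
\mathbb{E}_{h\sim Q}\!\left[\hat{L}_{S}(h)\right]
\;+\;
\sqrt{\frac{\mathrm{KL}(Q\,\|\,P)+\ln\frac{2\sqrt{n}}{\delta}}{2n}},
$$

where $L_{\mathcal{D}}$ is the expected 0-1 risk, $\hat{L}_{S}$ is the empirical risk on $S$, and the left-hand side is the risk of the Gibbs classifier that draws $h \sim Q$ afresh for each prediction. The data- and distribution-dependent priors discussed in the lecture aim to keep the $\mathrm{KL}(Q\,\|\,P)$ term small for posteriors reached by SGD, which is what makes such bounds numerically meaningful for neural networks.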