Explore the mathematics behind deep learning in this 47-minute conference talk on nonlinear approximation by deep ReLU networks. Review the architecture of neural networks, focusing on the ReLU activation function and its role in approximation theory. Examine the structure of Υ^{W,L}, the set of functions produced by ReLU networks of width W and depth L, compare it with other approaches, and analyze approximation errors and approximation classes. Investigate more general constructions and their consequences, including extremes and manifold approximation. Learn about three key theorems and covering techniques. Gain insights into recent advances in data science that bridge computational statistics, machine learning, optimization, information theory, and learning theory.
Nonlinear Approximation by Deep ReLU Networks - Ron DeVore, Texas A&M