Explore the concept of Literate Statistical Programming (LSP) in this 46-minute conference talk by John Peach, Principal Data Scientist at Oracle, presented at the Toronto Machine Learning Series. Discover how LSP addresses the reproducibility crisis in science and data science by binding analysis code to result interpretation. Learn about the philosophy, tooling, and workflow of LSP, which has been successfully applied to both small-scale datasets in startups and massive-scale problems at Oracle and Amazon Alexa. Gain insights into creating a reproducible, auditable, and flexible data science workflow that allows for rapid iteration and experimentation. Understand the guiding principles of LSP and how to transition from an ad-hoc approach to a more structured and efficient methodology for data science projects.
Literate Statistical Programming for Reproducible Data Science