Explore the challenges of managing millions of daily tests for Databricks Runtime, and the solutions developed to address them, in this 25-minute conference talk. Dive into the automated test monitoring and reporting system built using Databricks, and learn how data from sources such as CI systems and Bazel build metadata is ingested into Delta. Discover techniques for analyzing test results, reporting failures to their owners through Jira, and creating effective quality-tracking reports. Gain insight into the deep technical stack, the wide surface area, and the guiding principles behind Databricks' testing approach. Learn about establishing test results and owners tables, building data pipelines, and implementing developer-friendly failure reporting. Understand how to connect problems with the right owners and how to choose appropriate tools for complex testing challenges in large-scale data engineering and machine learning environments.
Managing Millions of Tests Using Databricks - Automated Monitoring and Reporting System