Better Reliability Through Observability and Experimentation - Julie Gunderson & Kerim Satirli
Description:
Explore the intersection of Site Reliability Engineering (SRE), observability, and experimentation in this 37-minute conference talk from KubeCon + CloudNativeCon. Discover how to treat reliability as an organizational challenge rather than just a software problem. Learn practical approaches to improve service reliability through simulated outages, observability techniques, and analysis. Gain insights into determining workload misbehavior and preparing for service disruptions. Understand how to leverage OpenTelemetry and OpenTracing to enhance system reliability beyond deployments. Join Julie Gunderson from Gremlin and Kerim Satirli from HashiCorp as they guide you through a journey of better reliability practices in cloud-native environments.
Better Reliability Through Observability and Experimentation