Главная
Study mode:
on
1
Introduction
2
Netflixs Microservices
3
Time Series Database
4
Alerts Only for Zul
5
Dynamic Thresholds
6
Adaptive Thresholds
7
Anomaly Detector Raju
8
Results
9
No Machine Learning
10
How We Built It
11
Impact Graph
12
Context
13
Accuracy
14
Operational Burden
15
Realtime alerting
16
Realtime events
17
Mantis
18
How it works
19
Querying
20
Stream Processing
21
Aggregate
22
Job Chain
23
Requirements
24
Median estimation
25
Mad
26
Raju
27
Simple
28
Recovery Detection
29
Recovery Algorithm
30
What Raju Looks Like
31
Permutations
32
Data Visualization
33
Impact Assessment
34
Timeline of Events
35
What is API
36
Another needle in a haystack
37
Example
38
Gold Standard KPI
39
Spinnaker Events
40
Emailing the culprits
41
Benefits
42
Conclusion
Description:
Explore a comprehensive talk from the Strange Loop Conference on building a scalable anomaly detection system without using machine learning. Dive into Netflix's approach to detecting and pinpointing failures in their complex cloud architecture, composed of thousands of services and hundreds of thousands of VMs and containers. Learn how Zuul, Netflix's front-door for all cloud traffic, is leveraged to stream real-time events and identify broken paths in their microservices maze. Discover the innovative use of stream processing, anomaly detection algorithms, and a rules engine to create an efficient system capable of handling millions of requests across thousands of nodes. Gain insights into the benefits of using "old-fashioned math" over machine learning in certain scenarios, and understand the implementation of dynamic and adaptive thresholds. Examine the anomaly detection algorithm in-depth, including median estimation, MAD, and recovery detection. Explore the impact assessment process, data visualization techniques, and the use of Spinnaker events for more accurate problem identification. Understand how this system provides real-time alerting, reduces operational burden, and improves accuracy in detecting service issues within Netflix's complex microservices architecture. Read more

Scalable Anomaly Detection - With Zero Machine Learning

Strange Loop Conference
Add to list
0:00 / 0:00