Главная
Study mode:
on
1
Intro
2
Network Telemetry is Critical for Network Managem
3
Key Challenge for Telemetry in Production: Evolvability Network devices and management applications are constantly evolving
4
Magnitude of Changes
5
Incident 1: Changes Affect Many Components
6
Incident 2: Data Misinterpretation
7
Bringing Changes to First-Class Citizens in Telemetry
8
PCAT: Production Change-Aware Telemetry System
9
Change Abstraction: Change Cube
10
Change Attribution: Three Generations of Telemetry
11
and Gen2
12
's Problems
13
PCAT) Layering Design
14
Change Exploration: Topology Derivation Creates derived topology from normalized device-level data.
15
Open Questions: Adaptive Telemetry Primitives
16
Open Questions: Trustful Telemetry
17
Summary
Description:
Explore a conference talk on PCAT, a production change-aware telemetry system designed to handle changes in fast-evolving networks at Facebook. Delve into the challenges of network telemetry in large-scale production environments, focusing on the impact of constant changes in data collection, processing, interpretation, and consumption by applications. Learn about the innovative change cube abstraction for systematically tracking changes and the intent-based layering design for confining and monitoring modifications. Gain insights into real-world incidents demonstrating the magnitude of changes and their effects on multiple components. Discover how PCAT addresses these challenges, including its approach to change attribution and topology derivation. Consider open questions in adaptive telemetry primitives and trustful telemetry as you explore the future of network monitoring in rapidly evolving environments.

Evolvable Network Telemetry at Facebook

USENIX
Add to list