Главная
Study mode:
on
1
Intro
2
Nearline Applications
3
Heterogeneous Data Systems
4
Building the Right Infrastructure
5
Pluggable Sources & Destinations
6
Capturing Live Updates
7
Change Data Capture (CDC)
8
Streaming Bridge
9
Mirroring Kalka Data
10
Brooklin Kalka Mirroring
11
Application Use Cases
12
Client makes REST call to create datastream
13
Datastream is written to Zookeeper
14
Leader coordinator is notified of new datastream
15
Leader coordinator calculates work distribution
16
Leader coordinator writes the assignments to ZK
17
Zookeeper is used to communicate the assignments
18
Coordinators hand task assignments to consumers
19
Consumers start streaming data from the source
20
Consumers propagate data to producers
21
Producers write data to the destination
22
Destinations can be shared by apps
23
Brooklin Architecture
24
Brooklin in Production
25
Brooklin is now open-source!
Description:
Explore a comprehensive overview of Brooklin, LinkedIn's managed data streaming service, in this 41-minute Strange Loop Conference talk. Learn how Brooklin addresses the challenges of scaling up to handle increasing data volume and supporting the proliferation of new data systems at LinkedIn. Discover the architecture and use cases of this centralized, extensible solution for continuously delivering data to nearline applications. Examine Brooklin's support for multiple pluggable sources and destinations, including data stores and messaging systems. Delve into its applications in change data capture (CDC), data propagation between different systems and environments, and Kafka data mirroring. Gain insights into Brooklin's production implementation, its role in replacing Kafka MirrorMaker at LinkedIn, and future development plans for this open-source project.

Dive into Streams with Brooklin

Strange Loop Conference
Add to list