1. Intro
2. WTF is architecture? Why multiarch?
3. History: 80s, 90s, 00s, 10s, and beyond
4. If it ain't broke...
5. ARM is more efficient.
6. Data storage engine and analytics tool
7. Service Level Objectives (SLO)
8. SLOs are user flows
9. Same reliability, lower costs with ARM64
10. Complexity stayed manageable
11. Prod: customers observe data
12. Kibble observes dogfood
13. Dogfood observes prod
14. Service Architecture
15. Shepherd: ingest API service
16. Is it feasible to migrate?
17. Producing artifacts for Arm64
18. Initial findings
19. A/B testing
20. Dogfood Shepherd cost reduction
21. Migrated prod Shepherd
22. Migrated prod Retriever
23. AWS ran out of m6gd spot instances
24. Kafka + the long tail
25. Graviton2 going strong
26. Have a measurable goal in mind
27. Acknowledge hidden risks
28. Take care of your people
29. Optimize for safety
30. Graviton2 blog posts
Description:
Explore the journey of Honeycomb.io, a Series B startup in the observability space, as they evaluate and implement arm64 processor architecture to optimize cost and performance of their telemetry ingest and indexing workload. Dive into the process of setting up the evaluation, full migration, and improvements made to the ecosystem over a year-long period. Learn how 92% of all compute workloads were successfully migrated to arm64, resulting in a 40% drop in compute costs and modest improvements in end-user visible latency. Discover the roadblocks and challenges faced, including lack of full software compatibility, hidden performance quirks, and additional complexity. Gain insights into the history of processor architectures, the efficiency of ARM, and the importance of Service Level Objectives (SLOs) in user flows. Explore the service architecture, including the Shepherd ingest API service and Retriever, and understand the steps taken to migrate production environments. Examine the impact of AWS instance availability and Kafka on the migration process. Conclude with valuable lessons learned, including setting measurable goals, acknowledging hidden risks, prioritizing team well-being, and optimizing for safety in large-scale migrations.

Optimizing Cost and Performance with Arm64

USENIX