Главная
Study mode:
on
1
Intro
2
Similarity searches
3
Redis
4
Design Patterns
5
Context Retrieval
6
Hide
7
Feature Injection
8
Query Get
9
Guard Rails
10
Longterm Memory
11
Common Mistakes
12
Laziness
13
Index Management
14
Metadata
Description:
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only! Grab it Discover practical advice for using vector databases in production environments during this 30-minute conference talk from the LLMs in Prod Conference. Explore various use cases for vector databases with large language models, including information retrieval, conversational memory for chatbots, and semantic caching. Delve into the less flashy but crucial aspects of implementing these technologies, such as prompt engineering, text chunking, compliance considerations, on-premise solutions, embedding model changes, index types, A/B testing, cloud platform selection, deployment strategies, feature injection, and tool comparisons. Gain insights from a year's worth of Redis deployments for AI use cases, condensed into a comprehensive overview. Learn about similarity searches, design patterns, context retrieval, feature injection, query optimization, guard rails, long-term memory, common mistakes, and index management. Benefit from the expertise of Sam Partee, a Principal Engineer guiding AI efforts at Redis, as he shares valuable knowledge on integrating vector databases into ML pipelines for feature storage, search, and inference workloads. Read more

Using Vector Databases: Practical Advice for Production - LLMs in Prod Conference

MLOps.community
Add to list
0:00 / 0:00