Play all

intro

preamble

why to automate testing?

how to automate testing?

testing generative models is hard!

string matching

semantic similarity

llm-led evals

closeness between target, actual

using a grading rubric with marvin ai

a couple of other ideas

thank you!

Description:

Explore automated evaluation techniques for RAG chatbots and generative tools in this 14-minute conference talk from Conf42 LLMs 2024. Discover the importance of automating testing for generative models and learn about various approaches, including string matching, semantic similarity, and LLM-led evaluations. Gain insights into using grading rubrics with Marvin AI and explore additional ideas for effective automated testing. Understand the challenges of evaluating generative models and acquire practical strategies to improve your testing processes.

Automated Evaluation for RAG Chatbot or Other Generative Tool - Conf42 LLMs 2024

Conf42

Add to list

#Programming #Software Development #Software Testing #Automated testing #Computer Science #Artificial Intelligence #Chatbot #Generative AI #Machine Learning #Retrieval Augmented Generation (RAG)

0:00 / 0:00