AI Agent Evaluation with RAGAS

Published: 04 April 2024
on channel: James Briggs

14,493

288

RAGAS (RAG ASsessment) is an evaluation framework for RAG pipelines. Here, we see how to use RAGAS for evaluating an AI agent built using LangChain and using Anthropic's Claude 3, Cohere's embedding models, and the Pinecone vector database.

📌 Code:
https://github.com/pinecone-io/exampl...

📕 Article:
https://www.pinecone.io/learn/series/...

🌲 Subscribe for Latest Articles and Videos:
https://www.pinecone.io/newsletter-si...

👋🏼 AI Consulting:
https://aurelio.ai

👾 Discord:
  / discord

Twitter:   / jamescalam
LinkedIn:   / jamescalam

00:00 RAG Evaluation
00:39 Overview of LangChain RAG Agent
03:04 RAGAS Code Prerequisites
03:40 Agent Output for RAGAS
05:14 RAGAS Evaluation Format
08:04 RAGAS Metrics
08:56 Understanding RAGAS Metrics
09:16 Retrieval Metrics
11:55 RAGAS Context Recall
14:43 RAGAS Context Precision
15:52 Generation Metrics
16:05 RAGAS Faithfulness
17:16 RAGAS Answer Relevancy
18:40 Metrics Driven Development

#ai #artificialintelligence #nlp #chatbot #langchain

Watch video AI Agent Evaluation with RAGAS online without registration, duration hours minute second in high quality. This video was added by user James Briggs 04 April 2024, don't forget to share it with your friends and acquaintances, it has been viewed on our site 14,493 once and liked it 288 people.

1,953