Skip to content

feat: Add RAG evaluation cookbook using Claude as judge#389

Draft
Abanoubr wants to merge 1 commit intoanthropics:mainfrom
Abanoubr:feature/rag-evaluation-cookbook
Draft

feat: Add RAG evaluation cookbook using Claude as judge#389
Abanoubr wants to merge 1 commit intoanthropics:mainfrom
Abanoubr:feature/rag-evaluation-cookbook

Conversation

@Abanoubr
Copy link

Adds a new cookbook notebook demonstrating how to use Claude to evaluate RAG pipeline quality. Covers faithfulness, answer relevancy, and context precision metrics with working code examples and a full evaluation report on a sample dataset.

This notebook provides a framework for evaluating RAG pipelines using Claude, focusing on metrics like faithfulness, answer relevancy, and context precision.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant