Learn Before
Concept

CRUX Controlled RAG Context Evaluation (Ju et al., 2025)

CRUX (Controlled Retrieval-augmented Context Evaluation) is a RAG evaluation framework introduced by Ju et al. (2025) that directly assesses the retrieval-augmented context delivered to the generator, rather than scoring the retrieval module with standard relevance ranking metrics alone. The framework uses human-written summaries to fix the information scope of each query and a question-based scoring procedure: questions derived from the controlled summary are answered against the retrieved context, so the score measures how well the context covers the information actually needed for long-form generation. Because the information scope is held constant across compared systems, CRUX provides a controlled, diagnostic view of retrieval quality and is positioned as a more reflective alternative to relevance-only retrieval metrics for long-form RAG.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

Science

Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls

Related