Bi'an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation Bi’an introduces a bilingual benchmark dataset (Bi’anBench) and lightweight judge models for hallucination detection in Retrieval-Augmented Generation (RAG). The dataset spans
Bi’an: Bilingual Benchmark for RAG Hallucination Detection
By
–
