HalluHard Benchmark 38.2% GPT-5.2 Realistic Conversations
https://files.fm/u/hdnp8v7h9h
Decoding Conversational AI Hallucination through Dialogue Accuracy Testing Understanding Hallucination Rates in Domain-Specific Models As of April 2025, the AI community remains obsessed with hallucination rates, how often