Back to Search
Journal ArticleOpen Access

Hallucination to truth: a review of fact-checking and factuality evaluation in large language models

Author Affiliations
Artificial Intelligence in Medicine (Canada), United International University, Daffodil International University, Monash University, ...
Published InArtificial Intelligence Review
Year2026
Citations12

Abstract

Abstract Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.