Journal ArticleOpen Access
Hallucination to truth: a review of fact-checking and factuality evaluation in large language models
Authors
Author Affiliations
Artificial Intelligence in Medicine (Canada), United International University, Daffodil International University, Monash University, ...
Published InArtificial Intelligence Review
Year2026
Citations12
Abstract
Abstract Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the…
View at Publisher
BORR does not host full-text PDFs. The button above takes you to the original publisher.