Journal ArticleOpen Access

Hallucination to truth: a review of fact-checking and factuality evaluation in large language models

Authors

S M Asif Ur Rahman, Md. Adnanul Islam, Md. Mahbub Alam, Musarrat Zeba, …

Author Affiliations

Artificial Intelligence in Medicine (Canada), United International University, Daffodil International University, Monash University, ...

Published InArtificial Intelligence Review

Year2026

Citations12

DOI10.1007/s10462-025-11454-w

Abstract

Abstract Large language models (LLMs) are trained on vast and diverse internet corpora that often include inaccurate or misleading content. Consequently, LLMs can generate misinformation, making robust fact-checking essential. This review systematically analyzes how LLM-generated content is evaluated for factual accuracy by exploring key challenges such as hallucinations, dataset limitations, and the reliability of evaluation metrics. The review emphasizes the need for strong fact-checking frameworks that integrate advanced prompting strategies, domain-specific fine-tuning, and retrieval-augmented generation (RAG) methods. It proposes five research questions that guide the analysis of the recent literature from 2020 to 2025, focusing on evaluation methods and mitigation techniques. Instruction tuning, multi-agent reasoning, and RAG frameworks for external knowledge access are also reviewed. The key findings demonstrate the…

View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.

Fields & Keywords

Social Sciences Sociology and Political Science Misinformation and Its Impacts Topic Modeling Computational and Text Analysis Methods Data science Artificial intelligence Natural language processing