2606.12900v1 Jun 11, 2026 cs.AI

Zero-source LLM Hallucination Detection with Human-like Criteria Probing

Mingkui Tan
Mingkui Tan
Citations: 85
h-index: 5
Jiahao Yang
Jiahao Yang
Citations: 80
h-index: 3
Shuhai Zhang
Shuhai Zhang
Citations: 296
h-index: 10
Hailong Kang
Hailong Kang
Citations: 14
h-index: 2
Feng Liu
Feng Liu
Citations: 1
h-index: 1
Qi Chen
Qi Chen
Citations: 246
h-index: 7

Large language models (LLMs) often hallucinate by generating factually incorrect or unfaithful content, posing significant risks to their safe use. Detecting such hallucinations is particularly challenging under the zero-source constraint, where no model internals or external references are available, and detection must rely solely on the textual query-answer pair. In this paper, we propose Human-like Criteria Probing for Hallucination Detection (HCPD), a paradigm that emulates the multi-faceted reasoning of human evaluators. Its core is a Human-like Criteria Probing (HCP) mechanism, in which a LLM agent adaptively decomposes its judgment into a weighted set of interpretable criteria and aggregates criterion-specific scores into a final truthfulness measure. To achieve this adaptive capability, we introduce a reward-based alignment scheme using only weak supervision from semantic consistency. At inference, we employ a multi-sampling aggregation strategy to ensure robust decisions while preserving full interpretability. We further provide theoretical analysis supporting the reliability of our approach. Extensive experiments show that HCPD consistently outperforms state-of-the-art baselines, offering an effective and explainable solution for zero-source hallucination detection. Code is available at https://github.com/TRISKEL10N/HCPD.

0 Citations
0 Influential
25 Altmetric
125.0 Score
Original PDF
0

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!