2311.05232 Nov 09, 2023 cs.AI

대규모 언어 모델의 환각 현상에 대한 조사: 원리, 분류 체계, 도전 과제 및 미해결 문제

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Lei Huang

Citations: 3,246

h-index: 10

Weijiang Yu

Citations: 3,526

h-index: 9

Weitao Ma

Citations: 3,228

h-index: 10

Weihong Zhong

Citations: 3,258

h-index: 10

Zhangyin Feng

Citations: 8,975

h-index: 12

Haotian Wang

Citations: 3,514

h-index: 8

Qianglong Chen

Zhejiang University

Citations: 3,654

h-index: 14

Weihua Peng

Citations: 3,402

h-index: 5

Xiaocheng Feng

Citations: 10,669

h-index: 31

Bing Qin

Citations: 3,455

h-index: 12

Ting Liu

Citations: 3,478

h-index: 7

대규모 언어 모델(LLM)의 등장은 자연어 처리(NLP) 분야에서 중대한 돌파구가 되었으며, 정보 습득의 패러다임 전환을 가속화했다. 그럼에도 불구하고, LLM은 그럴듯해 보이지만 사실이 아닌 내용을 생성하는 환각(hallucination) 현상을 일으키기 쉽다. 이러한 현상은 실제 정보 검색(IR) 시스템에서 LLM의 신뢰성에 대한 중대한 우려를 낳고 있으며, 이러한 환각을 탐지하고 완화하기 위한 집중적인 연구를 촉발했다. LLM에 내재된 개방형 범용 속성을 고려할 때, LLM의 환각은 기존의 과제 특화 모델과는 다른 독특한 도전 과제를 제시한다. 이러한 차이점은 LLM 환각에 대한 최근의 발전 사항들을 심도 있게 이해하고 포괄적으로 개관하는 것이 시급함을 시사한다. 본 조사에서는 LLM 시대의 환각에 대한 혁신적인 분류 체계로 시작하여, 환각을 유발하는 요인들을 심층적으로 다룬다. 이어서 환각 탐지 방법론과 벤치마크에 대한 철저한 개요를 제시한다. 그 후 LLM 환각을 완화하기 위한 대표적인 방법론으로 논의를 확장한다. 또한 환각에 대응하는 데 있어 검색 증강 LLM이 직면한 현재의 한계를 살펴보고, 더욱 견고한 IR 시스템을 개발하기 위한 통찰력을 제공한다. 마지막으로, 대규모 비전-언어 모델에서의 환각 및 LLM 환각 내 지식 경계에 대한 이해를 포함하여, LLM 환각에 관한 유망한 연구 방향을 강조한다.

Original Abstract

The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), fueling a paradigm shift in information acquisition. Nevertheless, LLMs are prone to hallucination, generating plausible yet nonfactual content. This phenomenon raises significant concerns over the reliability of LLMs in real-world information retrieval (IR) systems and has attracted intensive research to detect and mitigate such hallucinations. Given the open-ended general-purpose attributes inherent to LLMs, LLM hallucinations present distinct challenges that diverge from prior task-specific models. This divergence highlights the urgency for a nuanced understanding and comprehensive overview of recent advances in LLM hallucinations. In this survey, we begin with an innovative taxonomy of hallucination in the era of LLM and then delve into the factors contributing to hallucinations. Subsequently, we present a thorough overview of hallucination detection methods and benchmarks. Our discussion then transfers to representative methodologies for mitigating LLM hallucinations. Additionally, we delve into the current limitations faced by retrieval-augmented LLMs in combating hallucinations, offering insights for developing more robust IR systems. Finally, we highlight the promising research directions on LLM hallucinations, including hallucination in large vision-language models and understanding of knowledge boundaries in LLM hallucinations.

3075 Citations

100 Influential

15.5 Altmetric

3,352.5 Score

Original PDF

AI Analysis

Korean Summary

이 논문은 대형 언어 모델(LLM)에서 발생하는 환각(Hallucination) 현상에 대한 포괄적인 조사 연구입니다. 저자들은 기존의 자연어 생성(NLG) 태스크와 구별되는 LLM 시대의 환각을 '사실성 환각(Factuality Hallucination)'과 '충실성 환각(Faithfulness Hallucination)'으로 새롭게 분류하는 분류 체계를 제안합니다. 논문은 환각의 원인을 데이터, 학습(사전 학습 및 정렬), 추론 단계로 나누어 심층 분석하고, 이를 탐지하기 위한 방법론과 평가 벤치마크를 정리합니다. 또한, 데이터 필터링, 모델 편집, 검색 증강 생성(RAG), 추론 시 디코딩 전략 등 다양한 환각 완화 기법을 검토하며, 특히 RAG 시스템에서 발생하는 고유한 환각 문제와 멀티모달 모델에서의 향후 과제까지 폭넓게 다루고 있습니다.

Key Innovations

LLM에 특화된 환각 분류 체계 제안 (사실성 위반 vs 입력/맥락에 대한 충실성 위반)
데이터, 학습(Pre-training, SFT, RLHF), 추론 단계별 환각 발생 원인의 체계적 분석
검색 증강 생성(RAG) 과정에서 발생하는 검색 실패 및 생성 병목 현상에 대한 세부 분석
환각 완화를 위한 최신 기법(사실성/충실성 강화 디코딩, 모델 편집, RAG 최적화 등)의 포괄적 정리

Learning & Inference Impact

학습 단계에서는 데이터의 오정보 및 편향 제거, 그리고 모델이 자신의 지식 경계(Knowledge Boundary)를 인지하고 모르는 것에 대해 답변을 거부할 수 있도록 하는 정렬(Alignment) 학습의 중요성을 강조합니다. 추론 단계에서는 모델 내부의 불확실성을 추정하여 환각을 탐지하거나, DoLa(Decoding by Contrasting Layers)와 같은 사실성 강화 디코딩 전략을 통해 생성 품질을 높이는 방법을 제시합니다. 또한, 외부 지식을 활용하는 RAG 시스템의 검색 정확도와 문맥 활용 능력을 개선하여 환각을 최소화하는 실질적인 가이드라인을 제공합니다.

Technical Difficulty

중급

Estimated implementation complexity based on methodology.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!