2601.16993v2 Jan 12, 2026 cs.DL

BibAgent: 과학 논문에서 추적 가능한 오인용 탐지를 위한 에이전트 기반 프레임워크

BibAgent: An Agentic Framework for Traceable Miscitation Detection in Scientific Literature

Shuo Xing

Citations: 459

h-index: 12

Zhengzhong Tu

Citations: 52

h-index: 4

Fangzhou Lin

Citations: 35

h-index: 4

Peiran Li

Citations: 126

h-index: 5

Xiang Zheng

Citations: 135

h-index: 7

Xi Hong

Citations: 10

h-index: 1

Chaoqun Ni

Citations: 65

h-index: 5

Jiashuo Sun

DAMO Academy, Alibaba Group

Citations: 997

h-index: 10

인용은 과학적 권위의 근간이지만, 미묘한 왜곡부터 조작된 참고문헌에 이르기까지 광범위한 오인용으로 인해 그 신뢰성이 훼손됩니다. 현재 체계적인 인용 검증은 실현 불가능하며, 수동 검토는 현대 출판량에 비해 확장성이 부족하고, 기존 자동화 도구는 요약 정보만 분석하거나 소규모, 특정 분야의 데이터 세트에 제한되는 경우가 많으며, 이는 전면적인 텍스트 접근의 "페이월 장벽" 때문입니다. 본 연구에서는 확장 가능한, 완전 통합형 에이전트 기반 자동 인용 검증 프레임워크인 BibAgent를 소개합니다. BibAgent는 검색, 추론 및 적응형 증거 집계를 통합하여, 접근 가능한 자료와 유료 자료에 대해 각각 다른 전략을 적용합니다. 유료 자료에 대한 인용의 유효성은, BibAgent가 새롭게 도입한 '증거 위원회' 메커니즘을 통해, 후속 인용의 합의를 기반으로 추론합니다. 체계적인 평가를 지원하기 위해, 5가지 범주로 구성된 오인용 분류 체계와 6,350개의 오인용 샘플을 포함하는 광범위한 학제 분야 벤치마크인 MisciteBench를 제공합니다. 실험 결과, BibAgent는 최첨단 대규모 언어 모델(LLM)을 기반으로 한 기존 방법보다 인용 검증 정확도와 해석 가능성에서 우수한 성능을 보이며, 과학 논문 전반에 걸쳐 인용 오류를 확장 가능하고 투명하게 탐지할 수 있음을 입증합니다.

Original Abstract

Citations are the bedrock of scientific authority, yet their integrity is compromised by widespread miscitations: ranging from nuanced distortions to fabricated references. Systematic citation verification is currently unfeasible; manual review cannot scale to modern publishing volumes, while existing automated tools are restricted by abstract-only analysis or small-scale, domain-specific datasets in part due to the "paywall barrier" of full-text access. We introduce BibAgent, a scalable, end-to-end agentic framework for automated citation verification. BibAgent integrates retrieval, reasoning, and adaptive evidence aggregation, applying distinct strategies for accessible and paywalled sources. For paywalled references, it leverages a novel Evidence Committee mechanism that infers citation validity via downstream citation consensus. To support systematic evaluation, we contribute a 5-category Miscitation Taxonomy and MisciteBench, a massive cross-disciplinary benchmark comprising 6,350 miscitation samples spanning 254 fields. Our results demonstrate that BibAgent outperforms state-of-the-art Large Language Model (LLM) baselines in citation verification accuracy and interpretability, providing scalable, transparent detection of citation misalignments across the scientific literature.

7 Citations

0 Influential

6 Altmetric

37.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!