2603.22869v1 Mar 24, 2026 cs.AI

인증 체인(Chain-of-Authorization): 추론 경로를 통해 대규모 언어 모델에 인증 기능을 내재화

Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories

Qi Li

Citations: 95

h-index: 5

Yang Li

Citations: 36

h-index: 2

Yule Liu

HKUST(GZ)

Citations: 477

h-index: 9

Xinlei He

Citations: 414

h-index: 7

Youjian Zhao

Citations: 27

h-index: 4

Ke Xu

Citations: 77

h-index: 2

대규모 언어 모델(LLM)은 현대 인공지능(AI) 시스템의 핵심적인 인지 구성 요소로, 내부 지식과 외부 컨텍스트를 결합하여 복잡한 작업을 수행합니다. 그러나 LLM은 일반적으로 접근 가능한 모든 데이터를 차별 없이 처리하며, 지식 소유권 및 접근 경계에 대한 고유한 인식을 갖추지 못합니다. 이러한 결함은 민감한 데이터 유출 및 악의적인 조작의 위험을 증가시켜, 무단 시스템 접근 및 심각한 보안 위기를 초래할 수 있습니다. 기존의 보호 전략은 유연성이 떨어지는 단일 방어 메커니즘에 의존하여 동적 인증을 방해합니다. 구조적 분리 방법은 확장성 병목 현상을 겪고, 프롬프트 가이드 방식은 세분화된 권한 구분에 어려움을 겪습니다. 본 연구에서는 Chain-of-Authorization (CoA) 프레임워크를 제안합니다. CoA는 LLM의 핵심 기능에 인증 로직을 내재화하는 안전한 학습 및 추론 패러다임입니다. 수동적인 외부 방어 메커니즘과는 달리, CoA는 모델의 정보 흐름을 재구성합니다. CoA는 입력 단계에서 권한 컨텍스트를 포함하고, 최종 응답 전에 리소스 검토, 식별 확인 및 의사 결정 단계를 포함하는 명시적인 인증 추론 경로를 생성하도록 요구합니다. 다양한 인증 상태를 포함하는 데이터에 대한 지도 학습을 통해, CoA는 정책 실행을 작업 응답과 통합하여, 인증을 의미 있는 응답의 필수 전제로 만듭니다. 광범위한 실험 결과, CoA는 허가된 시나리오에서 비교 가능한 유용성을 유지할 뿐만 아니라, 권한 불일치 시 발생하는 인지적 혼란을 극복합니다. 또한 다양한 무단 및 악의적인 접근 시 높은 거부율을 보입니다. 이러한 메커니즘은 LLM의 추론 능력을 활용하여 동적 인증을 수행하며, 자연어 이해를 통해 현대 AI 시스템에 안정적인 LLM을 배포하기 위한 선제적인 보안 메커니즘으로 작동합니다.

Original Abstract

Large Language Models (LLMs) have become core cognitive components in modern artificial intelligence (AI) systems, combining internal knowledge with external context to perform complex tasks. However, LLMs typically treat all accessible data indiscriminately, lacking inherent awareness of knowledge ownership and access boundaries. This deficiency heightens risks of sensitive data leakage and adversarial manipulation, potentially enabling unauthorized system access and severe security crises. Existing protection strategies rely on rigid, uniform defense that prevent dynamic authorization. Structural isolation methods faces scalability bottlenecks, while prompt guidance methods struggle with fine-grained permissions distinctions. Here, we propose the Chain-of-Authorization (CoA) framework, a secure training and reasoning paradigm that internalizes authorization logic into LLMs' core capabilities. Unlike passive external defneses, CoA restructures the model's information flow: it embeds permission context at input and requires generating explicit authorization reasoning trajectory that includes resource review, identity resolution, and decision-making stages before final response. Through supervised fine-tuning on data covering various authorization status, CoA integrates policy execution with task responses, making authorization a causal prerequisite for substantive responses. Extensive evaluations show that CoA not only maintains comparable utility in authorized scenarios but also overcomes the cognitive confusion when permissions mismatches. It exhibits high rejection rates against various unauthorized and adversarial access. This mechanism leverages LLMs' reasoning capability to perform dynamic authorization, using natural language understanding as a proactive security mechanism for deploying reliable LLMs in modern AI systems.

0 Citations

0 Influential

4.5 Altmetric

22.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!