2603.11619v1 Mar 12, 2026 cs.CR

OpenClaw 제어: 자율 LLM 에이전트의 보안 분석 및 위협 완화

Taming OpenClaw: Security Analysis and Mitigation of Autonomous LLM Agent Threats

Yue Xiao

Citations: 236

h-index: 8

Xinhao Deng

Citations: 442

h-index: 8

Qi Li

Citations: 95

h-index: 5

Shiwen Cui

Citations: 71

h-index: 5

Jiaqi Bai

Citations: 19

h-index: 1

Sibo Yi

Citations: 654

h-index: 3

Zhuo Zou

Citations: 116

h-index: 3

Rennai Qiu

Citations: 35

h-index: 3

Jialuo Chen

Citations: 259

h-index: 6

Xiaohu Du

Citations: 155

h-index: 5

Changhua Meng

Citations: 932

h-index: 15

Weiqiang Wang

Citations: 63

h-index: 4

Ke Xu

Citations: 87

h-index: 5

Yixiang Zhang

Citations: 31

h-index: 2

Jiaqi Wu

Citations: 64

h-index: 3

Jianan Ma

Citations: 37

h-index: 2

Xiaofan Yang

Citations: 44

h-index: 4

Jiaxing Song

Citations: 315

h-index: 3

OpenClaw와 같은 자율 대규모 언어 모델(LLM) 에이전트는 복잡하고 장기적인 작업을 수행하는 데 놀라운 능력을 보여줍니다. 그러나 이들의 긴밀하게 결합된 즉시 메시지 상호 작용 방식과 높은 권한 실행 기능은 시스템 공격 표면을 크게 확장합니다. 본 논문에서는 OpenClaw의 포괄적인 보안 위협 분석을 제시합니다. 분석의 구조화를 위해, 에이전트 운영의 주요 단계를 포괄하는 5단계의 라이프사이클 지향 보안 프레임워크를 도입하고, 간접 프롬프트 주입, 기술 공급망 오염, 메모리 오염, 의도 변화 등 에이전트 운영 라이프사이클 전반에 걸친 복합적인 위협을 체계적으로 검토합니다. OpenClaw에 대한 상세한 사례 연구를 통해 이러한 위협의 발생 빈도와 심각성을 입증하고, 기존 방어 기법의 한계를 분석합니다. 연구 결과는 현재의 개별적인 방어 메커니즘이 시간과 단계를 초월하는 체계적인 위험에 대처하는 데 있어 중요한 약점을 가지고 있음을 보여주며, 자율 LLM 에이전트에 대한 전체적인 보안 아키텍처의 필요성을 강조합니다. 이 프레임워크 내에서, 플러그인 검증 프레임워크, 컨텍스트 기반 명령어 필터링, 메모리 무결성 검증 프로토콜, 의도 검증 메커니즘 및 기능 강제 아키텍처를 포함하여 각 라이프사이클 단계에서 대표적인 방어 전략을 추가적으로 검토합니다.

Original Abstract

Autonomous Large Language Model (LLM) agents, exemplified by OpenClaw, demonstrate remarkable capabilities in executing complex, long-horizon tasks. However, their tightly coupled instant-messaging interaction paradigm and high-privilege execution capabilities substantially expand the system attack surface. In this paper, we present a comprehensive security threat analysis of OpenClaw. To structure our analysis, we introduce a five-layer lifecycle-oriented security framework that captures key stages of agent operation, i.e., initialization, input, inference, decision, and execution, and systematically examine compound threats across the agent's operational lifecycle, including indirect prompt injection, skill supply chain contamination, memory poisoning, and intent drift. Through detailed case studies on OpenClaw, we demonstrate the prevalence and severity of these threats and analyze the limitations of existing defenses. Our findings reveal critical weaknesses in current point-based defense mechanisms when addressing cross-temporal and multi-stage systemic risks, highlighting the need for holistic security architectures for autonomous LLM agents. Within this framework, we further examine representative defense strategies at each lifecycle stage, including plugin vetting frameworks, context-aware instruction filtering, memory integrity validation protocols, intent verification mechanisms, and capability enforcement architectures.

19 Citations

4 Influential

7.5 Altmetric

64.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!