2602.12430v3 Feb 12, 2026 cs.MA

대규모 언어 모델을 위한 에이전트 스킬: 아키텍처, 습득, 보안, 그리고 향후 방향

Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

Citations: 669

h-index: 3

Citations: 113

h-index: 3

단일형 언어 모델에서 모듈식이며 스킬(skill)을 갖춘 에이전트로의 전환은 대규모 언어 모델(LLM)이 실제로 활용되는 방식에 있어 결정적인 변화를 나타낸다. 모든 절차적 지식을 모델 가중치 내에 인코딩하는 대신, 에이전트 스킬(에이전트가 필요에 따라 로드하는 명령어, 코드 및 리소스의 결합 가능한 패키지)은 재학습 없이 동적인 기능 확장을 가능하게 한다. 이는 점진적 공개(progressive disclosure) 패러다임, 이식 가능한 스킬 정의, 모델 컨텍스트 프로토콜(MCP)과의 통합으로 공식화된다. 본 서베이 논문은 지난 몇 달 동안 빠르게 진화해 온 에이전트 스킬 환경에 대한 포괄적인 분석을 제공한다. 우리는 이 분야를 네 가지 축으로 분류한다: (i) SKILL$.$md 명세, 점진적 컨텍스트 로딩, 스킬과 MCP의 상호보완적 역할을 검토하는 아키텍처 기반; (ii) 스킬 라이브러리를 활용한 강화 학습, 자율적 스킬 발견(SEAgent), 구성적 스킬 합성을 다루는 스킬 습득; (iii) 컴퓨터 사용 에이전트(CUA) 스택, GUI 그라운딩 기술 발전, OSWorld 및 SWE-bench의 벤치마크 진전을 포함하는 대규모 배포; (iv) 최근 실증적 분석에서 커뮤니티 기여 스킬의 26.1%가 취약점을 포함하고 있음을 밝혀내어, 스킬 출처를 단계별 배포 권한에 매핑하는 4계층 게이트 기반 권한 모델인 '스킬 신뢰 및 수명주기 거버넌스 프레임워크'를 제안하게 된 보안. 우리는 교차 플랫폼 스킬 이식성에서부터 기능 기반 권한 모델에 이르는 7가지 미해결 과제를 식별하고, 신뢰할 수 있으며 자가 개선이 가능한 스킬 생태계 실현을 위한 연구 의제를 제안한다. LLM 에이전트나 도구 사용을 폭넓게 다루는 기존 서베이 연구들과 달리, 본 연구는 새롭게 부상하는 스킬 추상화 계층과 그것이 차세대 에이전틱 시스템에 미치는 의미에 구체적으로 초점을 맞춘다. 프로젝트 저장소: https://github.com/scienceaix/agentskills

Original Abstract

The transition from monolithic language models to modular, skill-equipped agents marks a defining shift in how large language models (LLMs) are deployed in practice. Rather than encoding all procedural knowledge within model weights, agent skills -- composable packages of instructions, code, and resources that agents load on demand -- enable dynamic capability extension without retraining. It is formalized in a paradigm of progressive disclosure, portable skill definitions, and integration with the Model Context Protocol (MCP). This survey provides a comprehensive treatment of the agent skills landscape, as it has rapidly evolved during the last few months. We organize the field along four axes: (i) architectural foundations, examining the SKILL$.$md specification, progressive context loading, and the complementary roles of skills and MCP; (ii) skill acquisition, covering reinforcement learning with skill libraries, autonomous skill discovery (SEAgent), and compositional skill synthesis; (iii) deployment at scale, including the computer-use agent (CUA) stack, GUI grounding advances, and benchmark progress on OSWorld and SWE-bench; and (iv) security, where recent empirical analyses reveal that 26.1% of community-contributed skills contain vulnerabilities, motivating our proposed Skill Trust and Lifecycle Governance Framework -- a four-tier, gate-based permission model that maps skill provenance to graduated deployment capabilities. We identify seven open challenges -- from cross-platform skill portability to capability-based permission models -- and propose a research agenda for realizing trustworthy, self-improving skill ecosystems. Unlike prior surveys that broadly cover LLM agents or tool use, this work focuses specifically on the emerging skill abstraction layer and its implications for the next generation of agentic systems. Project repo: https://github.com/scienceaix/agentskills

63 Citations

3 Influential

29.547189562171 Altmetric

216.7 Score

Original PDF

AI Analysis

Korean Summary

이 논문은 대규모 언어 모델(LLM)이 단일 모델에서 모듈식 전문성을 갖춘 '에이전트 스킬(Agent Skills)' 기반 시스템으로 전환되는 패러다임을 종합적으로 분석합니다. 에이전트 스킬의 아키텍처(점진적 정보 공개, SKILL.md 규격, MCP와의 통합), 스킬 획득 방법(강화학습, 자율 탐색, 인간 작성 등), 컴퓨터 사용 에이전트(CUA) 환경에서의 배포를 다룹니다. 특히 커뮤니티 스킬의 26.1%가 보안 취약점을 지니고 있음을 지적하며, 이를 해결하기 위한 '스킬 신뢰 및 수명주기 거버넌스 프레임워크'와 7가지 향후 연구 과제를 제안합니다.

Key Innovations

컨텍스트 소모를 최소화하는 3단계 점진적 정보 공개(Progressive Disclosure) 아키텍처 및 SKILL.md 규격화
절차적 지식을 제공하는 스킬(Skills)과 외부 연결성을 제공하는 모델 컨텍스트 프로토콜(MCP)의 통합 에이전틱 스택(Agentic Stack) 정의
인간 작성, 스킬 라이브러리 기반 강화학습(SAGE), 자율 탐색(SEAgent), 조합적 스킬 합성 등 스킬 획득 방식의 분류 체계화
스킬 출처(Provenance), 검증 게이트(Verification Gates), 신뢰 계층(Trust Tiers)을 매핑하여 권한을 제어하는 보안 거버넌스 프레임워크 제안

Learning & Inference Impact

학습 측면에서 모델의 가중치를 직접 미세조정(Fine-tuning)하는 기존 방식에서 벗어나, 절차적 지식을 '스킬'이라는 모듈형 파일 형태로 외부화하여 에이전트가 습득하게 합니다. 이를 통해 막대한 재학습 비용이나 기존 지식을 잃어버리는 파국적 망각(Catastrophic Forgetting) 없이 능력을 지속적으로 확장할 수 있습니다. 추론 측면에서는 '점진적 정보 공개' 방식을 통해 효율성을 극대화합니다. 초기 로딩 시에는 스킬의 이름과 설명 같은 메타데이터(약 30토큰)만 시스템 프롬프트에 로드하고, 사용자의 의도와 일치하는 스킬이 트리거될 때만 세부 지침과 코드를 컨텍스트에 주입합니다. 이는 대규모 스킬 라이브러리를 보유하더라도 추론 시 컨텍스트 윈도우의 낭비를 막고 토큰 소모량과 지연 시간을 대폭 절감하게 해줍니다.

Technical Difficulty

중급

Estimated implementation complexity based on methodology.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!