2604.05333v1 Apr 07, 2026 cs.AI

기술 그래프: 종속성 기반 구조 검색을 통한 대규모 에이전트 기술 활용

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Zongxia Li

Citations: 912

h-index: 7

Hongyang Du

Citations: 201

h-index: 5

Xiyang Wu

Citations: 322

h-index: 8

Shihang Gui

Citations: 18

h-index: 1

Yongbei Kuang

Citations: 17

h-index: 1

Lichao Sun

Citations: 75

h-index: 5

Dawei Liu

Citations: 43

h-index: 2

기술 활용은 현대 에이전트 시스템의 핵심 구성 요소이며, 에이전트가 복잡한 작업을 수행하는 능력을 크게 향상시킬 수 있습니다. 실제 환경에서 에이전트는 수많은 개인 애플리케이션, 웹 브라우저 및 기타 환경 인터페이스를 모니터링하고 상호 작용해야 하며, 이로 인해 기술 라이브러리는 수천 개의 재사용 가능한 기술로 확장될 수 있습니다. 더 큰 기술 집합으로 확장하면 두 가지 주요 문제가 발생합니다. 첫째, 전체 기술 집합을 로드하면 컨텍스트 창이 포화되어 토큰 비용, 환각 현상 및 지연 시간이 증가합니다. 본 논문에서는 대규모 기술 라이브러리를 위한 추론 시간 구조 검색 레이어인 Graph of Skills (GoS)를 제시합니다. GoS는 기술 패키지에서 실행 가능한 기술 그래프를 오프라인으로 구축한 다음, 추론 시간에 하이브리드 의미론적-어휘적 시딩, 역 가중치 개인화된 페이지 랭크 및 컨텍스트 예산 기반 활성화 방식을 통해 제한적이고 종속성을 고려한 기술 번들을 검색합니다. SkillsBench 및 ALFWorld 데이터셋에서 GoS는 기본 전체 기술 로딩 방식에 비해 평균 보상을 43.6% 향상시키면서 입력 토큰을 37.8% 줄였습니다. 또한 Claude Sonnet, GPT-5.2 Codex 및 MiniMax 등 세 가지 모델 패밀리에 걸쳐 일반화 성능을 보였습니다. 200개에서 2,000개 사이의 다양한 기술 라이브러리에 대한 추가적인 분석 연구를 통해 GoS가 보상, 토큰 효율성 및 런타임 측면에서 기본 기술 로딩 및 단순 벡터 검색 방식보다 일관되게 우수한 성능을 발휘함을 확인했습니다.

Original Abstract

Skill usage has become a core component of modern agent systems and can substantially improve agents' ability to complete complex tasks. In real-world settings, where agents must monitor and interact with numerous personal applications, web browsers, and other environment interfaces, skill libraries can scale to thousands of reusable skills. Scaling to larger skill sets introduces two key challenges. First, loading the full skill set saturates the context window, driving up token costs, hallucination, and latency. In this paper, we present Graph of Skills (GoS), an inference-time structural retrieval layer for large skill libraries. GoS constructs an executable skill graph offline from skill packages, then at inference time retrieves a bounded, dependency-aware skill bundle through hybrid semantic-lexical seeding, reverse-weighted Personalized PageRank, and context-budgeted hydration. On SkillsBench and ALFWorld, GoS improves average reward by 43.6% over the vanilla full skill-loading baseline while reducing input tokens by 37.8%, and generalizes across three model families: Claude Sonnet, GPT-5.2 Codex, and MiniMax. Additional ablation studies across skill libraries ranging from 200 to 2,000 skills further demonstrate that GoS consistently outperforms both vanilla skills loading and simple vector retrieval in balancing reward, token efficiency, and runtime.

17 Citations

3 Influential

4 Altmetric

43.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!