2604.25727v1 Apr 28, 2026 cs.AI

스킬 그래프를 이용한 확장 가능한 터미널 작업 합성 연구

Toward Scalable Terminal Task Synthesis via Skill Graphs

Jian Guan

Citations: 171

h-index: 5

Xing Wu

Citations: 91

h-index: 4

Zhuowen Han

Citations: 27

h-index: 3

Jiangnan Zhou

Citations: 1

h-index: 1

Ting-Ting Yu

Citations: 40

h-index: 3

Zhiyuan Fan

Citations: 53

h-index: 4

Yu-Tong Cai

Citations: 24

h-index: 2

Yunhan Yang

Citations: 18

h-index: 2

Feng Zhang

Citations: 11

h-index: 2

Di Hu

Citations: 104

h-index: 5

Lilin Wang

Citations: 67

h-index: 5

터미널 에이전트는 자율적인 명령줄 실행에 강력한 잠재력을 보여주지만, 고품질이고 다양한 실행 경로의 부족으로 인해 학습이 제한됩니다. 기존 연구들은 이 병목 현상을 완화하기 위해 대규모 터미널 작업 인스턴스를 합성하여 경로 샘플링을 수행합니다. 하지만 이러한 접근 방식은 주로 작업 수를 늘리는 데 초점을 맞추고, 에이전트가 학습 과정에서 실제로 경험하는 실행 경로의 다양성을 제한하는 경향이 있습니다. 본 논문에서는 시나리오 기반 스킬 그래프를 활용하여 자동화된 터미널 작업 합성 프레임워크인 SkillSynth를 제시합니다. SkillSynth는 먼저 대규모 스킬 그래프를 구축하며, 여기서 시나리오는 다양한 명령줄 스킬을 연결하는 중간 전환 노드 역할을 합니다. 그런 다음 이 그래프에서 경로를 샘플링하여 실제 워크플로우의 추상화를 생성하고, 멀티 에이전트 환경을 사용하여 이를 실행 가능한 작업 인스턴스로 변환합니다. SkillSynth는 그래프에서 샘플링된 워크플로우 경로를 기반으로 작업 합성을 수행함으로써, 합성된 작업 해결에 필요한 최소 실행 경로의 다양성을 명시적으로 제어합니다. Terminal-Bench에서 수행된 실험을 통해 SkillSynth의 효과성을 입증했습니다. 또한, SkillSynth에 의해 합성된 작업 인스턴스는 Hy3 Preview의 학습에 사용되어 터미널 기반 환경에서의 에이전트 기능 향상에 기여했습니다.

Original Abstract

Terminal agents have demonstrated strong potential for autonomous command-line execution, yet their training remains constrained by the scarcity of high-quality and diverse execution trajectories. Existing approaches mitigate this bottleneck by synthesizing large-scale terminal task instances for trajectory sampling. However, they primarily focus on scaling the number of tasks while providing limited control over the diversity of execution trajectories that agents actually experience during training. In this paper, we present SkillSynth, an automated framework for terminal task synthesis built on a scenario-mediated skill graph. SkillSynth first constructs a large-scale skill graph, where scenarios serve as intermediate transition nodes that connect diverse command-line skills. It then samples paths from this graph as abstractions of real-world workflows, and uses a multi-agent harness to instantiate them into executable task instances. By grounding task synthesis in graph-sampled workflow paths, SkillSynth explicitly controls the diversity of minimal execution trajectories required to solve the synthesized tasks. Experiments on Terminal-Bench demonstrate the effectiveness of SkillSynth. Moreover, task instances synthesized by SkillSynth have been adopted to train Hy3 Preview, contributing to its enhanced agentic capabilities in terminal-based settings.

0 Citations

0 Influential

2.5 Altmetric

12.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!