2604.20727v1 Apr 22, 2026 cs.LG

에이전트 기반 작업 성능 향상을 위한 보충 정보 생성 훈련

Supplement Generation Training for Enhancing Agentic Task Performance

Khushbu Pahwa

Citations: 158

h-index: 7

Yi Zhang

Citations: 113

h-index: 5

Etsuko Ishii

The Hong Kong University of Science and Technology

Citations: 5,265

h-index: 12

D. Bhargavi

Citations: 45

h-index: 4

Monica Sunkara

Citations: 88

h-index: 5

Younghyun Cho

Citations: 16

h-index: 2

Daniele Bonadiman

University of Trento

Citations: 350

h-index: 11

Tamer Alkhouli

AppTek

Citations: 797

h-index: 14

Salvatore Romeo

Citations: 27

h-index: 3

Dongwei Jiang

Citations: 427

h-index: 7

Yubin Ge

Citations: 62

h-index: 5

에이전트 기반 작업 훈련은 높은 계산 비용, 긴 반복 주기, 그리고 지속적으로 새로운 모델이 출시됨에 따라 대규모 기초 모델을 훈련하는 것이 점점 더 비현실적이 되어가고 있습니다. 따라서, 모든 새로운 작업 또는 도메인에 대해 거대한 모델을 추가 훈련하는 대신, 우리는 보다 효율적이고 지속 가능한 전략인 보충 정보 생성 훈련(SGT)을 제안합니다. SGT는 더 작은 LLM을 훈련시켜 유용한 보충 텍스트를 생성하며, 이 텍스트를 원래 입력에 추가하면 더 큰 LLM이 해당 작업을 더욱 효과적으로 해결할 수 있도록 돕습니다. 이러한 경량 모델은 작업 요구 사항에 따라 보충 정보를 동적으로 조정하여, 기반이 되는 대규모 모델을 수정하지 않고도 성능을 향상시킬 수 있습니다. 이러한 접근 방식은 작업별 최적화를 대규모 기초 모델로부터 분리하고, 실제 응용 분야에서 LLM 기반 에이전트를 보다 유연하고 비용 효율적으로 배포할 수 있도록 합니다.

Original Abstract

Training large foundation models for agentic tasks is increasingly impractical due to the high computational costs, long iteration cycles, and rapid obsolescence as new models are continuously released. Instead of post-training massive models for every new task or domain, we propose Supplement Generation Training (SGT), a more efficient and sustainable strategy. SGT trains a smaller LLM to generate useful supplemental text that, when appended to the original input, helps the larger LLM solve the task more effectively. These lightweight models can dynamically adapt supplements to task requirements, improving performance without modifying the underlying large models. This approach decouples task-specific optimization from large foundation models and enables more flexible, cost-effective deployment of LLM-powered agents in real-world applications.

0 Citations

0 Influential

7 Altmetric

35.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!