2603.19639v1 Mar 20, 2026 cs.AI

HyEvo: 효율적인 추론을 위한 자체 진화형 하이브리드 에이전트 기반 워크플로우

HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning

Beibei Xu

Citations: 19

h-index: 2

Yutong Ye

Citations: 180

h-index: 7

Chuyun Shen

Citations: 84

h-index: 6

Yingbo Zhou

Citations: 40

h-index: 4

Cheng Chen

Citations: 6

h-index: 2

Mingsong Chen

Citations: 104

h-index: 5

에이전트 기반 워크플로우는 복잡한 작업 해결에 강력한 잠재력을 보여주었지만, 기존의 자동 생성 방법은 여전히 비효율적이며 성능이 부족합니다. 이는 미리 정의된 연산자 라이브러리와 확률적 추론을 통해 모든 작업 수준의 계산을 수행하는 균일한 LLM(대규모 언어 모델) 기반 워크플로우에 의존하기 때문입니다. 이러한 제한 사항을 해결하기 위해, 우리는 이기종 원자 합성 방식을 활용하는 자동 워크플로우 생성 프레임워크인 HyEvo를 제안합니다. HyEvo는 의미 추론을 위한 확률적 LLM 노드와 규칙 기반 실행을 위한 결정론적 코드 노드를 통합하여, 예측 가능한 작업을 LLM 추론에서 분리하고 추론 비용과 실행 지연 시간을 줄입니다. HyEvo는 하이브리드 검색 공간을 효율적으로 탐색하기 위해, LLM 기반의 다중 섬 진화 전략과 '반사-생성' 메커니즘을 사용하여 실행 피드백을 통해 워크플로우 토폴로지와 노드 로직을 반복적으로 개선합니다. 종합적인 실험 결과, HyEvo는 다양한 추론 및 코딩 벤치마크에서 기존 방법보다 일관되게 뛰어난 성능을 보이며, 최첨단 오픈 소스 기준과 비교하여 추론 비용과 실행 지연 시간을 각각 최대 19배, 16배까지 줄입니다.

Original Abstract

Although agentic workflows have demonstrated strong potential for solving complex tasks, existing automated generation methods remain inefficient and underperform, as they rely on predefined operator libraries and homogeneous LLM-only workflows in which all task-level computation is performed through probabilistic inference. To address these limitations, we propose HyEvo, an automated workflow-generation framework that leverages heterogeneous atomic synthesis. HyEvo integrates probabilistic LLM nodes for semantic reasoning with deterministic code nodes for rule-based execution, offloading predictable operations from LLM inference and reducing inference cost and execution latency. To efficiently navigate the hybrid search space, HyEvo employs an LLM-driven multi-island evolutionary strategy with a reflect-then-generate mechanism, iteratively refining both workflow topology and node logic via execution feedback. Comprehensive experiments show that HyEvo consistently outperforms existing methods across diverse reasoning and coding benchmarks, while reducing inference cost and execution latency by up to 19$\times$ and 16$\times$, respectively, compared to the state-of-the-art open-source baseline.

0 Citations

0 Influential

3.5 Altmetric

17.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!