2606.13239v1 Jun 11, 2026 cs.SE

ComAct: Reframing Professional Software Manipulation via COM-as-Action Paradigm

Xuemeng Yang
Xuemeng Yang
Citations: 507
h-index: 12
Pinlong Cai
Pinlong Cai
Citations: 2,009
h-index: 19
Licheng Wen
Licheng Wen
Citations: 1,214
h-index: 15
Botian Shi
Botian Shi
Citations: 561
h-index: 13
Yu Yang
Yu Yang
Citations: 73
h-index: 3
Jiaxin Ai
Jiaxin Ai
Citations: 112
h-index: 6
Tao Hu
Tao Hu
Citations: 43
h-index: 3
Shu Zou
Shu Zou
Citations: 0
h-index: 0
Hairong Zhang
Hairong Zhang
Citations: 9
h-index: 2
Daocheng Fu
Daocheng Fu
Citations: 56
h-index: 2
Nianchen Deng
Nianchen Deng
Citations: 1,190
h-index: 8
Hongbin Zhou
Hongbin Zhou
Citations: 363
h-index: 7
Zhongyuan Wang
Zhongyuan Wang
Citations: 950
h-index: 17
Kaipeng Zhang
Kaipeng Zhang
Citations: 158
h-index: 7

Existing computer-use agents remain fundamentally limited in professional software manipulation: GUI-based agents suffer from fragile visual grounding and long-horizon error accumulation, while API-basedapproaches struggle with heterogeneous protocols and inaccessible commercial interfaces. In this work,we identify the Component Object Model (COM) as a unified executable abstraction, proposing COM-as-Action: a new paradigm that reframes professional software interaction as deterministic program synthesisrather than sequential visual control. To validate this paradigm in the most demanding environments, weintroduce ComCADBench, the first benchmark for agents operating real industrial CAD software. Ourexperiments reveal a substantial paradigm gap: frontier proprietary models achieve near-zero successunder GUI-based interaction, whereas COM-based execution yields substantial immediate gains. Tobridge the remaining gap between syntactic correctness and geometric accuracy, we develop ComActor, aself-correcting agent trained through a progressive three-stage framework, alongside ComForge, a scalableplatform for large-scale training in Windows containers. Extensive experiments show that ComActorachieves state-of-the-art performance on ComCADBench, with strong resilience in long-horizon taskswhere baselines collapse, and generalizes to external CAD benchmark.

0 Citations
0 Influential
9.5 Altmetric
47.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!