2601.05560v1 Jan 09, 2026 cs.CL

ReasonAny: 간단하고 효과적인 모델 병합을 통해 모든 모델에 추론 능력을 통합

ReasonAny: Incorporating Reasoning Capability to Any Model via Simple and Effective Model Merging

Junyao Yang

Citations: 66

h-index: 3

Wen Shen

Citations: 17

h-index: 1

Chen Qian

Citations: 307

h-index: 8

Yong Liu

Citations: 206

h-index: 5

Dongrui Liu

Citations: 550

h-index: 13

Jing Shao

Citations: 595

h-index: 12

최근, 긴 추론 과정을 갖는 대규모 추론 모델(LRM)이 놀라운 성공을 거두었습니다. 그러나 도메인 전문 모델에 이러한 추론 능력을 부여하는 것, 즉 “추론 + X”를 구현하는 것은 여전히 중요한 과제입니다. 모델 병합은 학습이 필요 없는 유망한 해결책을 제시하지만, 기존 방법은 종종 성능 저하를 초래합니다. 기존 방법은 추론 깊이를 약화시키고 도메인별 유용성을 저해하는 경향이 있습니다. 흥미롭게도, 우리는 이러한 실패의 근본적인 이유를 밝혀냈습니다. 즉, 일반적인 가정과는 달리, 도메인 관련 기능은 큰 값의 파라미터에 해당하지 않고, 추론 능력은 주로 기울기 민감도가 낮은 파라미터 영역에 집중되어 있다는 것입니다. 이러한 통찰력을 바탕으로, 우리는 추론과 도메인 성능 저하 문제를 Contrastive Gradient Identification을 통해 해결하는 새로운 병합 프레임워크인 ReasonAny를 제안합니다. 안전, 생명 의학 및 금융 분야에서의 실험 결과, ReasonAny는 “추론 + X” 기능을 효과적으로 통합하며, 기존 최고 성능 모델보다 훨씬 뛰어난 성능을 보임과 동시에 견고한 추론 성능을 유지하는 것을 확인했습니다.

Original Abstract

Large Reasoning Models (LRMs) with long chain-of-thought reasoning have recently achieved remarkable success. Yet, equipping domain-specialized models with such reasoning capabilities, referred to as "Reasoning + X", remains a significant challenge. While model merging offers a promising training-free solution, existing methods often suffer from a destructive performance collapse: existing methods tend to both weaken reasoning depth and compromise domain-specific utility. Interestingly, we identify a counter-intuitive phenomenon underlying this failure: reasoning ability predominantly resides in parameter regions with low gradient sensitivity, contrary to the common assumption that domain capabilities correspond to high-magnitude parameters. Motivated by this insight, we propose ReasonAny, a novel merging framework that resolves the reasoning-domain performance collapse through Contrastive Gradient Identification. Experiments across safety, biomedicine, and finance domains show that ReasonAny effectively synthesizes "Reasoning + X" capabilities, significantly outperforming state-of-the-art baselines while retaining robust reasoning performance.

1 Citations

0 Influential

6.5 Altmetric

33.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!