2605.26789v1 May 26, 2026 cs.AI

Composition Collapse: Stable Factual Knowledge Does Not Imply Compositional Reasoning

Hongzhi Wang
Hongzhi Wang
Citations: 6
h-index: 2
Wenpeng Xing
Wenpeng Xing
Citations: 205
h-index: 10
Zhengtao Yu
Zhengtao Yu
Citations: 20
h-index: 2
Xuyang Teng
Xuyang Teng
Citations: 6
h-index: 2
Meng Han
Meng Han
Citations: 78
h-index: 3
Yunzhao Wei
Yunzhao Wei
Citations: 0
h-index: 0
Jiefeng Chen
Jiefeng Chen
Citations: 150
h-index: 4

Post-training is routinely evaluated through aggregate benchmark scores that treat multi-hop reasoning as a single capability -- as if a model that answers more questions correctly must be better at assembling facts. We show that this assumption can be misleading: recipes with statistically indistinguishable atomic knowledge produce composition behaviour separated by over 40 percentage points, a phenomenon we call composition collapse: the systematic failure to assemble stably-known facts into chains, invisible to aggregate metrics. We introduce a double-gate protocol that changes the estimand from an aggregate compositionality gap to residual composition failure conditioned on stable atomic access, decomposing post-training gains into three independent channels: atomic stability, residual composition, and critical depth. On a benchmark of temporal factual chains spanning depths 2--11 across four post-training recipes, this decomposition reveals that post-training objectives shift composition capability in directions that aggregate metrics mask, and suggests that claims about multi-hop reasoning improvement should be accompanied by atomic-gate-controlled composition metrics. Diagnostic probes further show that a substantial share of measured composition failure reflects generation-time computation constraints rather than permanent inability to compose.

0 Citations
0 Influential
5 Altmetric
25.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!