2606.10650v1 Jun 09, 2026 cs.CL

Dynamic Linear Attention

Bo Zheng
Bo Zheng
Citations: 44
h-index: 3
Xin Wang
Xin Wang
Citations: 7,232
h-index: 39
Xueshen Liu
Xueshen Liu
Citations: 58
h-index: 4
Hui Shen
Hui Shen
Citations: 402
h-index: 10
Zesen Zhao
Zesen Zhao
Citations: 41
h-index: 2
Minkyoung Cho
Minkyoung Cho
Citations: 29
h-index: 2
Zhongwei Wan
Zhongwei Wan
Citations: 1,260
h-index: 16
Z. Mao
Z. Mao
Citations: 30
h-index: 2
Shen Yan
Shen Yan
Citations: 452
h-index: 5
Mi Zhang
Mi Zhang
Citations: 692
h-index: 13

The scalability of Large Language Models (LLMs) to long contexts is fundamentally constrained by the quadratic complexity of standard attention, motivating the adoption of linear attention mechanisms with sub-quadratic cost. To improve representation capacity under long contexts, recent approaches organize memory in a multi-state manner. However, existing multi-state linear attention methods rely on fixed state merging policies that cannot adapt to dynamically varying token importance, irreversibly obscuring critical tokens and causing severe error accumulation over long sequences. To address this limitation, we propose DLA, a dynamic memory modeling framework for multi-state linear attention. DLA introduces (i) Information-Aware Dynamic State Merging, which adaptively determines state boundaries based on token-level information variation, preserving high-resolution representations around semantic transitions while aggressively summarizing stable regions, and (ii) Capacity-Bounded Memory Modeling, which maintains a fixed-size, chronologically ordered state cache by selectively merging adjacent low-information states to control memory growth with minimal information loss. We pre-train DLA on two different linear attention models and evaluate on 16 datasets across three categories. Experimental results demonstrate the superiority of DLA over state-of-the-art.

0 Citations
0 Influential
19.5 Altmetric
97.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!