2604.20817v1 Apr 22, 2026 cs.CL

수렴 진화: 다양한 언어 모델이 유사한 숫자 표현 방식을 학습하는 방법

Convergent Evolution: How Different Language Models Learn Similar Number Representations

Deqing Fu

University of Southern California

Citations: 385

h-index: 10

Mikhail Belkin

Citations: 58

h-index: 2

V. Sharan

Citations: 936

h-index: 17

Robin Jia

Citations: 256

h-index: 7

Tianyi Zhou

Citations: 8

h-index: 2

자연어 텍스트로 학습된 언어 모델은 주기적인 특징을 사용하여 숫자를 표현하며, 이 특징들은 주로 $T=2, 5, 10$의 우세한 주기를 갖습니다. 본 논문에서는 이러한 특징들의 두 계층 구조를 밝힙니다. 트랜스포머, 선형 RNN, LSTM, 그리고 다양한 방식으로 학습된 전통적인 단어 임베딩은 모두 푸리에 영역에서 $T$ 주기를 갖는 특징을 학습하지만, 숫자 $T$를 기준으로 기하학적으로 분리 가능한 특징을 학습하는 모델은 일부에 불과합니다. 이러한 불일치를 설명하기 위해, 푸리에 영역의 희소성이 $T$ 기준의 기하학적 분리에 필요하지만 충분하지 않음을 증명합니다. 또한, 모델 학습이 기하학적으로 분리 가능한 특징을 생성하는 조건을 경험적으로 조사한 결과, 데이터, 아키텍처, 옵티마이저, 그리고 토크나이저가 중요한 역할을 한다는 것을 확인했습니다. 특히, 모델이 기하학적으로 분리 가능한 특징을 획득하는 두 가지 경로를 식별했습니다. 첫째, 모델은 일반 언어 데이터, 특히 텍스트-숫자 동반 발생 및 숫자 간의 상호 작용을 포함하는 보완적인 동반 발생 신호로부터 이러한 특징을 학습할 수 있습니다. 둘째, 모델은 여러 토큰을 사용하는 (단일 토큰을 사용하지 않는) 덧셈 문제로부터 이러한 특징을 학습할 수 있습니다. 전반적으로, 본 연구 결과는 특징 학습에서 발생하는 수렴 진화 현상을 강조합니다. 다양한 모델들이 서로 다른 학습 신호로부터 유사한 특징을 학습하는 것입니다.

Original Abstract

Language models trained on natural text learn to represent numbers using periodic features with dominant periods at $T=2, 5, 10$. In this paper, we identify a two-tiered hierarchy of these features: while Transformers, Linear RNNs, LSTMs, and classical word embeddings trained in different ways all learn features that have period-$T$ spikes in the Fourier domain, only some learn geometrically separable features that can be used to linearly classify a number mod-$T$. To explain this incongruity, we prove that Fourier domain sparsity is necessary but not sufficient for mod-$T$ geometric separability. Empirically, we investigate when model training yields geometrically separable features, finding that the data, architecture, optimizer, and tokenizer all play key roles. In particular, we identify two different routes through which models can acquire geometrically separable features: they can learn them from complementary co-occurrence signals in general language data, including text-number co-occurrence and cross-number interaction, or from multi-token (but not single-token) addition problems. Overall, our results highlight the phenomenon of convergent evolution in feature learning: A diverse range of models learn similar features from different training signals.

0 Citations

0 Influential

8.5 Altmetric

42.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!