2604.05343v1 Apr 07, 2026 cs.SD

앵커 기반 순환 생성: 장기 시퀀스 심볼릭 음악 생성의 새로운 패러다임

Anchored Cyclic Generation: A Novel Paradigm for Long-Sequence Symbolic Music Generation

Boyu Cao

Citations: 11

h-index: 3

Lekai Qian

Citations: 1

h-index: 1

Dehan Li

Citations: 1

h-index: 1

Haoyue Gu

Citations: 4

h-index: 1

Ming Xu

Citations: 7

h-index: 1

Qi Liu

Citations: 10

h-index: 2

오토레그레시브 모델은 시퀀스 생성 작업 전반에서 구조적 일관성을 갖는 장기 시퀀스를 생성하는 데 근본적인 어려움을 겪습니다. 특히 심볼릭 음악 생성 분야에서는 기존 방법들이 오토레그레시브 모델의 고질적인 심각한 오류 누적 문제로 인해 제약을 받으며, 이는 음악 품질과 구조적 완성도 저하로 이어집니다. 본 논문에서는 이미 식별된 음악의 앵커 특징을 활용하여 오토레그레시브 과정에서 후속 생성을 안내하는 앵커 기반 순환 생성(Anchored Cyclic Generation, ACG) 패러다임을 제안합니다. ACG 패러다임을 기반으로, 우리는 체계적인 전역-국소 생성 전략을 사용하며, 효율적인 음악 표현 방식인 특수 설계된 피아노 토큰과 높은 호환성을 갖는 계층적 앵커 기반 순환 생성(Hierarchical Anchored Cyclic Generation, Hi-ACG) 프레임워크를 추가적으로 제안합니다. 실험 결과는 ACG 패러다임이 기존의 오토레그레시브 모델에 비해 예측된 특징 벡터와 실제 의미 벡터 간의 코사인 거리를 평균 34.7% 감소시킨다는 것을 보여줍니다. 장기 시퀀스 심볼릭 음악 생성 작업에서, Hi-ACG 프레임워크는 주관적 및 객관적 평가 모두에서 기존의 주요 방법보다 훨씬 뛰어난 성능을 보입니다. 또한, 본 프레임워크는 뛰어난 작업 일반화 능력을 보여주며, 음악 완성 등 관련 작업에서도 우수한 성능을 달성합니다.

Original Abstract

Generating long sequences with structural coherence remains a fundamental challenge for autoregressive models across sequential generation tasks. In symbolic music generation, this challenge is particularly pronounced, as existing methods are constrained by the inherent severe error accumulation problem of autoregressive models, leading to poor performance in music quality and structural integrity. In this paper, we propose the Anchored Cyclic Generation (ACG) paradigm, which relies on anchor features from already identified music to guide subsequent generation during the autoregressive process, effectively mitigating error accumulation in autoregressive methods. Based on the ACG paradigm, we further propose the Hierarchical Anchored Cyclic Generation (Hi-ACG) framework, which employs a systematic global-to-local generation strategy and is highly compatible with our specifically designed piano token, an efficient musical representation. The experimental results demonstrate that compared to traditional autoregressive models, the ACG paradigm achieves reduces cosine distance by an average of 34.7% between predicted feature vectors and ground-truth semantic vectors. In long-sequence symbolic music generation tasks, the Hi-ACG framework significantly outperforms existing mainstream methods in both subjective and objective evaluations. Furthermore, the framework exhibits excellent task generalization capabilities, achieving superior performance in related tasks such as music completion.

0 Citations

0 Influential

1.5 Altmetric

7.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!