2601.19232v1 Jan 27, 2026 cs.LG

잠재 확산 모델의 단계별 최적화를 통한 구조 기반 RNA 설계

Structure-based RNA Design by Step-wise Optimization of Latent Diffusion Model

Qi Si

Citations: 0

h-index: 0

Xuyang Liu

Citations: 408

h-index: 12

Penglei Wang

Citations: 148

h-index: 4

Xin Guo

Citations: 27

h-index: 2

Yuan Qi

Citations: 207

h-index: 9

Yuan Cheng

Citations: 31

h-index: 3

RNA 역방향 접힘은 특정 3차원 구조를 형성하도록 설계된 서열을 생성하는 것으로, 치료, 유전자 조절 및 합성 생물학에 매우 중요합니다. 기존 방법들은 주로 서열 복원에 초점을 맞추고 있으며, 2차 구조 일관성(SS), 최소 자유 에너지(MFE), 국소 거리 차이 테스트(LDDT)와 같은 구조적 목표를 해결하는 데 어려움을 겪어, 최적의 구조 정확도를 달성하지 못하는 경우가 많습니다. 이러한 문제를 해결하기 위해, 우리는 잠재 확산 모델(LDM)과 통합된 강화 학습(RL) 프레임워크를 제안합니다. 확산 모델이 RNA 역방향 접힘에서 복잡한 서열-구조 상호 작용을 효과적으로 모델링하는 데 성공한 사례에서 영감을 받아, 대규모 RNA 모델에서 사전 학습된 RNA-FM 임베딩을 통합한 LDM을 개발했습니다. 이러한 임베딩은 공진화 패턴을 포착하여 서열 복구 정확도를 크게 향상시킵니다. 그러나 기존의 방법들, 특히 확산 기반 방법들은 미분 불가능한 구조적 목표를 효과적으로 처리할 수 없습니다. 반면에, RL은 정책 기반의 보상 최적화를 사용하여 복잡하고 미분 기반이 아닌 목표를 탐색하는 데 뛰어납니다. 이는 기존 방법보다 상당한 장점을 제공합니다. 요약하자면, 우리는 잠재 확산 모델의 단계별 최적화(SOLD)라는 새로운 RL 프레임워크를 제안합니다. SOLD는 전체 확산 경로를 샘플링하지 않고 단일 단계의 노이즈를 최적화하여, 여러 가지 구조적 목표를 효율적으로 개선합니다. 실험 결과는 SOLD가 LDM 기준 모델 및 최첨단 방법보다 모든 지표에서 우수한 성능을 보임을 입증하며, RNA 역방향 접힘을 위한 강력한 프레임워크를 구축하고 생명공학 및 치료 응용 분야에 상당한 영향을 미칠 것으로 예상됩니다.

Original Abstract

RNA inverse folding, designing sequences to form specific 3D structures, is critical for therapeutics, gene regulation, and synthetic biology. Current methods, focused on sequence recovery, struggle to address structural objectives like secondary structure consistency (SS), minimum free energy (MFE), and local distance difference test (LDDT), leading to suboptimal structural accuracy. To tackle this, we propose a reinforcement learning (RL) framework integrated with a latent diffusion model (LDM). Drawing inspiration from the success of diffusion models in RNA inverse folding, which adeptly model complex sequence-structure interactions, we develop an LDM incorporating pre-trained RNA-FM embeddings from a large-scale RNA model. These embeddings capture co-evolutionary patterns, markedly improving sequence recovery accuracy. However, existing approaches, including diffusion-based methods, cannot effectively handle non-differentiable structural objectives. By contrast, RL excels in this task by using policy-driven reward optimization to navigate complex, non-gradient-based objectives, offering a significant advantage over traditional methods. In summary, we propose the Step-wise Optimization of Latent Diffusion Model (SOLD), a novel RL framework that optimizes single-step noise without sampling the full diffusion trajectory, achieving efficient refinement of multiple structural objectives. Experimental results demonstrate SOLD surpasses its LDM baseline and state-of-the-art methods across all metrics, establishing a robust framework for RNA inverse folding with profound implications for biotechnological and therapeutic applications.

0 Citations

0 Influential

6 Altmetric

30.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!