2602.08213v1 Feb 09, 2026 cs.LG

DrugR: LLM 기반 명시적 추론을 통한 분자 의약품 최적화

DrugR: Optimizing Molecular Drugs through LLM-based Explicit Reasoning

Yukun Yan

Citations: 772

h-index: 12

Haoran Liu

Citations: 134

h-index: 4

Zhen Zeng

Citations: 43

h-index: 3

Yuxuan Chen

Citations: 28

h-index: 4

Yunduo Xiao

Citations: 1

h-index: 1

분자 생성 및 최적화는 화학 분야의 기본적인 과제입니다. 특히 강력한 지식 저장 능력과 상호 작용 기능을 갖춘 대규모 언어 모델(LLM)과 같은 지능형 도구의 빠른 발전은 새로운 패러다임을 제시했습니다. 그러나 LLM의 근본적인 과제는 분자 구조와 약리학적 특성 간의 복잡하고 암묵적인 관계와 이에 상응하는 레이블링된 데이터의 부족에 있습니다. 이러한 격차를 해소하기 위해, 우리는 LLM 기반 방법인 DrugR을 제안합니다. DrugR은 최적화 과정에 명시적이고 단계별 약리학적 추론을 도입합니다. 우리의 접근 방식은 도메인 특화된 지속적인 사전 훈련, 역방향 데이터 엔지니어링을 통한 감독적 미세 조정, 그리고 자기 균형 다중 수준 강화 학습을 통합합니다. 이 프레임워크를 통해 DrugR은 원래 분자의 핵심 효능을 유지하면서 주요 ADMET 특성을 효과적으로 개선할 수 있습니다. 실험 결과는 DrugR이 구조적 유사성이나 표적 결합 친화도를 손상시키지 않고 여러 특성에 걸쳐 포괄적인 개선을 달성한다는 것을 보여줍니다. 더욱 중요한 점은, DrugR의 명시적 추론 과정은 각 최적화 단계에 대한 명확하고 해석 가능한 근거를 제공하여 실행 가능한 설계 통찰력을 제공하고 자동화된, 지식 기반 과학적 발견으로 나아가는 데 기여합니다. 우리의 코드와 모델 체크포인트는 향후 연구를 촉진하기 위해 공개적으로 제공됩니다.

Original Abstract

Molecule generation and optimization is a fundamental task in chemical domain. The rapid development of intelligent tools, especially large language models (LLMs) with powerful knowledge reserves and interactive capabilities, has provided new paradigms for it. Nevertheless, the intrinsic challenge for LLMs lies in the complex implicit relationship between molecular structure and pharmacological properties and the lack of corresponding labeled data. To bridge this gap, we propose DrugR, an LLM-based method that introduces explicit, step-by-step pharmacological reasoning into the optimization process. Our approach integrates domain-specific continual pretraining, supervised fine-tuning via reverse data engineering, and self-balanced multi-granular reinforcement learning. This framework enables DrugR to effectively improve key ADMET properties while preserving the original molecule's core efficacy. Experimental results demonstrate that DrugR achieves comprehensive enhancement across multiple properties without compromising structural similarity or target binding affinity. Importantly, its explicit reasoning process provides clear, interpretable rationales for each optimization step, yielding actionable design insights and advancing toward automated, knowledge-driven scientific discovery. Our code and model checkpoints are open-sourced to foster future research.

1 Citations

0 Influential

6 Altmetric

31.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!