2604.12350v1 Apr 14, 2026 cs.LG

대규모 언어 모델을 활용한 제어 가능한 분자 최적화를 위한 스캐폴드 기반 선호도 트리플렛

Scaffold-Conditioned Preference Triplets for Controllable Molecular Optimization with Large Language Models

Huaimin Wang

Citations: 371

h-index: 11

Liang Xiong

Citations: 1,400

h-index: 4

Sen Yang

Citations: 13

h-index: 2

Kele Xu

Citations: 227

h-index: 8

Zhifeng Gao

Citations: 1,109

h-index: 12

Yi Xiong

Citations: 0

h-index: 0

Xiaohong Ji

Citations: 176

h-index: 8

분자 특성 최적화는 신약 개발의 핵심이지만, 많은 딥러닝 방법은 블랙박스 방식으로 작동하며 스캐폴드 보존에 대한 제어가 제한적이어서 불안정하거나 생물학적으로 타당하지 않은 결과물을 생성하는 경우가 많습니다. 대규모 언어 모델(LLM)은 유망한 분자 생성 도구이지만, 화학적 지식을 기반으로 한 선호도 지도 및 체계적인 데이터 관리가 부족하여 최적화가 제한됩니다. 본 연구에서는 **스캐폴드 기반 선호도 트리플렛(SCPT)**이라는 파이프라인을 소개합니다. SCPT는 스캐폴드 정렬 및 유효성, 합성 가능성, 의미 있는 특성 개선을 위한 화학 기반 필터를 사용하여 유사성 제약 조건 하에 <스캐폴드, 더 나은 분자, 더 나쁜 분자> 형태의 트리플렛을 구성합니다. 이러한 선호도를 활용하여 사전 학습된 분자 LLM을 조건부 편집기로 조정함으로써, 스캐폴드를 유지하면서 특성을 개선하는 편집을 가능하게 합니다. 단일 및 다중 목표 벤치마크에서 SCPT는 최적화 성공률과 특성 개선 효과를 향상시키면서, 경쟁적인 기준 모델보다 높은 스캐폴드 유사성을 유지합니다. 대표적인 LLM이 아닌 분자 최적화 방법에 비해, SCPT로 학습된 LLM은 스캐폴드 제약 및 다중 목표 최적화에 더 적합합니다. 또한, 단일 특성 및 두 특성 지도에 대한 학습 모델은 세 특성 작업에 효과적으로 일반화되어, 제한적인 고차원 지도 하에서도 유망한 외삽적 일반화 능력을 보여줍니다. SCPT는 또한 제어 가능한 데이터 구성 기능을 제공하여 예측 가능한 유사성-개선 경계를 생성하며, 이를 통해 다양한 최적화 환경에 대한 체계적인 적응이 가능합니다.

Original Abstract

Molecular property optimization is central to drug discovery, yet many deep learning methods rely on black-box scoring and offer limited control over scaffold preservation, often producing unstable or biologically implausible edits. While large language models (LLMs) are promising molecular generators, optimization remains constrained by the lack of chemistry-grounded preference supervision and principled data curation. We introduce \textbf{Scaffold-Conditioned Preference Triplets (SCPT)}, a pipeline that constructs similarity-constrained triplets $\langle\text{scaffold}, \text{better}, \text{worse}\rangle$ via scaffold alignment and chemistry-driven filters for validity, synthesizability, and meaningful property gains. Using these preferences, we align a pretrained molecular LLM as a conditional editor, enabling property-improving edits that retain the scaffold. Across single- and multi-objective benchmarks, SCPT improves optimization success and property gains while maintaining higher scaffold similarity than competitive baselines. Compared with representative non-LLM molecular optimization methods, SCPT-trained LLMs are better suited to scaffold-constrained and multi-objective optimization. In addition, models trained on single-property and two-property supervision generalize effectively to three-property tasks, indicating promising extrapolative generalization under limited higher-order supervision. SCPT also provides controllable data-construction knobs that yield a predictable similarity-gain frontier, enabling systematic adaptation to diverse optimization regimes.

0 Citations

0 Influential

6 Altmetric

30.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!