2605.29940v1 May 28, 2026 cs.AI

Make LLM Learn to Synthesize from Streaming Experiences through Feedback

Zhen Bi

Citations: 15

h-index: 2

Jungang Lou

Citations: 44

h-index: 3

Zhixuan Chu

Citations: 3

h-index: 1

Zihao Xue

Citations: 6

h-index: 2

Longtao Huang

Citations: 31

h-index: 2

Zeyu Yang

Citations: 3

h-index: 1

Bin Zhu

Citations: 132

h-index: 5

Yan Wang

Citations: 12

h-index: 2

Zhen-Hua Hu

Citations: 12

h-index: 2

Xiongtao Zhang

Citations: 264

h-index: 9

Large language models (LLMs) have been widely adopted for synthetic data generation, significantly reducing annotation costs. However, most existing studies treat synthesis as a set of isolated tasks and overlook a more fundamental question: whether a model can learn to synthesize by accumulating experience from past tasks and transferring it to future ones. In this work, we introduce StreamSynth, a new setting in which synthesis tasks arrive sequentially and experience from historical tasks provides informative signals for future synthesis. To address this setting, we propose SynLearner, a general framework that enables synthesis models to acquire reusable synthesis experience over a task stream. Instead of generating data independently for each task, SynLearner encourages the model to explore diverse synthesis patterns, learn from feedback, and balance sample quality with set-level diversity as tasks evolve. Extensive experiments across multiple benchmarks show that SynLearner effectively leverages experience from earlier tasks to improve synthesis performance on later ones, exhibiting consistent cross-task transferability. These findings provide evidence for the feasibility of StreamSynth and highlight synthetic data generation as an experience-driven process that can benefit from task streams.

0 Citations

0 Influential

4.5 Altmetric

22.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!