2605.28213v1 May 27, 2026 cs.AI

Learning When to Optimize: Verified Optimization Skills from Expert GPU-Kernel Lineages

Xiyu Shi
Xiyu Shi
Citations: 52
h-index: 4
Guanglin Li
Guanglin Li
Citations: 13
h-index: 2
Xiaobing Feng
Xiaobing Feng
Citations: 120
h-index: 6
Shuoming Zhang
Shuoming Zhang
Citations: 26
h-index: 3
Jiacheng Zhao
Jiacheng Zhao
Citations: 270
h-index: 9
Qiuchun Yu
Qiuchun Yu
Citations: 12
h-index: 2
Huimin Cui
Huimin Cui
Citations: 8
h-index: 2
Ruiyuan Xu
Ruiyuan Xu
Citations: 15
h-index: 3
Yang Zhang
Yang Zhang
Citations: 5
h-index: 1

LLM-based agents are increasingly used to generate GPU kernels, but they often know what optimizations to try without knowing when those optimizations are sound. We introduce KLineage, which learns this missing "when" knowledge from expert kernels: instead of relying on forward rollouts, KLineage walks expert implementations backward through validation-gated simplifications and reverses each accepted step into a reusable optimization skill. Each skill records not only the optimization intent, but also where it applies in code, what conditions made it valid, what effect it had, and what failures its assumptions avoid. A downstream LLM materializes these skills on new code surfaces under the same compile/correctness/profile gate. On five expert workloads across two NVIDIA architectures, these lineage-derived skills serve as an effective optimization curriculum, exceeding recent memory-based LLM-kernel baselines in both final kernel quality and optimization efficiency under the same fixed budget. We additionally use a separate 22-instance held-out check as a sanity test against source-case memorization.

0 Citations
0 Influential
4.5 Altmetric
22.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!