2602.07309v1 Feb 07, 2026 cs.IR

Semantic Search At LinkedIn

Yubo Wang

Citations: 55

h-index: 4

Jingwei Wu

Citations: 389

h-index: 6

Benjamin Le

Citations: 0

h-index: 0

Xueying Lu

Citations: 42

h-index: 2

Igor Lapchuk

Citations: 0

h-index: 0

Jianqiang Shen

Citations: 26

h-index: 3

Raghavan Muthuregunathan

Citations: 66

h-index: 4

Abhinav Gupta

Citations: 1

h-index: 1

Mathew Teoh

Citations: 0

h-index: 0

Wenjing Zhang

Citations: 16

h-index: 2

Rajat Arora

Citations: 4

h-index: 1

Ping Liu

Citations: 67

h-index: 4

Muchen Wu

Citations: 3

h-index: 1

Fedor Borisyuk

Citations: 435

h-index: 5

Sriram Vasudevan

Citations: 12

h-index: 2

Guoyao Li

Citations: 35

h-index: 2

Shaobo Zhang

Citations: 24

h-index: 3

Yuchin Juan

Citations: 10

h-index: 3

Kayhan Behdin

Citations: 175

h-index: 8

Liming Dong

Citations: 4

h-index: 1

Kai Yang

Citations: 142

h-index: 4

Shusen Jing

Citations: 0

h-index: 0

R. Pothamsetty

Citations: 2

h-index: 1

Sophie Yanying Sheng

Citations: 0

h-index: 0

Vitaly Abdrashitov

Citations: 108

h-index: 4

Yang Zhao

Citations: 161

h-index: 6

Lin Su

Citations: 6

h-index: 1

Xiaoqing Wang

Citations: 9

h-index: 2

Chujie Zheng

Citations: 3

h-index: 1

Sarang Metkar

Citations: 0

h-index: 0

Rupesh Gupta

Citations: 4

h-index: 1

D. Racca

Citations: 71

h-index: 4

Madhumitha Mohan

Citations: 2

h-index: 1

Yanbo Li

Citations: 3

h-index: 1

Haojun Li

Citations: 1

h-index: 1

S. Gandhi

Citations: 6

h-index: 2

Chetan Bhole

Citations: 3

h-index: 1

Ali Hooshmand

Citations: 0

h-index: 0

Xin Yang

Citations: 39

h-index: 3

Jiajun Zhang

Citations: 15

h-index: 1

Adam Coler

Citations: 0

h-index: 0

Xiaojing Ma

Citations: 40

h-index: 2

Sundara Raman Ramachandran

Citations: 2

h-index: 1

Morteza Ramezani

Citations: 37

h-index: 1

Lijuan Zhang

Citations: 9

h-index: 2

Richard Li

Citations: 47

h-index: 2

Jian Sheng

Citations: 6

h-index: 2

Chanh Nguyen

Citations: 153

h-index: 2

Chuanrui Zhu

Citations: 3

h-index: 1

Claire Zhang

Citations: 5

h-index: 1

Jiahao Xu

Citations: 27

h-index: 2

D. Kulkarni

Citations: 285

h-index: 2

Qing Lan

Citations: 28

h-index: 2

Arvind Subramaniam

Citations: 0

h-index: 0

Steven Shimizu

Citations: 121

h-index: 1

Yanning Chen

Citations: 121

h-index: 1

Zhipeng Wang

Citations: 16

h-index: 3

Ran He

Citations: 5

h-index: 1

Zhengze Zhou

Citations: 9

h-index: 2

Qing-Wen Song

Citations: 1

h-index: 1

Yun Dai

Citations: 153

h-index: 3

Caleb Johnson

Citations: 10

h-index: 2

Shaghayegh Gharghabi

Citations: 1,359

h-index: 7

Gokulraj Mohanasundaram

Citations: 13

h-index: 1

Juan Bottaro

Citations: 0

h-index: 0

Santhosh Sachindran

Citations: 0

h-index: 0

Yunxiang Ren

Citations: 6

h-index: 2

Cheng-lin Jiang

Citations: 1

h-index: 1

Di Mo

Citations: 0

h-index: 0

Luke Simon

Citations: 0

h-index: 0

K. Q. Shen

Citations: 8

h-index: 2

A. F. Baarzi

Citations: 196

h-index: 6

Yen-Chi Chen

Citations: 0

h-index: 0

Qi Guo

Citations: 399

h-index: 7

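Semantic search with large language models (LLMs) retrieves information by meaning rather than keyword overlap, but scaling it requires substantial gains in inference efficiency. This paper presents LinkedIn's LLM-based semantic search framework for AI Job Search and AI People Search. The framework combines an LLM relevance judge, embedding-based retrieval, and a compact Small Language Model trained via multi-teacher distillation, jointly optimizing relevance and user engagement. An inference architecture that integrates model pruning, context compression, and text-embedding hybrid interactions raises ranking throughput by more than 75x under a fixed latency constraint while keeping NDCG near the teacher model's level. The result is one of the first production LLM-based ranking systems with efficiency comparable to traditional approaches, along with significant gains in quality and user engagement.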

Original Abstract

Semantic search with large language models (LLMs) enables retrieval by meaning rather than keyword overlap, but scaling it requires major inference efficiency advances. We present LinkedIn's LLM-based semantic search framework for AI Job Search and AI People Search, combining an LLM relevance judge, embedding-based retrieval, and a compact Small Language Model trained via multi-teacher distillation to jointly optimize relevance and engagement. A prefill-oriented inference architecture co-designed with model pruning, context compression, and text-embedding hybrid interactions boosts ranking throughput by over 75x under a fixed latency constraint while preserving near-teacher-level NDCG, enabling one of the first production LLM-based ranking systems with efficiency comparable to traditional approaches and delivering significant gains in quality and user engagement.
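
The abstract reports ranking quality as NDCG relative to a teacher model. As a reminder of what that metric measures, here is a minimal sketch of NDCG@k in Python. It assumes the common linear-gain form (gain equal to the graded label, discount of log2(rank + 1)); the abstract does not state which NDCG variant or cutoff the authors use, and the function names and example labels below are hypothetical.

```python
import math


def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k graded labels.

    `relevances` lists the labels of the returned items in ranked order.
    Linear gain is an assumption; some setups use 2**rel - 1 instead.
    """
    # enumerate() is 0-based, so position 1 gets a discount of log2(2) = 1.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k):
    """DCG of the given ranking divided by the DCG of the ideal ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical graded labels (0 = irrelevant, 3 = highly relevant),
# listed in the order each ranker returned the documents.
ranking_a = [3, 2, 3, 0, 1]
ranking_b = [3, 3, 2, 1, 0]

print(ndcg_at_k(ranking_a, k=5))
print(ndcg_at_k(ranking_b, k=5))
```

A ranking that is already sorted by label scores exactly 1.0, so a "near-teacher-level" result means the compact model's score on the same labeled queries sits close to the teacher's.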
