2602.08145v1 Feb 04, 2026 cs.LG

신뢰성과 책임성을 갖춘 기초 모델: 종합적인 연구

Reliable and Responsible Foundation Models: A Comprehensive Survey

F. Tramèr

Citations: 1,509

h-index: 16

Huaxiu Yao

Citations: 218

h-index: 8

Rishi Bommasani

Citations: 15,593

h-index: 12

Elias Stengel-Eskin

Citations: 1,189

h-index: 19

Huan Zhang

Citations: 173

h-index: 6

Zhaoyang Wang

Citations: 181

h-index: 6

Y. Tsvetkov

Citations: 498

h-index: 7

Jaehong Yoon

Citations: 271

h-index: 6

A. Bibi

Citations: 271

h-index: 9

Xinyu Yang

Citations: 405

h-index: 9

Mohit Bansal

Citations: 30

h-index: 4

Philip Torr

Citations: 946

h-index: 15

Wenjie Qu

Citations: 206

h-index: 8

Kaidi Xu

Citations: 43

h-index: 2

Junlin Han

Citations: 11

h-index: 2

Jinqi Luo

Citations: 21

h-index: 3

Wangchunshu Zhou

Citations: 130

h-index: 3

Xiyao Wang

Citations: 51

h-index: 2

Shengbang Tong

Citations: 75

h-index: 5

Lingfeng Shen

Citations: 61

h-index: 2

Rafael Rafailov

Citations: 16,817

h-index: 25

Runjia Li

University of Oxford

Citations: 308

h-index: 8

Yiyang Zhou

Citations: 1,487

h-index: 15

Chenhang Cui

Citations: 1,051

h-index: 10

Yu Wang

Citations: 16

h-index: 3

Wen-qing Zheng

Citations: 6

h-index: 1

Huichi Zhou

Citations: 16

h-index: 3

Jindong Gu

Citations: 103

h-index: 7

Zhaorun Chen

Citations: 76

h-index: 4

Peng Xia

Citations: 185

h-index: 4

Tony Lee

Citations: 7

h-index: 1

Thomas M. Zollo

Citations: 3

h-index: 1

Vikash Sehwag

Citations: 46

h-index: 4

Jixuan Leng

Citations: 14

h-index: 2

Jiuhai Chen

Citations: 19

h-index: 2

Yuxin Wen

Citations: 70

h-index: 5

Zhun Deng

Citations: 8

h-index: 1

Linjun Zhang

Citations: 28

h-index: 1

Pavel Izmailov

Citations: 17

h-index: 2

Pang Wei Koh

Citations: 15

h-index: 3

A. Wilson

Citations: 26

h-index: 2

Jiaheng Zhang

Citations: 2

h-index: 1

James Zou

Citations: 88

h-index: 5

Cihang Xie

Citations: 392

h-index: 8

Hao Wang

Citations: 24

h-index: 2

Julian McAuley

Citations: 685

h-index: 11

David Alvarez-Melis

Citations: 126

h-index: 7

Suman Jana

Citations: 152

h-index: 4

René Vidal

Citations: 218

h-index: 4

Filippos Kokkinos

Citations: 380

h-index: 9

Beidi Chen

Citations: 103

h-index: 6

Christopher Callison-Burch

Citations: 1,392

h-index: 15

대규모 언어 모델(LLM), 다중 모달 대규모 언어 모델(MLLM), 이미지 생성 모델(텍스트-이미지 모델 및 이미지 편집 모델), 비디오 생성 모델과 같은 기초 모델은 법률, 의학, 교육, 금융, 과학 등 다양한 분야에서 광범위하게 활용되는 필수적인 도구로 자리 잡았습니다. 이러한 모델이 실제 환경에 점점 더 많이 적용됨에 따라, 학계, 산업계 및 정부는 이러한 모델의 신뢰성과 책임성을 확보하는 것이 매우 중요해졌습니다. 본 연구는 기초 모델의 신뢰성과 책임성 있는 개발에 대한 종합적인 개요를 제공합니다. 우리는 편향 및 공정성, 보안 및 개인 정보 보호, 불확실성, 설명 가능성, 데이터 분포 변화 등 중요한 문제들을 탐구합니다. 또한, 환각 현상과 같은 모델의 한계점과 정렬(alignment) 및 인공지능 생성 콘텐츠(AIGC) 탐지 방법 등을 다룹니다. 각 영역별로 현재 연구 동향을 검토하고 구체적인 향후 연구 방향을 제시합니다. 또한, 이러한 영역들 간의 상호 연관성을 논의하며, 공통적인 과제들을 강조합니다. 본 연구가 강력하면서도 윤리적이고, 신뢰할 수 있으며, 안정적이고, 사회적으로 책임 있는 기초 모델 개발에 기여하기를 바랍니다.

Original Abstract

Foundation models, including Large Language Models (LLMs), Multimodal Large Language Models (MLLMs), Image Generative Models (i.e, Text-to-Image Models and Image-Editing Models), and Video Generative Models, have become essential tools with broad applications across various domains such as law, medicine, education, finance, science, and beyond. As these models see increasing real-world deployment, ensuring their reliability and responsibility has become critical for academia, industry, and government. This survey addresses the reliable and responsible development of foundation models. We explore critical issues, including bias and fairness, security and privacy, uncertainty, explainability, and distribution shift. Our research also covers model limitations, such as hallucinations, as well as methods like alignment and Artificial Intelligence-Generated Content (AIGC) detection. For each area, we review the current state of the field and outline concrete future research directions. Additionally, we discuss the intersections between these areas, highlighting their connections and shared challenges. We hope our survey fosters the development of foundation models that are not only powerful but also ethical, trustworthy, reliable, and socially responsible.

1 Citations

0 Influential

12.5 Altmetric

63.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!