2108.07258 Aug 16, 2021 cs.AI

파운데이션 모델(Foundation Models)의 기회와 위험에 관하여

On the Opportunities and Risks of Foundation Models

Niladri S. Chatterji

Citations: 26,418

h-index: 23

Rishi Bommasani

Citations: 15,593

h-index: 12

Drew A. Hudson

Citations: 13,204

h-index: 12

Ehsan Adeli

Citations: 6,925

h-index: 11

R. Altman

Citations: 6,835

h-index: 5

Simran Arora

Citations: 8,558

h-index: 22

Sydney von Arx

Citations: 6,757

h-index: 4

Michael S. Bernstein

Stanford University

Citations: 77,189

h-index: 67

Jeannette Bohg

Citations: 18,737

h-index: 50

Antoine Bosselut

EPFL

Citations: 14,518

h-index: 36

E. Brunskill

Citations: 17,672

h-index: 56

Erik Brynjolfsson

Citations: 46,989

h-index: 73

S. Buch

Citations: 12,122

h-index: 19

Dallas Card

University of Michigan

Citations: 10,178

h-index: 24

Rodrigo Castellon

Citations: 7,144

h-index: 5

Annie S. Chen

Citations: 8,238

h-index: 12

Kathleen A. Creel

Stanford University

Citations: 7,312

h-index: 12

Jared Davis

Citations: 9,579

h-index: 15

Dora Demszky

Citations: 6,695

h-index: 3

Chris Donahue

Stanford

Citations: 9,319

h-index: 19

M. Doumbouya

Citations: 6,681

h-index: 4

Esin Durmus

Stanford University

Citations: 16,216

h-index: 35

Stefano Ermon

Citations: 89,213

h-index: 102

J. Etchemendy

Citations: 9,583

h-index: 20

Kawin Ethayarajh

Citations: 10,160

h-index: 16

L. Fei-Fei

Citations: 7,561

h-index: 11

Chelsea Finn

Citations: 85,953

h-index: 94

Trevor Gale

Citations: 8,780

h-index: 7

Lauren Gillespie

Citations: 6,643

h-index: 3

Karan Goel

Citations: 13,228

h-index: 14

Noah D. Goodman

Citations: 28,351

h-index: 76

S. Grossman

Citations: 6,779

h-index: 8

Neel Guha

Citations: 10,609

h-index: 19

Tatsunori Hashimoto

Citations: 20,387

h-index: 29

Peter Henderson

Citations: 12,851

h-index: 17

John Hewitt

Stanford University

Citations: 13,852

h-index: 17

Daniel E. Ho

Citations: 7,671

h-index: 9

Jenny Hong

Citations: 6,675

h-index: 5

Kyle Hsu

Citations: 7,267

h-index: 9

Jing Huang

Citations: 8,173

h-index: 16

Thomas F. Icard

Citations: 10,813

h-index: 22

Saahil Jain

Citations: 7,067

h-index: 4

Dan Jurafsky

Stanford University

Citations: 65,469

h-index: 109

Pratyusha Kalluri

Citations: 8,934

h-index: 11

Siddharth Karamcheti

Stanford University

Citations: 12,045

h-index: 21

G. Keeling

Citations: 7,329

h-index: 15

Fereshte Khani

Stanford University

Citations: 7,089

h-index: 10

O. Khattab

Citations: 14,345

h-index: 21

Pang Wei Koh

Stanford University

Citations: 22,434

h-index: 29

M. Krass

Citations: 6,844

h-index: 6

Ranjay Krishna

University of Washington

Citations: 23,766

h-index: 37

Rohith Kuditipudi

Citations: 7,223

h-index: 9

Ananya Kumar

Citations: 11,533

h-index: 20

Faisal Ladhak

Citations: 12,796

h-index: 20

Mina Lee

Stanford University

Citations: 7,939

h-index: 9

Tony Lee

Stanford University

Citations: 8,834

h-index: 8

J. Leskovec

Citations: 172,663

h-index: 153

Isabelle Levent

Citations: 6,624

h-index: 2

Xiang Lisa Li

Stanford University

Citations: 15,731

h-index: 14

Xuechen Li

Stanford University

Citations: 13,391

h-index: 19

Tengyu Ma

Citations: 13,018

h-index: 29

Ali Malik

Citations: 6,847

h-index: 9

Christopher D. Manning

Stanford University

Citations: 198,380

h-index: 149

Suvir Mirchandani

Stanford

Citations: 8,430

h-index: 13

E. Mitchell

Citations: 20,538

h-index: 22

Zanele Munyikwa

Citations: 6,648

h-index: 2

Suraj Nair

Stanford University

Citations: 10,063

h-index: 20

A. Narayan

Citations: 7,623

h-index: 11

D. Narayanan

Citations: 16,434

h-index: 21

Benjamin Newman

University of Washington

Citations: 8,553

h-index: 9

Allen Nie

Stanford University

Citations: 7,259

h-index: 11

Juan Carlos Niebles

Salesforce

Citations: 26,446

h-index: 66

H. Nilforoshan

Citations: 7,166

h-index: 8

Julian Nyarko

Citations: 7,068

h-index: 4

Giray Ogut

Citations: 6,616

h-index: 2

Laurel J. Orr

Citations: 9,248

h-index: 12

Isabel Papadimitriou

Citations: 6,660

h-index: 3

J. Park

Citations: 12,620

h-index: 17

C. Piech

Citations: 12,508

h-index: 27

Eva Portelance

Citations: 6,674

h-index: 5

Christopher Potts

Citations: 39,561

h-index: 58

Aditi Raghunathan

Citations: 15,381

h-index: 27

Robert Reich

Citations: 6,638

h-index: 3

Hongyu Ren

Citations: 16,343

h-index: 26

Frieda Rong

Citations: 11,461

h-index: 8

Yusuf H. Roohani

Citations: 8,737

h-index: 16

Camilo Ruiz

Citations: 7,579

h-index: 5

Jack Ryan

Stanford University

Citations: 6,822

h-index: 2

Christopher R'e

Citations: 19,881

h-index: 21

Dorsa Sadigh

Citations: 30,784

h-index: 69

Shiori Sagawa

Citations: 13,501

h-index: 16

Keshav Santhanam

Citations: 11,456

h-index: 17

Andy Shih

Citations: 7,718

h-index: 19

K. Srinivasan

Citations: 10,125

h-index: 16

Alex Tamkin

Stanford University

Citations: 9,586

h-index: 25

Rohan Taori

Stanford

Citations: 9,579

h-index: 16

A. Thomas

Citations: 7,555

h-index: 13

Florian Tramèr

ETH Zürich

Citations: 37,825

h-index: 54

Rose E. Wang

Citations: 7,042

h-index: 7

William Wang

Citations: 8,379

h-index: 5

Bohan Wu

Citations: 6,970

h-index: 10

Jiajun Wu

Stanford University

Citations: 34,534

h-index: 80

Yuhuai Wu

Citations: 22,147

h-index: 37

Sang Michael Xie

Citations: 13,872

h-index: 18

Michihiro Yasunaga

Citations: 24,585

h-index: 40

Jiaxuan You

Citations: 15,967

h-index: 23

M. Zaharia

Citations: 72,272

h-index: 77

Michael Zhang

Citations: 7,072

h-index: 6

Tianyi Zhang

Citations: 22,296

h-index: 12

Xikun Zhang

Stanford University

Citations: 8,388

h-index: 9

Yuhui Zhang

Stanford University

Citations: 13,149

h-index: 21

Lucia Zheng

Citations: 9,017

h-index: 8

Kaitlyn Zhou

Citations: 7,450

h-index: 13

Percy Liang

Citations: 87,393

h-index: 104

광범위한 데이터를 대규모로 학습하여 다양한 다운스트림 작업에 적응할 수 있는 모델들(예: BERT, DALL-E, GPT-3)의 부상으로 인해 AI는 패러다임의 전환을 겪고 있다. 우리는 이러한 모델들의 중요하고 중심적이지만 미완성인 특성을 강조하기 위해 이를 '파운데이션 모델(foundation models)'이라 부른다. 본 보고서는 파운데이션 모델의 능력(예: 언어, 시각, 로봇공학, 추론, 인간 상호작용) 및 기술적 원리(예: 모델 아키텍처, 훈련 절차, 데이터, 시스템, 보안, 평가, 이론)에서부터 응용 분야(예: 법률, 의료, 교육) 및 사회적 영향(예: 불평등, 오용, 경제 및 환경적 영향, 법적 및 윤리적 고려사항)에 이르기까지, 파운데이션 모델의 기회와 위험에 대한 포괄적인 설명을 제공한다. 파운데이션 모델은 표준 딥러닝과 전이 학습에 기반을 두고 있지만, 그 규모로 인해 새로운 창발적 능력이 나타나며, 수많은 작업에 걸친 효율성은 획일화를 촉진한다. 획일화는 강력한 이점을 제공하지만, 파운데이션 모델의 결함이 모든 다운스트림 적응 모델에 상속되기 때문에 주의가 요구된다. 파운데이션 모델의 광범위한 배포가 임박했음에도 불구하고, 우리는 현재 그 창발적 특성으로 인해 이 모델들이 어떻게 작동하는지, 언제 실패하는지, 심지어 무엇을 할 수 있는지에 대한 명확한 이해가 부족한 상태다. 이러한 문제들을 해결하기 위해, 우리는 파운데이션 모델에 대한 중요한 연구의 상당 부분이 그 근본적인 사회기술적 본질에 상응하는 깊이 있는 학제 간 협력을 필요로 할 것이라 믿는다.

Original Abstract

AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their capabilities (e.g., language, vision, robotics, reasoning, human interaction) and technical principles(e.g., model architectures, training procedures, data, systems, security, evaluation, theory) to their applications (e.g., law, healthcare, education) and societal impact (e.g., inequity, misuse, economic and environmental impact, legal and ethical considerations). Though foundation models are based on standard deep learning and transfer learning, their scale results in new emergent capabilities,and their effectiveness across so many tasks incentivizes homogenization. Homogenization provides powerful leverage but demands caution, as the defects of the foundation model are inherited by all the adapted models downstream. Despite the impending widespread deployment of foundation models, we currently lack a clear understanding of how they work, when they fail, and what they are even capable of due to their emergent properties. To tackle these questions, we believe much of the critical research on foundation models will require deep interdisciplinary collaboration commensurate with their fundamentally sociotechnical nature.

6653 Citations

297 Influential

30 Altmetric

7,397.0 Score

Original PDF

AI Analysis

Korean Summary

스탠포드 대학의 CRFM(Center for Research on Foundation Models)에서 발표한 이 보고서는 '파운데이션 모델(Foundation Models)'이라는 새로운 AI 패러다임을 정의하고 심층 분석합니다. 파운데이션 모델은 방대한 데이터로(주로 자기 지도 학습을 통해) 학습되어 다양한 다운스트림 작업에 적응할 수 있는 모델(예: BERT, GPT-3, CLIP)을 지칭합니다. 이 보고서는 모델의 기술적 원리(모델링, 학습, 시스템 등)부터 언어, 비전, 로보틱스 등의 능력, 그리고 의료, 법률 등의 응용 분야를 포괄합니다. 특히 모델의 규모 확장이 가져오는 '창발성(Emergence)'과 단일 모델이 여러 애플리케이션의 기반이 되는 '동질화(Homogenization)' 특성을 중심으로, 기술적 기회와 함께 편향성, 오용, 환경 문제, 법적/윤리적 위험과 같은 사회적 영향을 종합적으로 다룹니다.

Key Innovations

광범위한 데이터에 대한 대규모 자기 지도 학습(Self-supervised learning)과 전이 학습(Transfer learning)의 결합
트랜스포머(Transformer) 아키텍처와 하드웨어 발전을 통한 모델 규모의 급격한 확장(Scale)
별도의 명시적 학습 없이 프롬프트만으로 새로운 작업을 수행하는 인컨텍스트 러닝(In-context learning)과 같은 창발적 능력
단일 모델을 다양한 다운스트림 태스크에 적용하는 적응(Adaptation) 기술(예: 파인튜닝, 프롬프팅)
텍스트, 이미지, 로보틱스 센서 데이터 등을 통합하는 멀티모달(Multimodal) 학습 능력

Learning & Inference Impact

학습 과정에서는 방대한 데이터셋 구축과 막대한 컴퓨팅 자원이 요구되는 '사전 학습(Pre-training)' 단계가 핵심이 되며, 이는 하드웨어와 소프트웨어의 공동 설계(Co-design) 및 분산 학습 시스템의 중요성을 높입니다. 추론 및 응용 단계에서는 처음부터 모델을 학습시키는 대신, 사전 학습된 파운데이션 모델을 특정 태스크에 맞게 '적응(Adaptation)'시키는 방식으로 워크플로우가 변화했습니다. 이는 소량의 데이터로도 높은 성능을 낼 수 있게(Sample efficiency) 해주지만, 파운데이션 모델의 결함이나 편향이 모든 다운스트림 애플리케이션에 전파될 수 있는 단일 실패 지점(Single point of failure) 문제를 야기하기도 합니다.

Technical Difficulty

고급

Estimated implementation complexity based on methodology.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!