2307.09288 Jul 18, 2023 cs.AI

Llama 2: Open Foundation and Fine-Tuned Chat Models

Hugo Touvron

Citations: 63,472

h-index: 17

Thibaut Lavril

Citations: 44,624

h-index: 19

X. Martinet

Citations: 51,266

h-index: 9

M. Lachaux

Citations: 44,165

h-index: 16

Naman Goyal

Citations: 132,454

h-index: 38

Aurélien Rodriguez

Citations: 51,243

h-index: 9

Todor Mihaylov

Citations: 40,702

h-index: 25

Punit Singh Koura

Citations: 37,042

h-index: 9

A. Schelten

Citations: 31,440

h-index: 9

A. Korenev

Citations: 31,296

h-index: 8

Cristian Canton Ferrer

Citations: 35,840

h-index: 15

David Esiobu

Citations: 31,644

h-index: 11

Iliyan Zarov

Citations: 31,166

h-index: 5

Isabel M. Kloumann

Citations: 34,583

h-index: 18

Jenya Lee

Citations: 31,424

h-index: 8

Lukas Blecher

Citations: 31,696

h-index: 9

Marcin Kardas

Citations: 32,607

h-index: 9

M. Kambadur

Citations: 31,937

h-index: 11

Nikolay Bashlykov

Citations: 31,461

h-index: 9

Prajjwal Bhargava

Citations: 32,050

h-index: 12

Puxin Xu

Citations: 32,748

h-index: 9

Robert Stojnic

Citations: 32,846

h-index: 13

Ross Taylor

Citations: 32,338

h-index: 9

Ruan Silva

Citations: 31,318

h-index: 8

Sergey Edunov

Facebook AI Research

Citations: 48,300

h-index: 19

Sharan Narang

Citations: 78,760

h-index: 29

Shruti Bhosale

Citations: 36,492

h-index: 22

Soumya Batra

Citations: 31,522

h-index: 12

Thomas Scialom

Citations: 36,610

h-index: 16

Vedanuj Goswami

Citations: 36,389

h-index: 22

Viktor Kerkez

Citations: 32,419

h-index: 9

Wenyin Fu

Citations: 31,427

h-index: 8

Yasmine Babaei

Citations: 31,501

h-index: 9

Yuchen Zhang

Meta AI

Citations: 31,372

h-index: 8

Diana Liskovich

Citations: 31,661

h-index: 9

Igor Molybog

Citations: 31,610

h-index: 7

J. Reizenstein

Citations: 33,420

h-index: 14

Madian Khabsa

Citations: 38,800

h-index: 30

Guillem Cucurull

Citations: 44,778

h-index: 15

A. Hartshorn

Citations: 19,021

h-index: 22

Andrew Poulton

Citations: 17,985

h-index: 10

Louis Martin

Facebook AI Research

Citations: 21,874

h-index: 11

Angela Fan

Citations: 39,303

h-index: 37

Cynthia Gao

Citations: 20,292

h-index: 15

Kevin R. Stone

Citations: 18,081

h-index: 8

Peter Albert

Citations: 16,680

h-index: 4

Amjad Almahairi

Citations: 21,390

h-index: 21

D. Bikel

Citations: 19,761

h-index: 18

Moya Chen

Citations: 23,355

h-index: 10

Jude Fernandes

Citations: 16,953

h-index: 7

Brian Fuller

Citations: 17,801

h-index: 8

Saghar Hosseini

Citations: 17,845

h-index: 17

Rui Hou

Citations: 17,342

h-index: 8

Hakan Inan

Citations: 17,797

h-index: 8

Yinghai Lu

Citations: 17,768

h-index: 9

Yuning Mao

Citations: 20,211

h-index: 22

Pushkar Mishra

Facebook AI

Citations: 17,498

h-index: 19

Yixin Nie

Citations: 19,531

h-index: 16

Rashi Rungta

Citations: 18,087

h-index: 8

Kalyan Saladi

Citations: 16,863

h-index: 7

Eric Michael Smith

Meta AI

Citations: 20,413

h-index: 18

R. Subramanian

Citations: 16,617

h-index: 2

Xia Tan

Citations: 16,665

h-index: 3

Binh Tang

Citations: 16,972

h-index: 8

Adina Williams

Citations: 18,767

h-index: 15

Jian Xiang Kuan

Citations: 16,628

h-index: 3

Zhengxu Yan

Citations: 16,838

h-index: 7

Jeremy Fu

Citations: 31,479

h-index: 9

Abstract

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. Our models outperform open-source chat models on most benchmarks we tested, and based on our human evaluations for helpfulness and safety, may be a suitable substitute for closed-source models. We provide a detailed description of our approach to fine-tuning and safety improvements of Llama 2-Chat in order to enable the community to build on our work and contribute to the responsible development of LLMs.

16666 Citations

2126 Influential

19 Altmetric

21,013.0 Score

AI Analysis

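Summary

This research paper presents Llama 2, a family of pretrained and fine-tuned large language models (LLMs) released by Meta, ranging from 7 billion (7B) to 70 billion (70B) parameters. Llama 2 was trained on 2 trillion tokens, 40% more than its predecessor, and doubles the context length to 4096 tokens. The dialogue model, Llama 2-Chat, is optimized with supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), and it introduces the Ghost Attention (GAtt) technique to improve consistency across multi-turn dialogue. On benchmarks it outperforms existing open-source models and is on par with some closed-source models (such as ChatGPT). The paper also lays out a concrete methodology for balancing helpfulness and safety, contributing to responsible AI development.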
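The GAtt idea mentioned above can be illustrated with a small data-construction sketch. This is a minimal sketch under assumptions, not the authors' implementation: it assumes that the instruction is attached to every user message when replies are sampled, and that the training sample then keeps the instruction only in the first turn while masking the loss on everything except the final assistant reply. The names `prepend_instruction` and `build_training_sample` are hypothetical.

```python
from typing import List, Tuple

Turn = Tuple[str, str]  # (user message, assistant reply)


def prepend_instruction(instruction: str, user_messages: List[str]) -> List[str]:
    """Step 1 (assumed): attach the instruction to every user message
    so that replies sampled from the model respect it in each turn."""
    return [f"{instruction} {msg}" for msg in user_messages]


def build_training_sample(instruction: str, dialogue: List[Turn]) -> List[Tuple[str, bool]]:
    """Step 2 (assumed): keep the instruction only in the first turn and
    mark only the final assistant reply as contributing to the loss.

    Returns a list of (text, contributes_to_loss) segments.
    """
    segments = []
    last_index = len(dialogue) - 1
    for i, (user, assistant) in enumerate(dialogue):
        user_text = f"{instruction} {user}" if i == 0 else user
        segments.append((user_text, False))            # user text is never trained on
        segments.append((assistant, i == last_index))  # only the last reply is trained on
    return segments


# Hypothetical two-turn dialogue whose replies were sampled with the
# instruction attached to both user messages.
dialogue = [("Hi", "Arr!"), ("Weather?", "Arr, sunny.")]
sample = build_training_sample("Act as a pirate.", dialogue)
```

In this sketch the second user turn carries no instruction, yet the target reply still follows it, which is the behaviour the fine-tuning step is meant to teach.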

Key Innovations

Pretraining on 2 trillion tokens, 40% more than the previous version
A 4096-token context window, twice that of the previous model
Grouped-Query Attention (GQA) for inference efficiency in the larger models (34B, 70B)
Ghost Attention (GAtt) for keeping instructions in effect across multi-turn dialogue
Separate reward models for helpfulness and safety, combined with an iterative RLHF pipeline
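To make the last item concrete, here is one way two separate reward scores could be merged into a single training signal. It is a hedged sketch: the piecewise rule (prefer the safety score for safety-related prompts or when the safety score is low) and the 0.15 default threshold are assumptions that this page does not confirm, and `combined_reward` is a hypothetical name.

```python
def combined_reward(
    safety_score: float,
    helpfulness_score: float,
    is_safety_prompt: bool,
    safety_threshold: float = 0.15,  # assumed default, not stated on this page
) -> float:
    """Pick which reward model's score drives the update for one response.

    The safety score takes priority when the prompt is tagged as
    safety-related or when the response scores below the threshold;
    otherwise the helpfulness score is used.
    """
    if is_safety_prompt or safety_score < safety_threshold:
        return safety_score
    return helpfulness_score


# An ordinary prompt with a safe response is judged on helpfulness.
ordinary = combined_reward(0.9, 0.4, is_safety_prompt=False)
# An unsafe-looking response is judged on safety even for an ordinary prompt.
unsafe = combined_reward(0.1, 0.8, is_safety_prompt=False)
```

Keeping the two scores separate until this point lets each reward model specialize, at the cost of one extra threshold to tune.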

Learning & Inference Impact

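On the training side, a very large dataset and a rigorous cleaning process strengthen the base model's performance, and RLHF (rejection sampling and PPO) built on thousands of high-quality SFT examples and more than one million human preference comparisons improves the model's alignment. On the inference side, applying GQA to the 34B and 70B models shrinks the KV cache, which improves memory efficiency and throughput. In addition, GAtt lets the model retain the constraints of the initial system prompt over long conversations and keep its responses consistent.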
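The KV-cache saving from GQA can be checked with simple arithmetic. The sketch below is illustrative only: the 70B configuration (80 layers, 64 query heads, 8 key/value heads, head dimension 128) and 16-bit cache values are assumptions not stated on this page, and `kv_cache_bytes` is a hypothetical helper.

```python
def kv_cache_bytes(
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    seq_len: int,
    batch_size: int = 1,
    bytes_per_value: int = 2,  # assumes 16-bit cache entries
) -> int:
    """Bytes needed to cache keys and values for every layer and position."""
    keys_and_values = 2  # one tensor for K, one for V
    return (keys_and_values * n_layers * n_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_value)


# Assumed 70B-style configuration at the full 4096-token context.
mha = kv_cache_bytes(n_layers=80, n_kv_heads=64, head_dim=128, seq_len=4096)
gqa = kv_cache_bytes(n_layers=80, n_kv_heads=8, head_dim=128, seq_len=4096)

print(mha / 2**30, gqa / 2**30)  # 10.0 1.25 (GiB per sequence)
```

Under these assumptions, sharing each key/value head across eight query heads cuts the per-sequence cache by a factor of eight, which is where the memory and throughput gains come from.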

Technical Difficulty

Intermediate

Estimated implementation complexity based on methodology.
