2601.22401v3 Jan 29, 2026 cs.AI

제미니를 활용한 준자율적 수학 발견: 에르되스 문제에 대한 사례 연구

Semi-Autonomous Mathematics Discovery with Gemini: A Case Study on the Erdős Problems

Tony Feng

Citations: 53

h-index: 4

Carlo Pagano

Citations: 57

h-index: 4

Dawsen Hwang

Citations: 3,329

h-index: 5

Yuri Chervonyi

Citations: 3,507

h-index: 8

G. Bingham

Citations: 3,372

h-index: 6

Trieu H. Trinh

Citations: 5,111

h-index: 12

Quoc V. Le

Citations: 985

h-index: 6

Thang Luong

Citations: 952

h-index: 6

Junehyuk Jung

Citations: 3,440

h-index: 6

Sang-hyun Kim

Citations: 42

h-index: 3

Sergei Gukov

Citations: 36

h-index: 2

Kaiying Hou

Citations: 36

h-index: 2

Weiyu Li

Citations: 31

h-index: 3

Jiwon Kang

Citations: 28

h-index: 2

Shengtong Zhang

Citations: 28

h-index: 2

Kevin Barreto

Citations: 31

h-index: 3

J. Seo

Citations: 184

h-index: 6

C. Tsai

Citations: 38

h-index: 3

Hyunwoo Choi

Citations: 35

h-index: 3

Youngbeom Jin

Citations: 22

h-index: 1

Hao-An Wu

Citations: 22

h-index: 1

Ruey-An Shiu

Citations: 26

h-index: 2

Yung-Sheng Shih

Citations: 83

h-index: 3

Carl Schildkraut

Citations: 33

h-index: 2

본 연구는 제미니를 사용하여 Bloom의 에르되스 문제 데이터베이스에 '미해결(Open)'로 표시된 700개의 추측을 체계적으로 평가하는 준자율적 수학 발견의 사례 연구를 제시합니다. 우리는 AI 기반의 자연어 검증을 통해 탐색 공간을 좁히는 하이브리드 방법을 사용하며, 그 후 인간 전문가의 평가를 통해 정확성과 참신성을 판단합니다. 데이터베이스에서 '미해결'로 표시된 13개의 문제에 대해 분석한 결과, 5개 문제는 겉보기에는 새로운 자율적 해결책을 통해, 8개 문제는 기존 문헌에서 이미 해결된 내용으로 확인되었습니다. 우리의 연구 결과는 '미해결' 상태가 문제의 난이도보다는 문헌의 가려짐으로 인해 발생했을 가능성을 시사합니다. 또한, AI를 사용하여 수학적 추측을 대규모로 분석하는 과정에서 발생하는 문제점들을 파악하고 논의하며, 특히 문헌 식별의 어려움과 AI에 의한 '무의식적 표절'의 위험성을 강조합니다. 마지막으로, 에르되스 문제에 대한 AI 지원 연구를 통해 얻은 교훈들을 고찰합니다.

Original Abstract

We present a case study in semi-autonomous mathematics discovery, using Gemini to systematically evaluate 700 conjectures labeled 'Open' in Bloom's Erdős Problems database. We employ a hybrid methodology: AI-driven natural language verification to narrow the search space, followed by human expert evaluation to gauge correctness and novelty. We address 13 problems that were marked 'Open' in the database: 5 through seemingly novel autonomous solutions, and 8 through identification of previous solutions in the existing literature. Our findings suggest that the 'Open' status of the problems was through obscurity rather than difficulty. We also identify and discuss issues arising in applying AI to math conjectures at scale, highlighting the difficulty of literature identification and the risk of ''subconscious plagiarism'' by AI. We reflect on the takeaways from AI-assisted efforts on the Erdős Problems.

22 Citations

1 Influential

6 Altmetric

54.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!