2603.15260v1 Mar 16, 2026 cs.AI

AGCD: 에이전트 기반 교차 모드 디코딩을 이용한 기상 예측

AGCD: Agent-Guided Cross-Modal Decoding for Weather Forecasting

Yutong Lu

Citations: 1,074

h-index: 8

Yang Liu

Citations: 54

h-index: 2

Guowen Li

Citations: 18

h-index: 2

Shilei Cao

Citations: 67

h-index: 5

Jiashun Cheng

Citations: 1

h-index: 1

Hong Cheng

Citations: 3

h-index: 1

Juepeng Zheng

Citations: 1,643

h-index: 21

Haohuan Fu

Citations: 737

h-index: 15

Jingwen Wu

Citations: 2

h-index: 1

Lin Zhang

Citations: 102

h-index: 1

Junbo Zeng

Citations: 2

h-index: 1

Jiabing Wang

Citations: 0

h-index: 0

Zi Ye

Citations: 8

h-index: 2

Fangyijie Wang

Citations: 51

h-index: 3

M. Jin

Citations: 0

h-index: 0

Yerong Feng

Citations: 131

h-index: 3

정확한 기상 예측은 단순히 격자 단위의 회귀 분석을 넘어, 기상 현상의 일관된 구조와 물리적 제약을 유지해야 합니다. 특히, 자기 회귀 방식으로 예측을 수행할 때 작은 오차가 구조적인 편향으로 증폭될 수 있습니다. 기존의 물리 기반 접근 방식은 일반적으로 아키텍처, 정규화 또는 수치 예보 모델 결합을 통해 전역적인 제약을 가하지만, 실제 적용 단계에서 상태에 따른 적응성과 개별 데이터에 대한 제어 능력이 제한적입니다. 이러한 간극을 해소하기 위해, 본 연구에서는 현재의 다변량 대기 상태를 기반으로 상태에 종속적인 물리적 제약을 생성하고, 예측 모델에 제어 가능하고 재사용 가능한 방식으로 주입하는 "에이전트 기반 교차 모드 디코딩 (Agent-Guided Cross-modal Decoding, AGCD)"이라는 새로운 패러다임을 제안합니다. 구체적으로, 우리는 상태에 종속적인 물리적 제약을 생성하기 위해 다중 에이전트 기상 해설 파이프라인을 설계하며, 다양한 기상 요소를 효율적으로 추출하기 위해 대규모 언어 모델 (MLLM)을 활용합니다. 또한, AGCD는 생성된 제약을 효과적으로 적용하기 위해, 지역 정보를 고려한 다중 스케일 토큰화와 효율적인 물리적 제약 주입을 수행하여 시각적 특징을 개선하고, 동시에 핵심 인터페이스를 변경하지 않고도 이를 수행합니다. WeatherBench 데이터셋에 대한 실험 결과, AGCD는 5.625도 및 1.40625도 두 가지 해상도와 다양한 모델 구조 (일반 모델 및 기상 특화 모델)에서 6시간 예측 성능을 향상시켰으며, 특히 초기 단계의 오차 누적을 줄이고 장기 예측의 안정성을 개선하는 엄격하게 자기 회귀적인 48시간 예측에서도 긍정적인 결과를 보였습니다.

Original Abstract

Accurate weather forecasting is more than grid-wise regression: it must preserve coherent synoptic structures and physical consistency of meteorological fields, especially under autoregressive rollouts where small one-step errors can amplify into structural bias. Existing physics-priors approaches typically impose global, once-for-all constraints via architectures, regularization, or NWP coupling, offering limited state-adaptive and sample-specific controllability at deployment. To bridge this gap, we propose Agent-Guided Cross-modal Decoding (AGCD), a plug-and-play decoding-time prior-injection paradigm that derives state-conditioned physics-priors from the current multivariate atmosphere and injects them into forecasters in a controllable and reusable way. Specifically, We design a multi-agent meteorological narration pipeline to generate state-conditioned physics-priors, utilizing MLLMs to extract various meteorological elements effectively. To effectively apply the priors, AGCD further introduce cross-modal region interaction decoding that performs region-aware multi-scale tokenization and efficient physics-priors injection to refine visual features without changing the backbone interface. Experiments on WeatherBench demonstrate consistent gains for 6-hour forecasting across two resolutions (5.625 degree and 1.40625 degree) and diverse backbones (generic and weather-specialized), including strictly causal 48-hour autoregressive rollouts that reduce early-stage error accumulation and improve long-horizon stability.

0 Citations

0 Influential

10.5 Altmetric

52.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!