2605.01888v1 May 03, 2026 cs.CV

AFFormer: 채널 왜곡 환경에서의 V2X 협력 인식을 위한 적응형 특징 융합 트랜스포머

AFFormer: Adaptive Feature Fusion Transformer for V2X Cooperative Perception under Channel Impairments

Xi Zhou

Citations: 128

h-index: 3

Tao Huang

Citations: 93

h-index: 2

Qing-Long Han

Citations: 133

h-index: 5

Rana Abbas

Citations: 0

h-index: 0

M. Azghadi

Citations: 4,237

h-index: 33

정확한 3차원 객체 인식은 자율 주행 차량의 안전을 보장하는 데 필수적입니다. 차량 대 모든 것(V2X) 통신을 활용하여 인식 데이터를 공유하는 협력 인지는 인식 성능을 향상시키지만, 노이즈, 페이딩, 간섭과 같은 채널 왜곡에 취약합니다. 본 연구는 일반적인 채널 왜곡 환경을 반영하는 통신 조건에서 V2X 협력 인식의 안정성을 향상시키는 것을 목표로 합니다. 본 논문에서는 시간, 에이전트 간, 공간 상관관계를 모델링하여 손상된 특징의 부정적인 영향을 완화하는 트랜스포머 기반 프레임워크인 Adaptive Feature Fusion Transformer (AFFormer)를 제안합니다. AFFormer는 세 가지 핵심 모듈을 도입합니다. 첫째, 에이전트와 시간 경계를 넘어 맥락 인식 융합을 위한 Multi-Agent and Temporal Aggregation, 둘째, 공간 의존성을 효율적으로 모델링하기 위한 Dual Spatial Attention, 셋째, 엔트로피 기반으로 융합된 특징을 개선하기 위한 Uncertainty-Guided Fusion입니다. 또한, 교사-학생 지식 증류 전략을 통해 융합된 특징을 신뢰할 수 있는 초기 협력 감독과 일치시켜 안정성을 더욱 향상시킵니다. AFFormer는 V2XSet 및 DAIR-V2X 데이터 세트에서 검증되었으며, 이상적인 조건과 왜곡된 통신 조건 모두에서 기존 방법보다 우수한 성능을 보였습니다. 이는 통신으로 인한 특징 손실에 대한 향상된 안정성을 입증하는 동시에 경쟁력 있는 효율성-정확성 균형을 유지합니다.

Original Abstract

Accurate 3D object detection is essential for ensuring the safety of autonomous vehicles. Cooperative perception, which leverages vehicle-to-everything (V2X) communication to share perceptual data, enhances detection but is vulnerable to channel impairments, such as noise, fading, and interference. To strengthen the reliability of intelligent transportation systems, this work improves the robustness of V2X cooperative perception under communication conditions that reflect common channel impairments. This paper proposes an Adaptive Feature Fusion Transformer (AFFormer), a Transformer-based framework that mitigates the adverse effects of corrupted features by modeling temporal, inter-agent, and spatial correlations. AFFormer introduces three key modules: Multi-Agent and Temporal Aggregation for context-aware fusion across agents and over time, Dual Spatial Attention for efficient modeling of spatial dependencies, and Uncertainty-Guided Fusion for entropy-driven refinement of fused features. A teacher-student knowledge distillation strategy further enhances robustness by aligning fused features with reliable early-collaboration supervision. AFFormer is validated on the V2XSet and DAIR-V2X datasets, where it consistently outperforms existing methods under both ideal and impaired communication conditions, demonstrating improved robustness to communication-induced feature degradation while maintaining a competitive efficiency-accuracy trade-off.

1 Citations

1 Influential

16.5 Altmetric

85.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!