2606.12018v1 Jun 10, 2026 cs.AI

MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning

Jisheng Dang
Jisheng Dang
Citations: 119
h-index: 4
Bimei Wang
Bimei Wang
Citations: 27
h-index: 3
Wencan Zhang
Wencan Zhang
Citations: 305
h-index: 7
Hong Peng
Hong Peng
Citations: 84
h-index: 5
Shangyuan Ma
Shangyuan Ma
Citations: 0
h-index: 0
Binbin Hu
Binbin Hu
Citations: 346
h-index: 9
Qi Tian
Qi Tian
Citations: 47
h-index: 3
Tat-Seng Chua
Tat-Seng Chua
Citations: 23
h-index: 2
Yifan Zhang
Yifan Zhang
Citations: 116
h-index: 5

We propose a multi-agent collaborative framework built upon a lightweight Multimodal Large Language Model (MLLM), specifically designed for social intelligence reasoning. A key feature of our approach is that both the training and inference phases are augmented via knowledge distillation. Within this architecture, multi-modal data pertinent to social intelligence is precisely localized. Furthermore, relevant long-tail events are identified, extracted, and rendered as formatted, explicit text. This formatting strategy prevents critical long-tail information from being overshadowed by head events and environmental noise during the tokenization process. Specifically, we integrate Test-Time Adaptation (TTA) across the entire reasoning pipeline, encompassing the extraction and representation of long-tail events, Chain-of-Thought (CoT) prompting, and self-reflection. This TTA mechanism is also distillation-enhanced, utilizing Low-Rank Adaptation (LoRA) to fine-tune the foundation model exclusively for instance-level reasoning. Extensive evaluations against various open-source and proprietary AI models across multiple benchmarks demonstrate the effectiveness of the proposed framework. With around 30% of training data from IntentTrain, we achieve state-of-the-art results. Codes are available at https://github.com/eeee-sys/MODF-SIR, demo is available at https://huggingface.co/spaces/Harry-1234/MODF-SIR, LoRA is available at https://huggingface.co/Harry-1234/MODF-SIR and the dataset for training router is available at https://huggingface.co/datasets/Harry-1234/IntentRouterTrain.

0 Citations
0 Influential
24.5 Altmetric
122.5 Score
Original PDF
0

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!