2601.13546v1 Jan 20, 2026 cs.AI

ChatAD: 추론 능력 강화된 시계열 이상 탐지 모델 - 다중 대화 기반 지시 진화

ChatAD: Reasoning-Enhanced Time-Series Anomaly Detection with Multi-Turn Instruction Evolution

Chang Xu

Citations: 44

h-index: 3

Chuheng Zhang

Citations: 58

h-index: 3

Ming Jin

Citations: 20

h-index: 3

Xiaoguang Liu

Citations: 52

h-index: 5

Gang Wang

Citations: 0

h-index: 0

Hui Sun

Citations: 50

h-index: 5

Haonan Xie

Citations: 25

h-index: 3

Hao Li

Citations: 29

h-index: 2

Yuhao Huang

Citations: 0

h-index: 0

Jiang Bian

Citations: 103

h-index: 7

LLM(대규모 언어 모델) 기반 이상 탐지(AD)는 시계열(TS) 데이터에서 발생하는 이상 현상에 대한 이해와 설명 능력을 향상시키는 데 도움을 줍니다. 기존 방법들은 추론 능력 부족, 미흡한 다중 대화 능력, 그리고 제한적인 일반화 능력을 가진다는 문제점을 안고 있습니다. 이러한 문제점을 해결하기 위해, 본 연구에서는 1) 다중 에이전트 기반의 시계열 진화 알고리즘인 TSEvol을 제안합니다. 2) AD 추론 및 다중 대화 데이터셋인 TSEData-20K를 소개하고, ChatAD-Llama3-8B, Qwen2.5-7B, Mistral-7B를 포함하는 Chatbot 모델 패밀리를 개발합니다. 3) ChatAD의 다양한 작업에서의 일반화 능력을 향상시키기 위해 TS Kahneman-Tversky Optimization (TKTO) 방법을 제안합니다. 마지막으로, 4) ChatAD 및 9개의 기준 모델의 성능을 평가하기 위한 LLM 기반 학습 기반 이상 탐지 벤치마크인 LLADBench를 제안합니다. 저희의 세 가지 ChatAD 모델은 정확도에서 최대 34.50%, F1 점수에서 34.71%, 그리고 오탐 감소에서 37.42%의 상당한 성능 향상을 보였습니다. 또한, TKTO를 통해 최적화된 ChatAD 모델은 분류, 예측, 그리고 결측치 보완 작업에서 뛰어난 추론 능력과 작업 간 일반화 능력을 보여줍니다.

Original Abstract

LLM-driven Anomaly Detection (AD) helps enhance the understanding and explanatory abilities of anomalous behaviors in Time Series (TS). Existing methods face challenges of inadequate reasoning ability, deficient multi-turn dialogue capability, and narrow generalization. To this end, we 1) propose a multi-agent-based TS Evolution algorithm named TSEvol. On top of it, we 2) introduce the AD reasoning and multi-turn dialogue Dataset TSEData-20K and contribute the Chatbot family for AD, including ChatAD-Llama3-8B, Qwen2.5-7B, and Mistral-7B. Furthermore, 3) we propose the TS Kahneman-Tversky Optimization (TKTO) to enhance ChatAD's cross-task generalization capability. Lastly, 4) we propose a LLM-driven Learning-based AD Benchmark LLADBench to evaluate the performance of ChatAD and nine baselines across seven datasets and tasks. Our three ChatAD models achieve substantial gains, up to 34.50% in accuracy, 34.71% in F1, and a 37.42% reduction in false positives. Besides, via KTKO, our optimized ChatAD achieves competitive performance in reasoning and cross-task generalization on classification, forecasting, and imputation.

0 Citations

0 Influential

3.5 Altmetric

17.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!