2602.07044v1 Feb 04, 2026 cs.CV

PipeMFL-240K: 파이프라인 자기장 누설 이미징에서의 객체 탐지를 위한 대규모 데이터셋 및 벤치마크

PipeMFL-240K: A Large-scale Dataset and Benchmark for Object Detection in Pipeline Magnetic Flux Leakage Imaging

Tianyi Qu

Citations: 8

h-index: 1

Songxiao Yang

Citations: 21

h-index: 3

Haolin Wang

Citations: 31

h-index: 4

Huadong Song

Citations: 357

h-index: 7

Xiaoting Guo

Citations: 49

h-index: 4

Wenguang Hu

Citations: 12

h-index: 2

Guan-Hong Liu

Citations: 19

h-index: 2

Honghe Chen

Citations: 13

h-index: 2

Yafei Ou

Citations: 145

h-index: 7

파이프라인의 안전성은 산업 안전 및 환경 보호에 매우 중요하며, 자기장 누설(MFL) 탐지는 주요 비파괴 검사 기술입니다. 딥러닝을 통해 MFL 해석을 자동화할 수 있다는 잠재력이 있지만, 신뢰할 수 있는 모델 개발은 대규모 공개 데이터셋 및 벤치마크의 부족으로 인해 어려움을 겪고 있으며, 이는 공정한 비교 및 재현 가능한 평가를 어렵게 만듭니다. 본 연구에서는 복잡한 객체 탐지를 위한 대규모의 정교하게 주석이 달린 데이터셋 및 벤치마크인 **PipeMFL-240K**를 소개합니다. PipeMFL-240K는 실제 검사 복잡성을 반영하며, 다음과 같은 독특한 과제를 제시합니다. (i) **12**개의 범주에 대한 극단적인 불균형 분포, (ii) 종종 몇 개의 픽셀로 구성된 매우 작은 객체의 높은 비율, (iii) 상당한 클래스 내 변동성. 이 데이터셋은 약 **1,480**km에 이르는 11개의 파이프라인에서 수집된 **240,320**장의 이미지와 **191,530**개의 고품질 바운딩 박스 주석을 포함합니다. 최첨단 객체 탐지기를 사용하여 광범위한 실험을 수행하여 기준 성능을 확립했습니다. 실험 결과는 최신 탐지기가 여전히 MFL 데이터의 고유한 특성으로 인해 어려움을 겪고 있으며, 개선될 여지가 많다는 것을 보여줍니다. PipeMFL-240K는 향후 연구를 위한 신뢰할 수 있고 도전적인 테스트 환경을 제공합니다. 파이프라인 MFL 검사에 대한 최초의 공개 데이터셋이자 이 규모와 범위를 가진 최초의 벤치마크로서, 효율적인 파이프라인 진단 및 유지 보수 계획 수립을 위한 중요한 기반을 제공하며, MFL 기반 파이프라인 안전성 평가 분야의 알고리즘 혁신 및 재현 가능한 연구를 가속화할 것으로 예상됩니다.

Original Abstract

Pipeline integrity is critical to industrial safety and environmental protection, with Magnetic Flux Leakage (MFL) detection being a primary non-destructive testing technology. Despite the promise of deep learning for automating MFL interpretation, progress toward reliable models has been constrained by the absence of a large-scale public dataset and benchmark, making fair comparison and reproducible evaluation difficult. We introduce \textbf{PipeMFL-240K}, a large-scale, meticulously annotated dataset and benchmark for complex object detection in pipeline MFL pseudo-color images. PipeMFL-240K reflects real-world inspection complexity and poses several unique challenges: (i) an extremely long-tailed distribution over \textbf{12} categories, (ii) a high prevalence of tiny objects that often comprise only a handful of pixels, and (iii) substantial intra-class variability. The dataset contains \textbf{240,320} images and \textbf{191,530} high-quality bounding-box annotations, collected from 11 pipelines spanning approximately \textbf{1,480} km. Extensive experiments are conducted with state-of-the-art object detectors to establish baselines. Results show that modern detectors still struggle with the intrinsic properties of MFL data, highlighting considerable headroom for improvement, while PipeMFL-240K provides a reliable and challenging testbed to drive future research. As the first public dataset and the first benchmark of this scale and scope for pipeline MFL inspection, it provides a critical foundation for efficient pipeline diagnostics as well as maintenance planning and is expected to accelerate algorithmic innovation and reproducible research in MFL-based pipeline integrity assessment.

0 Citations

0 Influential

3.5 Altmetric

17.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!