2601.20301v1 Jan 28, 2026 cs.CV

압축을 고려한 샤프니스 최소화를 통한 소형 및 강력한 심층 신경망 개발

Towards Compact and Robust DNNs via Compression-aware Sharpness Minimization

Citations: 2

h-index: 1

Citations: 45

h-index: 2

샤프니스 인식 최소화(SAM)는 최근 심층 신경망의 입력 변화에 대한 강건성을 향상시키는 효과적인 기술로 부상했습니다. 그러나 SAM과 온디바이스 심층 신경망 배포의 소형화 요구 사항 간의 상호 작용은 아직 충분히 연구되지 않았습니다. SAM으로 학습된 모델을 단순히 가지치기하면 강건성이 저하될 수 있습니다. 이는 연속적인 파라미터 공간에서의 평탄성이 가지치기에 의해 발생하는 이산적인 구조 변화 하에서 반드시 강건성을 보장하는 것은 아니기 때문입니다. 반대로, 가지치기 후에 SAM을 적용하는 것은 초기, 강건성과는 무관하게 설계된 가지치기 패턴으로 인해 발생하는 아키텍처적 제약으로 인해 근본적으로 제한될 수 있습니다. 이러한 격차를 해소하기 위해, 저희는 압축을 고려한 샤프니스 인식 최소화(C-SAM)라는 프레임워크를 제안합니다. C-SAM은 파라미터 변화가 아닌 마스크 변화에 대한 샤프니스 인지 학습을 적용합니다. C-SAM은 학습 과정에서 가지치기 마스크를 명시적으로 변화시켜 모델 구조에 대한 더 평탄한 손실 지형을 만들고, 모델의 소형화와 입력 변화에 대한 강건성을 동시에 최적화하는 가지치기 패턴을 발견하도록 합니다. ResNet-18, GoogLeNet, MobileNet-V2를 사용하여 CelebA-HQ, Flowers-102, CIFAR-10-C 데이터셋에 대한 광범위한 실험 결과, C-SAM은 강력한 기준 모델보다 지속적으로 더 높은 수준의 인증된 강건성을 달성하며, 최대 42%의 성능 향상을 보였고, 동시에 가지치기되지 않은 모델과 비교 가능한 수준의 작업 정확도를 유지했습니다.

Original Abstract

Sharpness-Aware Minimization (SAM) has recently emerged as an effective technique for improving DNN robustness to input variations. However, its interplay with the compactness requirements of on-device DNN deployments remains less explored. Simply pruning a SAM-trained model can undermine robustness, since flatness in the continuous parameter space does not necessarily translate to robustness under the discrete structural changes induced by pruning. Conversely, applying SAM after pruning may be fundamentally constrained by architectural limitations imposed by an early, robustness-agnostic pruning pattern. To address this gap, we propose Compression-aware ShArpness Minimization (C-SAM), a framework that shifts sharpness-aware learning from parameter perturbations to mask perturbations. By explicitly perturbing pruning masks during training, C-SAM promotes a flatter loss landscape with respect to model structure, enabling the discovery of pruning patterns that simultaneously optimize model compactness and robustness to input variations. Extensive experiments on CelebA-HQ, Flowers-102, and CIFAR-10-C across ResNet-18, GoogLeNet, and MobileNet-V2 show that C-SAM consistently achieves higher certified robustness than strong baselines, with improvements of up to 42%, while maintaining task accuracy comparable to the corresponding unpruned models.

0 Citations

0 Influential

1 Altmetric

5.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!