2603.00955v1 Mar 01, 2026 stat.ME

가짜 발견률(False Discovery Rate)을 넘어: 그룹화된 변수 선택을 위한 단계별 그룹 SLOPE 접근 방식

Beyond False Discovery Rate: A Stepdown Group SLOPE Approach for Grouped Variable Selection

Xuelin Zhang

Citations: 24

h-index: 3

Jingxuan Liang

Citations: 36

h-index: 2

Xinyue Liu

Citations: 0

h-index: 0

Hong Chen

Citations: 6

h-index: 2

Biqin Song

Citations: 164

h-index: 3

고차원 특징 선택은 통계적 검정력을 유지하면서 k-가족 전체 오류율(k-FWER) 및 가짜 발견 비율(FDP)과 같은 다중 오류 지표를 엄격하게 제어해야 하는 경우가 많습니다. 하지만 기존의 일부 방법은 기대되는 가짜 발견율(FDR)을 제어하는 데만 국한되어 있으며, 정렬된 L-1 페널티 추정(SLOPE)과 같이 변수들의 그룹 구조를 활용하지 못합니다. 본 연구에서는 Lehmann-Romano의 단계별 규칙을 SLOPE에 통합하여 k-FWER 및 FDP 임계값 하에서 유한 표본 보장을 달성할 수 있는 통합 최적화 절차인 Group Stepdown SLOPE를 소개합니다. 특히, 직교 설계 하에서 사용자가 지정한 수준으로 k-FWER 및 FDP를 증명적으로 제한하는 폐쇄형 정규화 시퀀스를 도출하고, 이러한 결과를 gk-SLOPE 및 gF-SLOPE를 통해 그룹 설정으로 확장하여, 각각 그룹 수준 오류인 gk-FWER 및 gFDP를 제어합니다. 비직교 일반 설계의 경우, 가우시안 근사와 몬테카를로 보정을 기반으로 하는 데이터 기반 시퀀스를 제공하여, 볼록성을 유지하고 확장성을 확보합니다. 희소, 상관 관계, 그룹 구조를 갖는 다양한 환경에서 광범위한 시뮬레이션을 수행했습니다. 실험 결과는 제안된 방법이 명목적인 오류 제어를 달성하며, 다른 단계별 절차보다 훨씬 높은 검정력을 제공한다는 이론적 결과를 뒷받침하며, 이러한 이론적 발전의 실질적인 가치를 확인시켜줍니다.

Original Abstract

High-dimensional feature selection is routinely required to balance statistical power with strict control of multiple-error metrics such as the k-Family-Wise Error Rate (k-FWER) and the False Discovery Proportion (FDP), yet some existing frameworks are confined to the narrower goal of controlling the expected False Discovery Rate (FDR) and can not exploit the group-structure of the covariates, such as Sorted L-One Penalized Estimation (SLOPE). We introduce the Group Stepdown SLOPE, a unified optimization procedure which is capable of embedding the Lehmann-Romano stepdown rules into SLOPE to achieve finite-sample guarantees under k-FWER and FDP thresholds. Specifically, we derive closed-form regularization sequences under orthogonal designs that provably bound k-FWER and FDP at user-specified levels, and extend these results to grouped settings via gk-SLOPE and gF-SLOPE, which control the analogous group-level errors gk-FWER and gFDP. For non-orthogonal general designs, we provide a calibrated data-driven sequence inspired by Gaussian approximation and Monte-Carlo correction, preserving convexity and scalability. Extensive simulations are conducted across sparse, correlated, and group-structured regimes. Empirical results corroborate our theoretical findings that the proposed methods achieve nominal error control, while yielding markedly higher power than competing stepdown procedures, thereby confirming the practical value of the theoretical advances.

0 Citations

0 Influential

1.5 Altmetric

7.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!