Large Covariance Matrix Estimation With Oracle Statistical Rate via Majorization-Minimization

The \boldsymbol{\ell}_{\mathbf{1}} penalized covariance estimator has been widely used for estimating large sparse covariance matrices. It is recognized that \boldsymbol{\ell}_{\mathbf{1}} penalty introduces a non-negligible estimation bias, while a proper utilization of non-convex penalty may lead...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on signal processing 2023, Vol.71, p.3328-3342
Hauptverfasser: Wei, Quan, Zhao, Ziping
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The \boldsymbol{\ell}_{\mathbf{1}} penalized covariance estimator has been widely used for estimating large sparse covariance matrices. It is recognized that \boldsymbol{\ell}_{\mathbf{1}} penalty introduces a non-negligible estimation bias, while a proper utilization of non-convex penalty may lead to an estimator with a refined statistical rate of convergence. To eliminate the estimation bias, in this paper we propose to estimate large sparse covariance matrices using the non-convex penalty. It is challenging to analyze the theoretical properties of the resulting estimator because popular iterative algorithms for convex optimization no longer have global convergence guarantees for non-convex optimization. To tackle this issue, an efficient algorithm based on the majorization-minimization (MM) framework is developed by solving a sequence of convex relaxation subproblems. An approximation solution to each subproblem is obtained via the proximal gradient method with a linear convergence rate. We clearly establish the statistical properties of all the approximate solutions generated by the MM-based algorithm and prove that the proposed estimator achieves the oracle statistical rate in the Frobenius norm under weak technical assumptions. We also consider a modification of the proposed estimation method using the correlation matrix and show that the modified correlation-based covariance estimator enjoys a better rate in the spectral norm. Our theoretical findings are corroborated through extensive numerical experiments on both synthetic data and real-world datasets.
ISSN:1053-587X
1941-0476
DOI:10.1109/TSP.2023.3311523