Large Covariance Matrix Estimation With Oracle Statistical Rate via Majorization-Minimization

The \boldsymbol{\ell}_{\mathbf{1}} penalized covariance estimator has been widely used for estimating large sparse covariance matrices. It is recognized that \boldsymbol{\ell}_{\mathbf{1}} penalty introduces a non-negligible estimation bias, while a proper utilization of non-convex penalty may lead...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on signal processing 2023, Vol.71, p.3328-3342
Hauptverfasser:	Wei, Quan, Zhao, Ziping
Format:	Artikel
Sprache:	eng
Schlagworte:	Bias Convergence Convex analysis Convexity Correlation Correlation analysis Covariance estimation Covariance matrices Covariance matrix Estimation Iterative algorithms Iterative methods majorization-minimization non-convex statistical optimization Optimization positive definiteness Signal processing algorithms Sparse matrices sparsity Symmetric matrices Synthetic data
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The \boldsymbol{\ell}_{\mathbf{1}} penalized covariance estimator has been widely used for estimating large sparse covariance matrices. It is recognized that \boldsymbol{\ell}_{\mathbf{1}} penalty introduces a non-negligible estimation bias, while a proper utilization of non-convex penalty may lead to an estimator with a refined statistical rate of convergence. To eliminate the estimation bias, in this paper we propose to estimate large sparse covariance matrices using the non-convex penalty. It is challenging to analyze the theoretical properties of the resulting estimator because popular iterative algorithms for convex optimization no longer have global convergence guarantees for non-convex optimization. To tackle this issue, an efficient algorithm based on the majorization-minimization (MM) framework is developed by solving a sequence of convex relaxation subproblems. An approximation solution to each subproblem is obtained via the proximal gradient method with a linear convergence rate. We clearly establish the statistical properties of all the approximate solutions generated by the MM-based algorithm and prove that the proposed estimator achieves the oracle statistical rate in the Frobenius norm under weak technical assumptions. We also consider a modification of the proposed estimation method using the correlation matrix and show that the modified correlation-based covariance estimator enjoys a better rate in the spectral norm. Our theoretical findings are corroborated through extensive numerical experiments on both synthetic data and real-world datasets.
ISSN:	1053-587X 1941-0476
DOI:	10.1109/TSP.2023.3311523