
Stat Colloquium [Virtual]: Dr. Zhao Ren

University of Pittsburgh

Location

Online

Date & Time

April 4, 2025, 11:00 am - 12:00 pm

Description

Title: Sparse Heteroskedastic PCA in High Dimensions 

Abstract: Principal component analysis (PCA) is one of the most commonly used techniques for dimension reduction and feature extraction. Although sparse PCA in high dimensions has been well studied, little is known about the case where the noise is heteroskedastic, which turns out to be ubiquitous in many scenarios. We propose an iterative algorithm, called SparseHPCA, for the sparse PCA problem in the presence of heteroskedastic noise. It alternates between two steps: one updates the estimates of the sparse eigenvectors using orthogonal iteration with adaptive thresholding, and the other imputes the diagonal values of the sample covariance matrix to reduce the estimation bias due to heteroskedastic noise. Our procedure is computationally fast and provably optimal under the generalized spiked covariance model, assuming the leading eigenvectors are sparse. A comprehensive simulation study shows its robustness and effectiveness under various settings. The application of our new method to two high-dimensional genomics datasets, i.e., microarray and single-cell RNA sequencing (scRNA-seq) data, demonstrates its ability to preserve inherent cluster structures in downstream analyses. Additionally, we extend SparseHPCA to address the sparse singular value decomposition (sparse SVD) problem in the presence of heteroskedastic noise, further showcasing its versatility.
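
Illustration (not from the speaker): the two alternating steps described in the abstract can be sketched roughly as below. This is only a toy sketch under stated assumptions, not the SparseHPCA procedure itself. The function names, the fixed hard threshold `tau` (the abstract describes adaptive thresholding, which is not reproduced here), the diagonal-deleted initialization, and the rank-r imputation rule are all assumptions made for illustration.

```python
import numpy as np


def hard_threshold(M, tau):
    """Zero out entries with magnitude below tau (fixed threshold; an assumption)."""
    return np.where(np.abs(M) >= tau, M, 0.0)


def sparse_hpca_sketch(S, r, tau, n_iter=50):
    """Toy alternating scheme on a p x p sample covariance matrix S.

    Hypothetical helper for illustration only, not the published algorithm.
    """
    p = S.shape[0]
    off_diag = ~np.eye(p, dtype=bool)
    # Start from the diagonal-deleted matrix, since the diagonal carries
    # the heteroskedastic noise variances.
    G = np.where(off_diag, S, 0.0)
    _, vecs = np.linalg.eigh(G)      # eigenvalues in ascending order
    U = vecs[:, -r:]                 # r leading eigenvectors as the initial estimate
    for _ in range(n_iter):
        # Step 1: one orthogonal-iteration update with thresholding.
        T = hard_threshold(G @ U, tau)
        U, _ = np.linalg.qr(T)
        # Step 2: impute the diagonal from the current rank-r approximation.
        L = U @ (U.T @ G @ U) @ U.T
        G = np.where(off_diag, S, L)
    return U


# Small synthetic check: one sparse spike plus heteroskedastic noise.
rng = np.random.default_rng(0)
n, p, s = 2000, 50, 5
v = np.zeros(p)
v[:s] = 1.0 / np.sqrt(s)                     # sparse leading eigenvector
noise_var = np.linspace(0.5, 3.0, p)         # unequal noise variances
z = rng.standard_normal(n)
X = np.sqrt(10.0) * np.outer(z, v) + rng.standard_normal((n, p)) * np.sqrt(noise_var)
S = X.T @ X / n                              # data are mean-zero by construction

U_hat = sparse_hpca_sketch(S, r=1, tau=0.5)
alignment = abs(v @ U_hat[:, 0])             # 1.0 would mean perfect recovery up to sign
print(f"alignment with true eigenvector: {alignment:.3f}")
```

In this sketch the diagonal is re-estimated from the low-rank fit rather than taken from the sample covariance, which is one simple way to keep unequal noise variances from distorting the estimated eigenvector.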

Short bio: Dr. Zhao Ren is an Associate Professor in the Department of Statistics at the University of Pittsburgh. Dr. Ren obtained his Ph.D. in Statistics at Yale University in 2014. His research interests are in high-dimensional statistical inference, robust inference, graphical models, nonparametric function estimation, and applications in statistical genomics.