Affiliation:
1. Department of Statistical Science Duke University Durham North Carolina USA
2. School of Data Science The Chinese University of Hong Kong Shenzhen China
3. H. Milton Stewart School of Industrial and Systems Engineering (ISyE) Georgia Institute of Technology Atlanta Georgia USA
Abstract
AbstractRobust principal component analysis (RPCA) is a widely used method for recovering low‐rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low‐rank background is critical for process monitoring and diagnosis. However, existing RPCA methods and their extensions largely do not account for the underlying probabilistic distribution for the data matrices, which in many applications are known and can be highly non‐Gaussian. We thus propose a new method called RPCA for exponential family distributions (), which can perform the desired decomposition into low‐rank and sparse matrices when such a distribution falls within the exponential family. We present a novel alternating direction method of multiplier optimization algorithm for efficient decomposition, under either its natural or canonical parametrization. The effectiveness of is then demonstrated in two applications: the first for steel sheet defect detection and the second for crime activity monitoring in the Atlanta metropolitan area.
Funder
U.S. Department of Energy
National Science Foundation of Sri Lanka
Reference43 articles.
1. Robust PCA via Principal Component Pursuit: A review for a comparative evaluation in video surveillance
2. Distributed optimization and statistical learning via the alternating direction method of multipliers;Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein S.;Found. Trends Mach. Learn.,2011
3. Convex Optimization
4. A Singular Value Thresholding Algorithm for Matrix Completion