RobOMP: Robust variants of Orthogonal Matching Pursuit for sparse representations

Carlos A Loza

doi:10.7287/peerj.preprints.27482v1

RobOMP: Robust variants of Orthogonal Matching Pursuit for sparse representations

Carlos A Loza

Department of Mathematics, Universidad San Francisco de Quito, Quito, Ecuador

DOI: 10.7287/peerj.preprints.27482v1

Published: 2019-01-14
Accepted: 2019-01-14

Subject Areas: Computer Vision, Data Mining and Machine Learning
Keywords: M-Estimation, Matching Pursuit, Representation-based classifier, Robust classification, Sparse representation, Outliers

Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Preprints) and either DOI or URL of the article must be cited.

Cite this article: Loza CA. 2019. RobOMP: Robust variants of Orthogonal Matching Pursuit for sparse representations. PeerJ Preprints 7:e27482v1 https://doi.org/10.7287/peerj.preprints.27482v1

Abstract

Sparse coding aims to find a parsimonious representation of an example given an observation matrix or dictionary. In this regard, Orthogonal Matching Pursuit (OMP) provides an intuitive, simple and fast approximation of the optimal solution. However, its main building block is anchored on the minimization of the Mean Squared Error cost function (MSE). This approach is only optimal if the errors are distributed according to a Gaussian distribution without samples that strongly deviate from the main mode, i.e. outliers. If such assumption is violated, the sparse code will likely be biased and performance will degrade accordingly. In this paper, we introduce five robust variants of OMP (RobOMP) fully based on the theory of M-Estimators under a linear model. The proposed framework exploits efficient Iteratively Reweighted Least Squares (IRLS) techniques to mitigate the effect of outliers and emphasize the samples corresponding to the main mode of the data. This is done adaptively via a learned weight vector that models the distribution of the data in a robust manner. Experiments on synthetic data under several noise distributions and image recognition under different combinations of occlusion and missing pixels thoroughly detail the superiority of RobOMP over MSE-based approaches and similar robust alternatives. We also introduce a denoising framework based on robust, sparse and redundant representations that open the door to potential further applications of the proposed techniques. The five different variants of RobOMP do not require parameter tuning from the user and, hence, constitute principled alternatives to OMP.

Author Comment

This is a submission to PeerJ Computer Science for review.