Comparison of Robust Estimators’ Performance for Detecting Outliers in Multivariate Data

Sharifah Sakinah, Syed Abd Mutalib and Siti Zanariah, Satari and Wan Nur Syahidah, Wan Yusoff (2021) Comparison of Robust Estimators’ Performance for Detecting Outliers in Multivariate Data. Journal of Statistical Modeling and Analytics, 3 (3). pp. 36-64. ISSN 2180-3102. (Published)

Abstract

In multivariate data, outliers are difficult to detect, especially as the dimension of the data increases. Mahalanobis distance (MD) is one of the classical methods for detecting outliers in multivariate data. However, the classical mean and covariance matrix used in MD suffer from masking and swamping effects when the data contain outliers. Because of this problem, many studies use a robust estimator in place of the classical estimator of the mean and covariance matrix. In this study, the performance of five robust estimators, namely Fast Minimum Covariance Determinant (FMCD), Minimum Vector Variance (MVV), Covariance Matrix Equality (CME), Index Set Equality (ISE), and Test on Covariance (TOC), is investigated and compared. FMCD has been widely used and is known as one of the best robust estimators. However, FMCD still falls short under certain conditions. MVV, CME, ISE and TOC are innovations based on FMCD: these four robust estimators improve the last step of the FMCD algorithm. Hence, the objective of this study is to observe the performance of these five estimators in detecting outliers in multivariate data, particularly TOC, as it is the latest of them. Simulation studies are conducted for two outlier scenarios under various conditions. Three performance measures, pout, pmask and pswamp, are used to assess the robust estimators. It is found that TOC gives better performance in pswamp for most conditions, and better results for pout and pmask for certain conditions.
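
The masking effect described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code and does not implement FMCD, MVV, CME, ISE or TOC. It assumes NumPy is available and makes several illustrative choices that do not come from the paper: a toy two-dimensional data set, a single-start concentration-step iteration (the core idea behind MCD-type estimators) started from the points nearest the coordinate-wise median, a chi-square cutoff at the 0.975 level, and the helper names `mahalanobis_sq`, `c_step_estimate`, `flag` and `detection_rates`. The three returned rates are one plausible per-data-set reading of pout, pmask and pswamp; the abstract itself does not define them.

```python
import math
import numpy as np


def mahalanobis_sq(X, center, cov):
    """Squared Mahalanobis distance of each row of X from center."""
    diff = X - center
    # Solve cov @ z = diff.T rather than forming the explicit inverse.
    return np.einsum("ij,ji->i", diff, np.linalg.solve(cov, diff.T))


def c_step_estimate(X, max_iter=100):
    """Single-start concentration steps (a simplification, not the full FMCD)."""
    n, p = X.shape
    h = (n + p + 1) // 2  # subset size commonly used by MCD-type estimators
    # Assumed deterministic start: the h points nearest the coordinate-wise median.
    start = np.sum((X - np.median(X, axis=0)) ** 2, axis=1)
    subset = np.sort(np.argsort(start, kind="stable")[:h])
    for _ in range(max_iter):
        center = X[subset].mean(axis=0)
        cov = np.cov(X[subset], rowvar=False)
        d2 = mahalanobis_sq(X, center, cov)
        new_subset = np.sort(np.argsort(d2, kind="stable")[:h])
        if np.array_equal(new_subset, subset):
            break  # the subset no longer changes
        subset = new_subset
    return center, cov


def flag(X, center, cov, cutoff):
    """Indices of observations whose squared distance exceeds the cutoff."""
    return set(np.flatnonzero(mahalanobis_sq(X, center, cov) > cutoff).tolist())


def detection_rates(flagged, true_outliers, n):
    """Assumed per-data-set analogues of pout, pmask and pswamp."""
    missed = true_outliers - flagged        # outliers not detected (masking)
    false_alarms = flagged - true_outliers  # clean points flagged (swamping)
    all_found = not missed
    mask_rate = len(missed) / len(true_outliers)
    swamp_rate = len(false_alarms) / (n - len(true_outliers))
    return all_found, mask_rate, swamp_rate


# Toy data: a 5 x 5 grid of clean points plus a tight cluster of planted outliers.
grid = np.array([(x, y) for x in range(-2, 3) for y in range(-2, 3)], dtype=float)
planted = np.array([(20, 20), (21, 20), (20, 21), (21, 21), (20.5, 20.5)])
X = np.vstack([grid, planted])
true_outliers = set(range(len(grid), len(X)))

# Chi-square 0.975 quantile; this closed form holds only for p = 2.
# For general p, scipy.stats.chi2.ppf could be used instead.
cutoff = -2.0 * math.log(1 - 0.975)

classical = flag(X, X.mean(axis=0), np.cov(X, rowvar=False), cutoff)
robust_center, robust_cov = c_step_estimate(X)
robust = flag(X, robust_center, robust_cov, cutoff)

classical_rates = detection_rates(classical, true_outliers, len(X))
robust_rates = detection_rates(robust, true_outliers, len(X))
print("classical:", classical_rates)
print("robust:   ", robust_rates)
```

On this toy data the clustered outliers inflate the classical covariance matrix enough that none of them exceeds the cutoff, which is the masking effect. The concentration-step estimate flags all five planted points, but because the raw subset covariance is not rescaled for consistency it also flags one clean corner point, a small instance of swamping.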

Item Type: Article
Uncontrolled Keywords: Mahalanobis distance, Multivariate data, Outliers, Robust Estimators, Test on Covariance
Subjects: Q Science > QA Mathematics
Faculty/Division: Center for Mathematical Science
Institute of Postgraduate Studies
Depositing User: Ms. Siti Zanariah Satari
Date Deposited: 28 Oct 2021 08:21
Last Modified: 28 Oct 2021 08:21
URI: http://umpir.ump.edu.my/id/eprint/32427