A new single linkage robust clustering outlier detection procedures for multivariate data

Sharifah Sakinah, Syed Abd Mutalib and Siti Zanariah, Satari and Wan Nur Syahidah, Wan Yusoff (2023) A new single linkage robust clustering outlier detection procedures for multivariate data. Sains Malaysiana, 52 (8). pp. 2431-2451. ISSN 0126-6039. (Published)

[img] Pdf
A new single linkage robust clustering outlier detection.pdf
Restricted to Repository staff only

Download (1MB) | Request a copy
[img]
Preview
Pdf
A new single linkage robust clustering outlier detection procedures for multivariate data_ABS.pdf

Download (156kB) | Preview

Abstract

Outliers are abnormal data, and the detection of outliers in multivariate data has always been of interest. Unlike univariate data, outlier detection for multivariate data is insufficient with a visual inspection. In this study, we developed a new single linkage robust clustering outlier detection procedure for multivariate data. A robust estimator, Test on Covariance (TOC) is used to robustified the similarity distance measure, producing robust single linkage clustering. The performance of the new single linkage robust clustering outlier detection procedure is investigated via a simulation study using three outlier scenarios and historical multivariate datasets as illustrative examples. Three performance measures are used, which are pout, pmask, and pswamp. The performance of the new single linkage robust clustering procedure also compared with single linkage clustering using Euclidean and Mahalanobis distances as similarity distance measures as well as TOC. It is found that the new single linkage robust clustering procedure performs well in Outlier Scenario 3 when the mean and covariance matrix are shifted. The new procedure also performs well by successfully detecting all outliers, does not have masking effects in two out of five datasets and does not have swamping effect in all datasets. In conclusion, the new single linkage robust clustering outlier detection procedure is a practical and promising approach and good for simultaneously identifying multiple outliers in multivariate data.

Item Type: Article
Additional Information: Indexed by Scopus
Uncontrolled Keywords: Multivariate data; Outliers; Robust clustering; Single linkage clustering; Test on Covariance
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics
Faculty/Division: Institute of Postgraduate Studies
Center for Mathematical Science
Depositing User: Mr Muhamad Firdaus Janih@Jaini
Date Deposited: 28 May 2024 08:04
Last Modified: 28 May 2024 08:04
URI: http://umpir.ump.edu.my/id/eprint/40889
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item