UMP Institutional Repository

Detecting Duplicate Entry in Email Field using Alliance Rules-based Algorithm

Arif, Hanafi and Sulaiman, Harun and Enggari, Sofika and Rani, Larissa Navia (2016) Detecting Duplicate Entry in Email Field using Alliance Rules-based Algorithm. Journal of Computer Science and Information Technology, 1 (1). pp. 71-81.

[img] PDF
Detecting Duplicate Entry in Email Field using Alliance Rules-based Algorithm.pdf
Restricted to Repository staff only

Download (383kB) | Request a copy
[img]
Preview
PDF
fskkp-2016-arif-Detecting Duplicate Entry in Email Field1.pdf

Download (40kB) | Preview

Abstract

The way that email has extraordinary significance in present day business communication is certain. Consistently, a bulk of emails is sent from organizations to clients and suppliers, from representatives to their managers and starting with one colleague then onto the next. In this way there is vast of email in data warehouse. Data cleaning is an activity performed on the data sets of data warehouse to upgrade and keep up the quality and consistency of the data. This paper underlines the issues related with dirty data, detection of duplicatein email column. The paper identifies the strategy of data cleaning from adifferent point of view. It provides an algorithm to the discovery of error and duplicates entries in the data sets of existing data warehouse. The paper characterizes the alliance rules based on the concept of mathematical association rules to determine the duplicate entries in email column in data sets.

Item Type: Article
Uncontrolled Keywords: Datacleaning, Algorithm, Alliance rule, Duplication.
Subjects: Q Science > QA Mathematics > QA76 Computer software
Faculty/Division: Faculty of Computer System And Software Engineering
Depositing User: Mrs. Neng Sury Sulaiman
Date Deposited: 11 Nov 2015 01:23
Last Modified: 22 Jul 2016 02:55
URI: http://umpir.ump.edu.my/id/eprint/11101
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item