A systematic review of recurrent neural network adoption in missing data imputation

Nur Aqilah, Fadzil Akbar and Mohd Izham, Mohd Jaya and Mohd Faizal, Ab Razak and Nurul Aqilah, Zamri (2025) A systematic review of recurrent neural network adoption in missing data imputation. International Journal of Computing and Digital Systems, 17 (1571041166). pp. 1-17. ISSN 2210-142X. (Published)

[img]
Preview
Pdf
1571041166.pdf - Published Version

Download (247kB) | Preview

Abstract

Missing data is a pervasive challenge in diverse datasets accross various domains. It is often resulting from human error, system faults, and respondent non-response. Failing to address missing data can lead to inaccurate results during data analysis, as incomplete data sequences introduce biases and compromise the distribution of the synthesized data, and cause a negative impact on the decision-making process. Over the past decade, deep learning methods, particularly Recurrent Neural Network (RNN), have been employed to tackle the problem. This study aims to comprehensively evaluate recent RNN methods for missing data imputation, focusing on their strengths and weaknesses to provide a detailed understanding of the current landscape. A systematic literature review was conducted on RNN-based data imputation methods, covering research articles from 2013 to 2023 that were identified in the SCOPUS database. Out of 362 relevant studies, 70 were selected as primary articles. The findings highlight that Long Short-Term Memory (LSTM) is the most adopted RNN method for data imputation due to its adaptability in processing data of varying lengths as compared to Gated Recurrent Units (GRU) and other hybrid methods. Performance metrics such as Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Area Under the Receiver Operating Characteristic Curve (AU-ROC), Mean Squared Error (MSE), and Mean Relative Error (MRE) are commonly used to evaluate these models. Future development of a more robust RNN-based imputation methods that integrate optimization algorithms, such as Particle Swarm Optimization (PSO) and Stochastic Gradient Descent (SGD) will further enhance the imputation accuracy and reliability.

Item Type: Article
Uncontrolled Keywords: Systematic literature review; Missing values; Data imputation; Recurrent Neural Network (RNN); Data quality
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Faculty/Division: Faculty of Computing
Institute of Postgraduate Studies
Depositing User: TS. DR. MOHD IZHAM MOHD JAYA
Date Deposited: 26 Feb 2025 07:06
Last Modified: 26 Feb 2025 07:06
URI: http://umpir.ump.edu.my/id/eprint/43919
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item