Noraziah, Ahmad and Sharifah Hafizah, Syed Ahmad Ubaidillah and Noor Azida, Sahabudin (2023) A survey on potential reactive fault tolerance approach for distributed systems in big data. In: 2022 3rd International Conference on Computer Vision and Information Technology (CVIT 2022), 19-21 August 2022, Beijing, China. pp. 1-8, 12590 (1259009).
Abstract
Distributed systems are gaining popularity because of properties such as high availability and reliability. However, the rapid growth of Big Data in distributed systems creates new issues for dataset reliability and availability. In any distributed computer system, the occurrence and recurrence of failures is inescapable, and both hardware and software components are prone to failure. As a result, fault tolerance is recognized as a fundamental theme and an essential requirement for constructing and maintaining distributed computing systems. Fault tolerance refers to the ability of an application to continue executing under failure conditions by detecting and correcting the fault. Reactive fault tolerance techniques are used to troubleshoot a system effectively once a failure has occurred. This paper aims to provide a better understanding of reactive fault tolerance techniques and identifies various approaches used as reactive fault tolerance in distributed systems. Based on the reviews done in this research, various reactive fault tolerance techniques, such as replication, checkpointing, task resubmission, and job migration, can improve the performance of distributed systems in terms of availability, reliability, total execution time, and communication cost.
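As an illustration of two of the techniques named in the abstract, the following minimal sketch combines task resubmission with checkpointing. It is not taken from the paper: the names `TransientFailure`, `run_with_resubmission`, `sum_with_checkpoint`, and `demo` are hypothetical, the failure is simulated, and the checkpoint is an in-memory dictionary standing in for durable storage.

```python
class TransientFailure(Exception):
    """Hypothetical stand-in for a node or task failure."""


def run_with_resubmission(task, max_attempts=3):
    """Task resubmission: rerun a failed task, up to max_attempts times.

    Returns the task result together with the number of attempts used.
    """
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return task(), attempt
        except TransientFailure as err:
            last_error = err  # react to the failure by resubmitting
    raise RuntimeError(f"task failed after {max_attempts} attempts") from last_error


def sum_with_checkpoint(items, checkpoint, fail_at=None):
    """Checkpointing: resume from the last saved state instead of restarting.

    `checkpoint` is a plain dict here; a real system would persist it.
    `fail_at` injects a simulated failure before processing that index.
    """
    start = checkpoint.get("index", 0)
    total = checkpoint.get("total", 0)
    for i in range(start, len(items)):
        if fail_at is not None and i == fail_at:
            raise TransientFailure(f"simulated failure at item {i}")
        total += items[i]
        # Save progress after every processed item.
        checkpoint["index"] = i + 1
        checkpoint["total"] = total
    return total


def demo():
    """Fail once at index 3, then succeed on the resubmitted attempt."""
    checkpoint = {}
    failures = iter([3, None])  # first attempt fails, second does not

    def task():
        return sum_with_checkpoint([1, 2, 3, 4, 5], checkpoint, fail_at=next(failures))

    return run_with_resubmission(task)


if __name__ == "__main__":
    print(demo())
```

In this sketch the resubmitted attempt resumes at the fourth item rather than recomputing the first three, which is the kind of saving in total execution time that checkpointing is intended to provide.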
Item Type: | Conference or Workshop Item (Lecture) |
---|---|
Uncontrolled Keywords: | Reliability; Distributed system; Reactive fault tolerance; Computational intelligence |
Subjects: | Q Science > QA Mathematics > QA76 Computer software |
Faculty/Division: | Faculty of Computer System And Software Engineering; Institute of Postgraduate Studies |
Depositing User: | PM Dr. Noraziah Ahmad |
Date Deposited: | 24 Aug 2023 03:23 |
Last Modified: | 24 Aug 2023 03:27 |
URI: | http://umpir.ump.edu.my/id/eprint/35156 |