A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language

Nur Syahirah, Zulkipli and Siti Zanariah, Satari and Wan Nur Syahidah, Wan Yusoff (2021) A synthetic data generation procedure for univariate circular data with various outliers scenarios using Python programming language. In: 28th Simposium Kebangsaan Sains Matematik, SKSM 2021 , 28-29 July 2021 , Kuantan, Pahang, Virtual. pp. 1-10., 1988 (012111). ISSN 1742-6588

[img]
Preview
Pdf
A synthetic data generation procedure for univariate circular data with various outliers scenarios.pdf
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

Synthetic data is artificial data that is created based on the statistical properties of the original data. The aim of this study is to generate a synthetic or simulated data for univariate circular data that follow von Mises (VM) distribution with various outliers scenario using Python programming language. The procedure of formulation a synthetic data generation is proposed in this study. The synthetic data is generated from various combinations of seven sample size, n and five concentration parameters, K. Moreover, a synthetic data will be generated by formulating a data generation procedure with different condition of outliers scenarios. Three outliers scenarios are proposed in this study to introduce the outliers in synthetic dataset by placing them away from inliers at a specific distance. The number of outliers planted in the dataset are fixed with three outliers. The synthetic data is randomly generated by using Python library and package which are 'numpy', 'random' and von Mises'. In conclusion, the synthetic data of univariate circular data from von Mises distribution is generated and the outliers are successfully introduced in the dataset with three outliers scenarios using Python. This study will be valuable for those who are interested to study univariate circular data with outliers and choose Python as an analysis tool.

Item Type: Conference or Workshop Item (Lecture)
Additional Information: Indexed by Scopus
Uncontrolled Keywords: Analysis tools; Artificial data; Data generation; Python programming language; Statistical properties; Synthetic data; Synthetic data generations; Von Mises distribution
Subjects: Q Science > Q Science (General)
Q Science > QA Mathematics
Faculty/Division: Institute of Postgraduate Studies
Center for Mathematical Science
Depositing User: Mr Muhamad Firdaus Janih@Jaini
Date Deposited: 17 Oct 2022 05:11
Last Modified: 17 Oct 2022 05:11
URI: http://umpir.ump.edu.my/id/eprint/35201
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item