Data Pre-processing Techniques in Data Mining: A Review

Authors

  • Pankaj Saraswat Assistant Professor, Department of Computer Science Engineering, Sanskriti University, Mathura, Uttar Pradesh Author
  • Swapnil Raj Assistant Professor, Department of Computer Science Engineering, Sanskriti University, Mathura, Uttar Pradesh Author

DOI:

https://doi.org/10.55524/

Keywords:

Data mining, Data preprocessing, Dataset pattern, Dataset, KDD, Knowledge Discovery

Abstract

Data mining is the process of finding  interesting patterns and models from massive datasets. In  the field of natural and physical sciences, data collection,  management, and analysis have evolved as the most  trustworthy source of information and emergence of new  findings, information, and products. The development of  the most effective procedures in statistical circumstances  has therefore become standard practice in the academic and industry sectors. Under actual situations, dealing with  enormous datasets, there are bound to be discrepancies  and abnormalities of many types that prohibit us from  knowing the true results of realistic issues. These  concepts and trends are helpful in decision-making  situations. The quality of the data is the most important  factor in data mining. For efficient information mining,  computer-based data pre-processing approaches provide  methods that assist the data under processing in  conforming to conventional structures, hence  significantly improving the efficiency of  computer algorithms. 

Downloads

Download data is not yet available.

References

Storti E, Cattaneo L, Polenghi A, Fumagalli L. Customized knowledge discovery in databases methodology for the control of assembly systems. Machines. 2018;

Guarascio M, Manco G, Ritacco E. Knowledge discovery in databases. In: Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics. 2018.

Methods DP. Data Preprocessing Techniques for Data Mining. Science (80- ). 2011;

Anynomous. Data Preprocessing Techniques for Data Mining. Science (80- ). 2011;

Mohit Sharma. What Steps should one take while doing Data Preprocessing? Hackernoon. 2018.

Nguyen PM, Haghverdi A, de Pue J, Botula YD, Le K V., Waegeman W, et al. Comparison of statistical regression and data-mining techniques in estimating soil water retention of tropical delta soils. Biosyst Eng. 2017;

Deshmukh MA, Gulhane RA. Importance of Clustering in Data Mining. Int J Sci Eng Res. 2016;

Zhang SZ, Qu XK, Sun J Bin. Data integration and mining based on web big data. Int J Multimed Ubiquitous Eng. 2015;

Alasadi SA, Bhaya WS. Review of data preprocessing techniques in data mining. J Eng Appl Sci. 2017; [10] Gama J, Pinto C. Discretization from data streams: Applications to histograms and data mining. In: Proceedings of the ACM Symposium on Applied Computing. 2006.

Downloads

Published

2022-01-30

How to Cite

Data Pre-processing Techniques in Data Mining: A Review . (2022). International Journal of Innovative Research in Computer Science & Technology, 10(1), 122–125. https://doi.org/10.55524/