Data Pre-processing Techniques in Data Mining: A Review
DOI:
https://doi.org/10.55524/
Keywords:
Data mining, Data preprocessing, Dataset pattern, Dataset, KDD, Knowledge Discovery
Abstract
Data mining is the process of finding interesting patterns and models in massive datasets; these patterns and trends support decision-making. In the natural and physical sciences, data collection, management, and analysis have become the most trusted source of new findings, information, and products. Developing effective statistical procedures has therefore become standard practice in both academia and industry. In practice, enormous datasets are bound to contain discrepancies and anomalies of many kinds, which prevent us from seeing the true results for real-world problems. Data quality is therefore the most important factor in data mining. Computer-based data pre-processing approaches provide methods that help bring the data under processing into conventional structures, significantly improving the efficiency of the mining algorithms.
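As a rough illustration of what bringing data into a conventional structure can look like, the sketch below applies three common pre-processing steps to a small numeric table: removing exact duplicate records, filling missing values with the column mean, and rescaling each column to the range [0, 1]. The function name `preprocess`, the use of `None` to mark missing values, and the choice of these three steps are assumptions made for illustration; the abstract does not prescribe a specific pipeline.

```python
from statistics import mean


def preprocess(rows):
    """Illustrative cleaning pipeline (hypothetical helper, not from the paper).

    rows: list of equal-length records holding numbers or None (missing).
    Returns a new list of records with duplicates removed, missing values
    imputed by the column mean, and every column min-max scaled to [0, 1].
    """
    # Step 1: drop exact duplicate records, keeping first occurrences in order.
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)
        if key not in seen:
            seen.add(key)
            unique.append(list(row))
    if not unique:
        return []

    n_cols = len(unique[0])
    for j in range(n_cols):
        # Step 2: impute missing entries with the mean of the observed values
        # (falls back to 0.0 if the whole column is missing).
        observed = [r[j] for r in unique if r[j] is not None]
        fill = mean(observed) if observed else 0.0
        for r in unique:
            if r[j] is None:
                r[j] = fill

        # Step 3: min-max scaling; a constant column maps to 0.0.
        col = [r[j] for r in unique]
        lo, hi = min(col), max(col)
        span = hi - lo
        for r in unique:
            r[j] = (r[j] - lo) / span if span else 0.0
    return unique


raw = [[1.0, 10.0], [1.0, 10.0], [3.0, None], [5.0, 30.0]]
cleaned = preprocess(raw)
print(cleaned)
```

Mean imputation and min-max scaling are only one possible choice here; other strategies (median imputation, z-score standardization, discretization) may suit a given dataset better.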