Studying the Reduction Techniques for Mining Engineering Datasets

  • Mustafa Ali Abuzaraida
Keywords: Data mining, Data reduction, Engineering dataseT

Abstract

Over the world, companies often have huge datasets as data warehouses collection. The enormous size could make difficulty to analyze the data. The main reason, the complexity of data in terms of number of attributes and number of cases. To overcome this problem could be done by using a sufficient number of attributes and cases before mining this dataset. In Data Mining field, several methods could be used to reduce the attributes number and similar cases. This paper presents a study to test three reduction methods on engineering domain using five datasets. The three methods are: Genetic Algorithm (GA), Principal Component Analysis (PCA), and Johnson technique. The five datasets where obtained from UCI machine learning archive. The study examines which reduction method can be proper for datasets in Engineering field. It can be done by identifying the three reduction methods ranking based on percentage accuracy and number of selected attributes

Reduction Techniques for Mining Engineering Datasets
Published
2018-04-30
How to Cite
Abuzaraida, M. A. (2018). Studying the Reduction Techniques for Mining Engineering Datasets. University of Sindh Journal of Information and Communication Technology , 2(2), 100-104. Retrieved from https://sujo.usindh.edu.pk/index.php/USJICT/article/view/516

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.