Unsupervised Feature Selection Algorithm Based on Information Gain
- DOI
- 10.2991/acsr.k.191223.015How to use a DOI?
- Keywords
- unsupervised, feature selection, mutual information, information gain
- Abstract
Feature selection aims to select a smaller feature subset from the rate data which maintains the characteristics of the original data and has similar or better performance in data mining. traditional information theory often divides the relevance and redundancy of the features into consideration in unsupervised feature selection. This article proposes a supervised feature selection algorithm based on information gain analysis. this algorithm is to analyze the correlation between feature and original data and the redundancy between features and selected features based on the mutual information. The potential information gain of the feature is calculated for the feature sorting. At last, the feature is selected according to the gain penalty factor. The experimental results of multiple classifiers on multiple standard datasets show that the proposed algorithm achieves or better than the classification accuracy of the original data on the basis of effectively reducing the data dimension.
- Copyright
- © 2019, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Zhong Li AU - Yang Jing AU - Lijing Yao AU - Binbin Gan PY - 2019 DA - 2019/12/24 TI - Unsupervised Feature Selection Algorithm Based on Information Gain BT - Proceedings of the 2019 International Conference on Big Data, Electronics and Communication Engineering (BDECE 2019) PB - Atlantis Press SP - 63 EP - 67 SN - 2352-538X UR - https://doi.org/10.2991/acsr.k.191223.015 DO - 10.2991/acsr.k.191223.015 ID - Li2019 ER -