Proceedings of the 2019 International Conference on Big Data, Electronics and Communication Engineering (BDECE 2019)

Unsupervised Feature Selection Algorithm Based on Information Gain

Authors
Zhong Li, Yang Jing, Lijing Yao, Binbin Gan
Corresponding Author
Yang Jing
Available Online 24 December 2019.
DOI
10.2991/acsr.k.191223.015How to use a DOI?
Keywords
unsupervised, feature selection, mutual information, information gain
Abstract

Feature selection aims to select a smaller feature subset from the rate data which maintains the characteristics of the original data and has similar or better performance in data mining. traditional information theory often divides the relevance and redundancy of the features into consideration in unsupervised feature selection. This article proposes a supervised feature selection algorithm based on information gain analysis. this algorithm is to analyze the correlation between feature and original data and the redundancy between features and selected features based on the mutual information. The potential information gain of the feature is calculated for the feature sorting. At last, the feature is selected according to the gain penalty factor. The experimental results of multiple classifiers on multiple standard datasets show that the proposed algorithm achieves or better than the classification accuracy of the original data on the basis of effectively reducing the data dimension.

Copyright
© 2019, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2019 International Conference on Big Data, Electronics and Communication Engineering (BDECE 2019)
Series
Advances in Computer Science Research
Publication Date
24 December 2019
ISBN
978-94-6252-873-4
ISSN
2352-538X
DOI
10.2991/acsr.k.191223.015How to use a DOI?
Copyright
© 2019, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Zhong Li
AU  - Yang Jing
AU  - Lijing Yao
AU  - Binbin Gan
PY  - 2019
DA  - 2019/12/24
TI  - Unsupervised Feature Selection Algorithm Based on Information Gain
BT  - Proceedings of the 2019 International Conference on Big Data, Electronics and Communication Engineering (BDECE 2019)
PB  - Atlantis Press
SP  - 63
EP  - 67
SN  - 2352-538X
UR  - https://doi.org/10.2991/acsr.k.191223.015
DO  - 10.2991/acsr.k.191223.015
ID  - Li2019
ER  -