International Journal of Computational Intelligence Systems

Volume 13, Issue 1, 2020, Pages 479 - 487

Online Streaming Feature Selection via Multi-Conditional Independence and Mutual Information Entropy†

Authors
Hongyi Wang1, Dianlong You2, *
1Department of Commerce and Trade, Qinhuangdao Vocational and Technical College, No. 90, Lianfeng North Road, Beidaihe District, Qinhuangdao, Hebei 066101 China
2School of Information Science and Engineering, Yanshan University, No. 438, West Section of Hebei Street, Qinhuangdao, Hebei 066004, China
*Corresponding author. Email: youdianlong@sina.com
Corresponding Author
Dianlong You
Received 14 January 2020, Accepted 20 April 2020, Available Online 6 May 2020.
DOI
10.2991/ijcis.d.200423.002How to use a DOI?
Keywords
Streaming feature; Feature selection; Conditional independence; Mutual information
Abstract

The goals of feature selection are to remove redundant and irrelevant features from high-dimensional data, extract the “optimal feature subset” of the original feature space to improve the classification accuracy, and reduce the time complexity. Traditional feature selection algorithms are based on static feature spaces that are difficult to apply in dynamic streaming data environments. Existing works, such as Alpha-investing and Online Streaming Feature Selection (OSFS), and Scalable and Accurate OnLine Approach (SAOLA), have been proposed to serve the feature selection with streaming feature, but they have drawbacks, including low prediction accuracy and a large number of selected features if the streaming features exhibit characteristics such as low redundancy and high relevance. To address the limitations of the abovementioned works, we propose the algorithm of Online Streaming Feature Selection via Conditional dependence and Mutual information (OSFSCM) for streaming feature, which is found to be superior to Alpha-investing and OSFS for datasets with low redundancy and high relevance. The efficiency of the proposed OSFSCM algorithm is validated through a performance test on widely used datasets, e.g., NIPS 2003 and Causality Workbench. Through extensive experimental results, we demonstrate that OSFSCM significantly improves the prediction accuracy and requires fewer selected features compared with Alpha-investing and OSFS.

Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Computational Intelligence Systems
Volume-Issue
13 - 1
Pages
479 - 487
Publication Date
2020/05/06
ISSN (Online)
1875-6883
ISSN (Print)
1875-6891
DOI
10.2991/ijcis.d.200423.002How to use a DOI?
Copyright
© 2020 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Hongyi Wang
AU  - Dianlong You
PY  - 2020
DA  - 2020/05/06
TI  - Online Streaming Feature Selection via Multi-Conditional Independence and Mutual Information Entropy†
JO  - International Journal of Computational Intelligence Systems
SP  - 479
EP  - 487
VL  - 13
IS  - 1
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.d.200423.002
DO  - 10.2991/ijcis.d.200423.002
ID  - Wang2020
ER  -