Fuzzy Optimization Multi-objective Clustering Ensemble Model for Multi-source Data Analysis
- DOI
- 10.2991/asum.k.210827.017How to use a DOI?
- Keywords
- Clustering ensemble, multi-source, multi-objective, fuzzy clustering
- Abstract
In modern data analysis, multi-source data appears more and more in real applications. Different data sources provide information about different data. Therefore, multi-source data linking is important to improve the processing performance. However, in practice multi-source data is often heterogeneous, uncertain, and large. This issue is considered a major challenge from multi-source data. Ensemble is a universal machine learning model in which learning techniques can work in parallel, with big data. Clustering ensemble has been shown to outperform any standard clustering algorithm in terms of accuracy and robustness. However, most of the traditional clustering ensemble approaches are based on single-objective function and single-source data. In this paper, we propose a new clustering ensemble method for multi-source data analysis. We call the fuzzy optimized multi-objective clustering ensemble method - FOMOCE. Firstly, a clustering ensemble mathematical model based on the structure of multi-objective clustering function, multi-source data, and dark knowledge is introduced. Then, rules for extracting dark knowledge from the input data, clustering algorithms, and base clusterings are designed and applied. Finally, a clustering ensemble algorithm is proposed for multi-source data analysis. Experiments were performed on benchmark data sets. The experimental results demonstrate the superior performance of the FOMOCE method compared with the existing clustering ensemble methods and multi-source clustering methods.
- Copyright
- © 2021, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Le Thi Cam Binh AU - Pham Van Nha AU - Ngo Thanh Long PY - 2021 DA - 2021/08/30 TI - Fuzzy Optimization Multi-objective Clustering Ensemble Model for Multi-source Data Analysis BT - Joint Proceedings of the 19th World Congress of the International Fuzzy Systems Association (IFSA), the 12th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT), and the 11th International Summer School on Aggregation Operators (AGOP) PB - Atlantis Press SP - 125 EP - 133 SN - 2589-6644 UR - https://doi.org/10.2991/asum.k.210827.017 DO - 10.2991/asum.k.210827.017 ID - Binh2021 ER -