The Strategy of Classification Mining Based on Cloud Computing
- DOI
- 10.2991/ccis-13.2013.14
- Keywords
- Classification Mining; Cloud Computing; Parallel; SPRINT; MapReduce
- Abstract
Cloud computing provides cheap and efficient solutions for storing and analyzing massive data sets. Research on data mining strategies based on cloud computing is therefore important from both a theoretical and a practical point of view. This paper focuses on the strategy of classification mining in a cloud computing environment. First, cloud computing, Hadoop, the MapReduce programming model, classification mining, and the SPRINT algorithm are introduced. Then a parallelization of SPRINT for the cloud computing environment is designed, comprising an improved SPRINT algorithm and the procedure for implementing it on MapReduce. Finally, a Hadoop platform is built and an experiment is carried out to test the performance of the strategy and of the improved algorithm. The results show that the strategy designed in this paper can achieve higher efficiency when performing classification mining in a cloud computing environment.
- Copyright
- © 2013, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF
AU - Zhang Lijuan
AU - Zhao Shuguang
PY - 2013/11
DA - 2013/11
TI - The Strategy of Classification Mining Based on Cloud Computing
BT - Proceedings of the 1st International Workshop on Cloud Computing and Information Security
PB - Atlantis Press
SP - 57
EP - 60
SN - 1951-6851
UR - https://doi.org/10.2991/ccis-13.2013.14
DO - 10.2991/ccis-13.2013.14
ID - Lijuan2013/11
ER -