The Research of Massive Data Analysis and Processing Based on Hadoop
- DOI
- 10.2991/icmmita-15.2015.54How to use a DOI?
- Keywords
- Hadoop; Massive Data; Data Processing
- Abstract
how to quickly extracted from these massive data out of the enterprise value of useful information has become the most vexing problems in the development of application software programmers encounter in the course. Based on the starting point of this issue, this paper analyzes the key technical foundation and other existing distributed storage and computing on the combination of Hadoop cluster technology research as well as their business needs and the actual hardware and software strength, we propose a massive Hadoop-based data processing model and data structure design in several ways, the program process organization and use programming techniques and other methods to introduce the development of the model, and finally applied to model the log data preprocessing large site.
- Copyright
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Julan Yi PY - 2015/11 DA - 2015/11 TI - The Research of Massive Data Analysis and Processing Based on Hadoop BT - Proceedings of the 2015 3rd International Conference on Machinery, Materials and Information Technology Applications PB - Atlantis Press SP - 273 EP - 277 SN - 2352-538X UR - https://doi.org/10.2991/icmmita-15.2015.54 DO - 10.2991/icmmita-15.2015.54 ID - Yi2015/11 ER -