Chinese Tourism Information Search Platform based on Cloud Computing
Authors
Huan Zhao, Xi Chen
Corresponding Author
Huan Zhao
Available Online March 2015.
- DOI
- 10.2991/iiicec-15.2015.273How to use a DOI?
- Keywords
- Nutch; Hadoop; Solr; Chinese word segmentation; cloud computing
- Abstract
Nutch, Solr and Hadoop, three of them are open source applications, which Nutch is a superb web crawler, Hadoop is a cloud platform and Solr can use crawled data and offer word class searching. Nutch only provides one mechanism which segments Chinese sentences into some single characters so that Chinese word cannot be analyzed and processed. This paper proposes a method of Chinese word segmentation in Solr and builds a high performance distributed search engines by integrating Nutch into Hadoop, and finally use Solr to build tourist information search platform.
- Copyright
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Huan Zhao AU - Xi Chen PY - 2015/03 DA - 2015/03 TI - Chinese Tourism Information Search Platform based on Cloud Computing BT - Proceedings of the 2015 International Industrial Informatics and Computer Engineering Conference PB - Atlantis Press SP - 1236 EP - 1240 SN - 2352-538X UR - https://doi.org/10.2991/iiicec-15.2015.273 DO - 10.2991/iiicec-15.2015.273 ID - Zhao2015/03 ER -