The Design of Model for Tibetan Language Search System
- DOI
- 10.2991/cmfe-15.2015.167
- Keywords
- Tibetan; word segmentation; indexing; URL database; encoding conversion
- Abstract
In this paper, a prototype of a Tibetan language search system is built, and solutions to the key issues of the model are proposed. The characteristics of Tibetan-language web pages are analyzed and extracted. Web page encodings are converted to standard Unicode, which permits better recognition of Tibetan words in web pages and significantly improves the efficiency of searching for information in Tibetan. Both occurrence probability and semantic features are considered in the Tibetan word classification system, which enhances its capability to resolve unknown words and ambiguity. This design will increase search efficiency and help users get better search results.
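As an illustration of the Unicode-based steps the abstract mentions, the following is a minimal sketch and not the author's implementation. It assumes the page bytes are in an encoding Python can already decode (legacy font-based Tibetan encodings would need their own mapping tables, which are not shown), and it relies only on two facts about Unicode: Tibetan occupies the block U+0F00 to U+0FFF, and syllables are delimited by the tsheg mark (U+0F0B). The function names `decode_page`, `tibetan_ratio`, and `split_syllables` are hypothetical.

```python
from typing import List

TSHEG = "\u0f0b"  # Tibetan intersyllabic mark (tsheg)
SHAD = "\u0f0d"   # Tibetan clause/sentence mark (shad)


def decode_page(raw: bytes, declared_encoding: str = "utf-8") -> str:
    """Decode page bytes to a Unicode string.

    Assumption: the declared encoding is one Python supports natively.
    Undecodable bytes are replaced rather than raising an error.
    """
    return raw.decode(declared_encoding, errors="replace")


def is_tibetan(ch: str) -> bool:
    """True if the character lies in the Unicode Tibetan block."""
    return "\u0f00" <= ch <= "\u0fff"


def tibetan_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that are Tibetan."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(is_tibetan(c) for c in chars) / len(chars)


def split_syllables(text: str) -> List[str]:
    """Split text into Tibetan syllables on tsheg, shad, and non-Tibetan characters."""
    syllables: List[str] = []
    current: List[str] = []
    for ch in text:
        if ch in (TSHEG, SHAD) or not is_tibetan(ch):
            # A delimiter closes the syllable collected so far.
            if current:
                syllables.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        syllables.append("".join(current))
    return syllables
```

A crawler could use `tibetan_ratio` to decide whether a page is worth indexing as Tibetan, and feed the output of `split_syllables` to a word segmenter; the probabilistic and semantic segmentation described in the abstract is beyond this sketch.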
- Copyright
- © 2015, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF
AU - Zhong Wang
PY - 2015/07
DA - 2015/07
TI - The Design of Model for Tibetan Language Search System
BT - Proceedings of the International Conference on Chemical, Material and Food Engineering
PB - Atlantis Press
SP - 707
EP - 711
SN - 2352-5401
UR - https://doi.org/10.2991/cmfe-15.2015.167
DO - 10.2991/cmfe-15.2015.167
ID - Wang2015/07
ER -