Tibetan Character Recognition Based on Machine Learning of K-means Algorithm
Authors
Huiwen Gong, Wei Xiang
Corresponding Author
Huiwen Gong
Available Online April 2018.
- DOI
- 10.2991/cmsa-18.2018.78
- Keywords
- artificial intelligence; machine learning; Tibetan character recognition; Tesseract-OCR; K-means algorithm
- Abstract
In this paper, we analyze and extract the structural features of Tibetan text using an image character recognition algorithm based on k-means. Using a character library file generated by Tesseract-OCR training, we improve the accuracy of text recognition and extraction from images and achieve recognition of Tibetan characters.
- Copyright
- © 2018, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY  - CONF
AU  - Huiwen Gong
AU  - Wei Xiang
PY  - 2018/04
DA  - 2018/04
TI  - Tibetan Character Recognition Based on Machine Learning of K-means Algorithm
BT  - Proceedings of the 2018 International Conference on Computer Modeling, Simulation and Algorithm (CMSA 2018)
PB  - Atlantis Press
SP  - 340
EP  - 342
SN  - 1951-6851
UR  - https://doi.org/10.2991/cmsa-18.2018.78
DO  - 10.2991/cmsa-18.2018.78
ID  - Gong2018/04
ER  -