A Data-driven Approach for Cross Transformation Between Mongolian texts
- DOI
- 10.2991/icssr-13.2013.84How to use a DOI?
- Keywords
- Mongolian texts; cross language transformation; DP; data driven approach
- Abstract
This paper discusses a data-driven approach to transforming different graphic texts of Mongolian. Using the proposed approach, it is possible to transcribe or translate texts between similar languages such as Mongolian graphic texts used in different regions and countries, as well as the Altaic family languages like Uygur Turkic and Kazakh. The approach has been implemented based on DP (dynamic programming) matching supported by the knowledge-based sequence matching, referred to a multilingual dictionary and a data-driven approach of the target language corpus. Experimental results demonstrate that the proposed method achieves 86.4% transformation accuracy (in F-measure) for the NM (Cyrillic) to the TM (Traditional Mongolian) mainly used in the inner Mongolia, and 91.1% NM to Todo, which is mainly used in Xinjiang areas in China.
- Copyright
- © 2013, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Yidemucao Dawa AU - Niyazbek Muheyat AU - Amantay Ayjarken PY - 2013/07 DA - 2013/07 TI - A Data-driven Approach for Cross Transformation Between Mongolian texts BT - Proceedings of the 2nd International Conference on Science and Social Research (ICSSR 2013) PB - Atlantis Press SP - 370 EP - 375 SN - 1951-6851 UR - https://doi.org/10.2991/icssr-13.2013.84 DO - 10.2991/icssr-13.2013.84 ID - Dawa2013/07 ER -