Volume 1, Issue 2, May 2008, Pages 116 - 126
Language Identification of Kannada, Hindi and English Text Words Through Visual Discriminating Features
Authors
M.C. Padma, P.A. Vijaya
Corresponding Author
M.C. Padma
Received 21 September 2007, Revised 29 October 2007, Available Online 1 May 2008.
- DOI
- 10.2991/ijcis.2008.1.2.2How to use a DOI?
- Keywords
- Document mage Processing, Multi-lingual Document, Language Identification, Horizontal Lines, Vertical Lines, Feature Extraction.
- Abstract
In a multilingual country like India, a document may contain text words in more than one language. For a multilingual environment, multi lingual Optical Character Recognition (OCR) system is needed to read the multilingual documents. So, it is necessary to identify different language regions of the document before feeding the document to the OCRs of individual language. The objective of this paper is to propose visual clues based procedure to identify Kannada, Hindi and English text portions of the Indian multilingual document.
- Copyright
- © 2008, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - JOUR AU - M.C. Padma AU - P.A. Vijaya PY - 2008 DA - 2008/05/01 TI - Language Identification of Kannada, Hindi and English Text Words Through Visual Discriminating Features JO - International Journal of Computational Intelligence Systems SP - 116 EP - 126 VL - 1 IS - 2 SN - 1875-6883 UR - https://doi.org/10.2991/ijcis.2008.1.2.2 DO - 10.2991/ijcis.2008.1.2.2 ID - Padma2008 ER -