Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)

A Numeral Script Identification from a Multi-lingual Printed Document Image

Authors
Rajkumar Benne1, *, Shivanand Gornale2, Gayatri Patil2
1Government Autonomous College, Kalaburagi, India
2Rani Channamma University, Belagavi, India
*Corresponding author. Email: rgbenne@gmail.com
Corresponding Author
Rajkumar Benne
Available Online 10 August 2023.
DOI
10.2991/978-94-6463-196-8_16
Keywords
Script identification; documents; OCR
Abstract

India is a multilingual, multi-script country. A printed document may contain text, images, and other content, and its text may be composed of characters and numerals from one or more scripts. It is therefore necessary to identify the script of the numerals and characters in a multilingual document before passing them to the OCR system for that script. This paper attempts to identify the script of numerals belonging to Kannada, Devanagari, and English using structural features such as water reservoirs, aspect ratio, and horizontal and vertical strokes. Bi-script and tri-script identification experiments were conducted on a dataset of 2100 numeral strings (words), with 700 samples per script; the average accuracy for tri-script numeral identification was 93.62%.
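
As an illustration of two of the structural features named in the abstract, the sketch below computes an aspect ratio and simple horizontal and vertical stroke counts from a binarized numeral image. It is a minimal sketch under stated assumptions, not the authors' method: the abstract does not define the features precisely, so the function names, the 0/1 image representation, and the `min_frac` threshold are all hypothetical. The water reservoir feature is omitted because its definition is more involved.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed representation: 1 = ink, 0 = background


def bounding_box(img: Image) -> Tuple[int, int, int, int]:
    """Return (top, bottom, left, right) of the ink pixels, inclusive."""
    rows = [r for r, row in enumerate(img) if any(row)]
    if not rows:
        raise ValueError("image contains no ink pixels")
    cols = [c for c in range(len(img[0])) if any(row[c] for row in img)]
    return rows[0], rows[-1], cols[0], cols[-1]


def aspect_ratio(img: Image) -> float:
    """Width of the ink bounding box divided by its height."""
    top, bottom, left, right = bounding_box(img)
    return (right - left + 1) / (bottom - top + 1)


def longest_run(line: List[int]) -> int:
    """Length of the longest unbroken run of ink pixels in a row or column."""
    best = run = 0
    for px in line:
        run = run + 1 if px else 0
        best = max(best, run)
    return best


def stroke_counts(img: Image, min_frac: float = 0.7) -> Tuple[int, int]:
    """Count rows and columns that look like strokes.

    A row counts as a horizontal stroke if its longest ink run spans at
    least min_frac of the bounding-box width; columns are treated the
    same way against the height. min_frac is an assumed threshold.
    """
    top, bottom, left, right = bounding_box(img)
    crop = [row[left:right + 1] for row in img[top:bottom + 1]]
    width, height = right - left + 1, bottom - top + 1
    horizontal = sum(longest_run(row) >= min_frac * width for row in crop)
    columns = [list(col) for col in zip(*crop)]
    vertical = sum(longest_run(col) >= min_frac * height for col in columns)
    return horizontal, vertical


# Toy glyph: a headline-like bar on top with one vertical stem below it.
sample = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 1, 0, 0],
]
print(aspect_ratio(sample), stroke_counts(sample))
```

A classifier for script identification would take such values, computed per numeral string, as part of its feature vector; which classifier and which exact feature definitions the paper uses cannot be determined from the abstract alone.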

Copyright
© 2023 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Volume Title
Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)
Series
Advances in Intelligent Systems Research
Publication Date
10 August 2023
ISBN
978-94-6463-196-8
ISSN
1951-6851

Cite this article

TY  - CONF
AU  - Rajkumar Benne
AU  - Shivanand Gornale
AU  - Gayatri Patil
PY  - 2023
DA  - 2023/08/10
TI  - A Numeral Script Identification from a Multi-lingual Printed Document Image
BT  - Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)
PB  - Atlantis Press
SP  - 178
EP  - 186
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6463-196-8_16
DO  - 10.2991/978-94-6463-196-8_16
ID  - Benne2023
ER  -