Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)

Development of Multilingual Speech Recognition and Translation Technologies for Communication and Interaction

Authors
Ali A. AL-Bakhrani1, 4, *, Gehad Abdullah Amran2, Aymen M. Al-Hejri4, 5, S. R. Chavan3, Ramesh Manza3, Sunil Nimbhore3
1Department of Computer Science, Technique Leaders College, Sana’a, Yemen
2Department of Management Science and Engineering, Dalian University of Technology, Dalian, Liaoning, 116024, China
3Department of Computer Science and IT, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India
4Faculty of Administrative and Computer Sciences, Albaydha University, Albaydha, Yemen
5School of Computational Sciences, Swami Ramanand Teerth Marathwada University, Nanded, Maharashtra, India
*Corresponding author. Email: albakhrani2017@gmail.com
Corresponding Author
Ali A. AL-Bakhrani
Available Online 10 August 2023.
DOI
10.2991/978-94-6463-196-8_54How to use a DOI?
Keywords
TTS; STT; DNN; Speech Recognition; Translation; Python
Abstract

In this study, we find a solution to the problem of recognizing the source language and translating it into the selected target language. This interface is designed to convert the voice or speech into any selected source text, convert it into the targeted text, and save it into wave files. This interface, which in turn solves many problems, including in the field of education and society, can be used in day-to-day life. We have worked on building a software project that solves the problem, as it relies on deep learning techniques in speech recognition. Building the application depends on several main parts: speech recognition, verification of the speaker's language, conversion of speech to text, translation of speech into any language, and conversion into any language. The text of the speaker or translator into voice also allows saving speech in a pdf file and supports translating entire files, as this application has been programmed using the Python programming language.

Copyright
© 2023 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)
Series
Advances in Intelligent Systems Research
Publication Date
10 August 2023
ISBN
978-94-6463-196-8
ISSN
1951-6851
DOI
10.2991/978-94-6463-196-8_54How to use a DOI?
Copyright
© 2023 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Ali A. AL-Bakhrani
AU  - Gehad Abdullah Amran
AU  - Aymen M. Al-Hejri
AU  - S. R. Chavan
AU  - Ramesh Manza
AU  - Sunil Nimbhore
PY  - 2023
DA  - 2023/08/10
TI  - Development of Multilingual Speech Recognition and Translation Technologies for Communication and Interaction
BT  - Proceedings of the First International Conference on Advances in Computer Vision and Artificial Intelligence Technologies (ACVAIT 2022)
PB  - Atlantis Press
SP  - 711
EP  - 723
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6463-196-8_54
DO  - 10.2991/978-94-6463-196-8_54
ID  - AL-Bakhrani2023
ER  -