Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)

Continuous speech recognition model based on CTC Technology

Authors
Yumeng Wang, Jianmin Zhao
Corresponding Author
Yumeng Wang
Available Online May 2018.
DOI
10.2991/ncce-18.2018.25How to use a DOI?
Keywords
Connectionist Temporical Classification;Speech Recognition; End to End.
Abstract

In end to end speech recognition,the linguistic knowledge such as pronunciation 1exicon is not essential??,and therefore the performance of the ASR systems based on CTC is weaker than that of the baseline.Aiming at this problem,a strategy combining the existing linguistic knowledge and the acoustic modeling based on CTC is proposed.and the tri??"phone is taken as the basic units in acoustic modeling.Thus,the sparse problem of the modeling unit is effectively solved.and the discrimination and robustness of the CTC model are improved substantially.

Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)
Series
Advances in Intelligent Systems Research
Publication Date
May 2018
ISBN
978-94-6252-517-7
ISSN
1951-6851
DOI
10.2991/ncce-18.2018.25How to use a DOI?
Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Yumeng Wang
AU  - Jianmin Zhao
PY  - 2018/05
DA  - 2018/05
TI  - Continuous speech recognition model based on CTC Technology
BT  - Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018)
PB  - Atlantis Press
SP  - 149
EP  - 152
SN  - 1951-6851
UR  - https://doi.org/10.2991/ncce-18.2018.25
DO  - 10.2991/ncce-18.2018.25
ID  - Wang2018/05
ER  -