Continuous speech recognition model based on CTC Technology
Authors
Yumeng Wang, Jianmin Zhao
Corresponding Author
Yumeng Wang
Available Online May 2018.
- DOI
- 10.2991/ncce-18.2018.25How to use a DOI?
- Keywords
- Connectionist Temporical Classification;Speech Recognition; End to End.
- Abstract
In end to end speech recognition,the linguistic knowledge such as pronunciation 1exicon is not essential??,and therefore the performance of the ASR systems based on CTC is weaker than that of the baseline.Aiming at this problem,a strategy combining the existing linguistic knowledge and the acoustic modeling based on CTC is proposed.and the tri??"phone is taken as the basic units in acoustic modeling.Thus,the sparse problem of the modeling unit is effectively solved.and the discrimination and robustness of the CTC model are improved substantially.
- Copyright
- © 2018, the Authors. Published by Atlantis Press.
- Open Access
- This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
Cite this article
TY - CONF AU - Yumeng Wang AU - Jianmin Zhao PY - 2018/05 DA - 2018/05 TI - Continuous speech recognition model based on CTC Technology BT - Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018) PB - Atlantis Press SP - 149 EP - 152 SN - 1951-6851 UR - https://doi.org/10.2991/ncce-18.2018.25 DO - 10.2991/ncce-18.2018.25 ID - Wang2018/05 ER -