Skip to main content Skip to main navigation menu Skip to site footer
Bulletin of Abai KazNPU. Series of Physical and mathematical sciences

RESEARCH OF ACOUSTIC AND LINGUISTIC MODELING BASED ON REPETITIVE NEURAL NETWORKS FOR SPEECH RECOGNITION OF CHILDREN

Published March 2022

168

128

N. Mekebayev+
Kazakh National Women's Pedagogical University
Ch Tuyebaev+
al-Farabi Kazakh National university, Almaty, Kazakhstan
https://orcid.org/0000-0001-9769-1239
K. Sabrayev+
Abai Кazakh National Pedagogical university, Almaty, Kazakhstan
https://orcid.org/0000-0001-9141-4935
А Yerkebay+
Kazakh national women's teacher training university, Almaty, Kazakhstan
https://orcid.org/0000-0002-5812-2137
Kazakh National Women's Pedagogical University
al-Farabi Kazakh National university, Almaty, Kazakhstan
Abai Кazakh National Pedagogical university, Almaty, Kazakhstan
Kazakh national women's teacher training university, Almaty, Kazakhstan
Abstract

The ordinary automatic speech recognition(ASR) frameworks utilize the GMM-HMM for acoustic modeling and the n-gram for language modeling. In the course of the last decade, the deep feed-forward neural network (DFNN) has nearly replaced the GMM in acoustic modeling. The current ASR systems are predominantly dependent on the DFNN-HMM acoustic model and the n-gram language model (LM). Inferable from better long-termcontext displaying capacity, the recurrent neural network(RNN) based LMs have as of now been accounted for to yield lower perplexitythan the n-gram LMs. As of late a variation of RNN, the long-short term memory(LSTM) has been effectively investigated inacoustic modeling. Strangely, the assessment of an ASR systemem ploying both RNN-based acoustic and semantic demonstrating is yetto be accounted for. Further, we note that most of these advancementsare explored in the context of adults’ ASR only. Persuaded bythose works, in this paper we investigate LSTM-based acoustic modeling joined with RNN-based LM for children’s ASR.Our exploratory outcomes show that such consolidated RNN-based modeling is found viable in both coordinated and mismatched children's ASR tasks.

.pdf (Русский)
Language

Қазақ

How to Cite

[1]
Mekebayev Н., Tuyebaev Ш., Sabrayev Қ. and Yerkebay А. 2022. RESEARCH OF ACOUSTIC AND LINGUISTIC MODELING BASED ON REPETITIVE NEURAL NETWORKS FOR SPEECH RECOGNITION OF CHILDREN. Bulletin of Abai KazNPU. Series of Physical and mathematical sciences. 77, 1 (Mar. 2022), 119–126. DOI:https://doi.org/10.51889/2022-1.1728-7901.16.