Analyzing temporal properties of speech trajectory using graph structures towards speech recognition
Parabattina Bhagath, Malempati Shanmukha, Gnana Nagasri Puthi
Abstract
Speech signal analysis aims to identify patterns within data to develop effective recognition algorithms. This process primarily utilizes feature extraction techniques such as linear predictive coding (LPC), linear predictive cepstral coefficients (LPCCs), and Mel-frequency cepstral coefficients (MFCCs). These features are crucial for constructing recognition algorithms that leverage both statistical and deep learning methods. Because deep learning models require extensive datasets, they often prove unsuitable for low-resource languages. The hidden Markov model (HMM) is the most widely adopted statistical framework in speech processing. However, HMMs are characterized by state-dependent models, where each state interacts only with its neighboring states. This limitation prevents HMMs from capturing long-term signal properties, highlighting the need to address these constraints at the feature extraction stage. Most feature extraction methods rely on short-term signal processing, which further limits the comprehension of speech utterances. To overcome these limitations, alternative methods are necessary to capture more comprehensive patterns. This paper presents a graph-based approach for analyzing speech trajectories and their temporal properties, which are subsequently validated using HMMs in speech recognition tasks. On a low-resource Telugu dataset, graph-based representations improve recognition accuracy by 13% and reduce processing time compared to traditional LPC.
Keywords
graph eigenvalues; graph signal processing; speech analysis; structural processing; speech trajectory
DOI:
http://doi.org/10.12928/telkomnika.v23i6.26893
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
TELKOMNIKA Telecommunication, Computing, Electronics and Control ISSN: 1693-6930 , e-ISSN: 2302-9293 Universitas Ahmad Dahlan , 4th Campus Jl. Ringroad Selatan, Kragilan, Tamanan, Banguntapan, Bantul, Yogyakarta, Indonesia 55191 Phone: +62 (274) 563515, 511830, 379418, 371120 Fax: +62 274 564604