Analyzing temporal properties of speech trajectory using graph structures towards speech recognition

Parabattina Bhagath, Malempati Shanmukha, Gnana Nagasri Puthi

Abstract


Speech signal analysis aims to identify patterns within data to develop effec tive recognition algorithms. This process primarily utilizes feature extraction techniques such as linear predictive coding (LPC), linear predictive cepstral co efficients (LPCCs), and Mel-frequency cepstral coefficients (MFCCs). These features are crucial for constructing recognition algorithms that leverage both statistical and deep learning methods. While deep learning models require ex tensive datasets, they often prove unsuitable for low-resource languages. The Hidden Markov model (HMM)is the most widely adopted statistical framework in speech processing. However, HMMs are characterized by state-dependent models, where each state interacts only with its neighboring states. This limita tion restricts HMMs from capturing long-term signal properties, highlighting the need for addressing these constraints at the feature extraction stage. Most feature extraction methods rely on short-term signal processing, which further limits the comprehension of speech utterances. To overcome these limitations, alter native methods are necessary to capture more comprehensive patterns. This pa per presents a graph-based approach for analyzing speech trajectories and their temporal properties, which are subsequently validated using HMMs in speech recognition tasks. Graph-based representations on a low-resource Telugu dataset improve recognition accuracy by 13% while reducing processing time compared to traditional LPC.

Keywords


graph eigenvalues; graph signal processing; speech analysis; structural processing; speech trajectory;

Full Text:

PDF


DOI: http://doi.org/10.12928/telkomnika.v23i6.26893

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

TELKOMNIKA Telecommunication, Computing, Electronics and Control
ISSN: 1693-6930, e-ISSN: 2302-9293
Universitas Ahmad Dahlan, 4th Campus
Jl. Ringroad Selatan, Kragilan, Tamanan, Banguntapan, Bantul, Yogyakarta, Indonesia 55191
Phone: +62 (274) 563515, 511830, 379418, 371120
Fax: +62 274 564604

View TELKOMNIKA Stats