Spoken Term Detection on Low Resource Languages

Reddy, B Naresh (2015) Spoken Term Detection on Low Resource Languages. Masters thesis, Indian Institute of Technology Hyderabad.

EE13M0001.pdf - Submitted Version

Download (1MB) | Preview


Developing efficient speech processing systems for low-resource languages is an immensely challenging problem. One potentially effective approach to address the lack of resources for any particular language, is to employ data from multiple languages for building speech processing sub-systems. This thesis investigates possible methodologies for Spoken Term Detection (STD) from low- resource Indian languages. The task of STD intend to search for a query keyword, given in text form, from a considerably large speech database. This is usually done by matching templates of feature vectors, representing sequence of phonemes from the query word and the continuous speech from the database. Typical set of features used to represent speech signals in most of the speech processing systems are the mel frequency cepstral coefficients (MFCC). As speech is a very complexsignal, holding information about the textual message, speaker identity, emotional and health state of the speaker, etc., the MFCC features derived from it will also contain information about all these factors. For eficient template matching, we need to neutralize the speaker variability in features and stabilize them to represent the speech variability alone.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Item Type: Thesis (Masters)
Uncontrolled Keywords: Multi-Layer Perceptron(MLP), Hidden Markov Model(HMM), Spoken Term Detection
Subjects: Others > Electricity
Divisions: Department of Electrical Engineering
Depositing User: Library Staff
Date Deposited: 05 Jan 2016 08:44
Last Modified: 29 Jul 2019 10:17
URI: http://raiith.iith.ac.in/id/eprint/2097
Publisher URL:
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 2097 Statistics for this ePrint Item