Unsupervised Segmentation of Speech Signals Using Kernel-Gram Matrices

Bhati, Saurabhchand and Nayak, Shekhar and Kodukula, Sri Rama Murty (2018) Unsupervised Segmentation of Speech Signals Using Kernel-Gram Matrices. Communications in Computer and Information Science, 841. pp. 139-149. ISSN 1865-0929

Full text not available from this repository. (Request a copy)


The objective of this paper is to develop an unsupervised method for segmentation of speech signals into phoneme-like units. The proposed algorithm is based on the observation that the feature vectors from the same segment exhibit higher degree of similarity than the feature vectors across the segments. The kernel-Gram matrix of an utterance is formed by computing the similarity between every pair of feature vectors in the Gaussian kernel space. The kernel-Gram matrix consists of square patches, along with the principle diagonal, corresponding to different phoneme-like segments in the speech signal. It detects the number of segments, as well as their boundaries automatically. The proposed approach does not assume any information about input utterances like exact distribution of segment length or correct number of segments in an utterance. The proposed method out-performs the state-of-the-art blind segmentation algorithms on Zero Resource 2015 databases and TIMIT database.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Kodukula, Sri Rama Murtyhttps://orcid.org/0000-0002-6355-5287
Item Type: Article
Uncontrolled Keywords: Blind segmentationGaussian kernelKernel-Gram matrixPhonetic segmentation
Subjects: Electrical Engineering
Depositing User: Team Library
Date Deposited: 21 May 2018 11:22
Last Modified: 21 May 2018 11:22
URI: http://raiith.iith.ac.in/id/eprint/3938
Publisher URL: http://doi.org/10.1007/978-981-13-0020-2_13
OA policy: http://www.sherpa.ac.uk/romeo/issn/1865-0929/
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 3938 Statistics for this ePrint Item