Novel speech duration modifier for packet based communication system

Mani, S K and Dhiman, J K and Kodukula, Sri Rama Murty (2014) Novel speech duration modifier for packet based communication system. In: 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, 14-18 September, 2014, Max Atria at Singapore ExpoSingapore; Singapore.

IS140838.PDF - Published Version

Download (164kB)


In this paper, we propose a real-time method for duration modification of speech for packet based communication system. While there is rich literature available on duration modification, it fails to clearly address the issues in real-time implementation of the same. Most of the duration modification methods rely on accurate estimation of pitch marks, which is not feasible in a real-time scenario. The proposed method modifies the duration of Linear Prediction residual of individual frames without using any look-ahead delay and knowledge of pitch marks. In this method, multiples of pitch period is repeated or removed from a frame depending on a scheduling algorithm. The subjective quality of the proposed method was found to be better than waveform similarity overlap and add (WSOLA) technique as well as Linear Prediction Pitch Synchronous Overlap and Add (LP-PSOLA) technique.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Kodukula, Sri Rama Murty
Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: Linear prediction; Look-ahead delay; Packet-based communication; Speech duration; Voice over IP; WSOLA
Subjects: Electrical Engineering > Wireless Communication
Divisions: Department of Electrical Engineering
Depositing User: Team Library
Date Deposited: 02 Dec 2014 05:38
Last Modified: 05 Dec 2017 04:05
Publisher URL:
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 1102 Statistics for this ePrint Item