Speech Source Separation Using ICA in Constant Q Transform Domain

D.V.L.N, Dheeraj Sai and K.S, Kishor and Kodukula, Sri Rama Murty (2018) Speech Source Separation Using ICA in Constant Q Transform Domain. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2-6 September 2018, Hyderabad, India.

Full text not available from this repository.


In order to separate individual sources from convolutive speech mixtures, complex-domain independent component analysis (ICA) is employed on the individual frequency bins of time-frequency representations of the speech mixtures, obtained using the short-time Fourier transform (STFT). The frequency components computed using the STFT are uniformly spaced, with a constant frequency resolution. However, it is well known that the human auditory mechanism offers better resolution at lower frequencies. Hence, the perceptual quality of the extracted sources critically depends on the separation achieved in the lower frequency components. In this paper, we propose to perform source separation on the time-frequency representation computed through the constant Q transform (CQT), which offers non-uniform logarithmic binning in the frequency domain. Complex-domain ICA is performed on the individual bins of the CQT to obtain separated components in each frequency bin, which are suitably scaled and permuted to obtain separated sources in the CQT domain. The estimated sources are obtained by applying the inverse constant Q transform to the scaled and permuted sources. In comparison with STFT-based frequency-domain ICA methods, there is a consistent improvement of 3 dB or more in the signal-to-interference ratios (SIR) of the extracted sources.
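The contrast between uniform STFT binning and logarithmic CQT binning described in the abstract can be illustrated with a minimal sketch. The sample rate, FFT size, minimum frequency, and bins-per-octave values below are illustrative assumptions, not parameters reported in the paper, and the helper function names are hypothetical.

```python
import numpy as np

def stft_bin_frequencies(n_fft, sample_rate):
    """Center frequencies (Hz) of the non-negative STFT bins: uniform spacing."""
    return np.arange(n_fft // 2 + 1) * sample_rate / n_fft

def cqt_bin_frequencies(f_min, n_bins, bins_per_octave):
    """Center frequencies (Hz) of CQT bins: geometric (logarithmic) spacing."""
    k = np.arange(n_bins)
    return f_min * 2.0 ** (k / bins_per_octave)

def cqt_q_factor(bins_per_octave):
    """Ratio of center frequency to bin spacing, the same for every CQT bin."""
    return 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)

# Assumed, illustrative settings (not taken from the paper).
sample_rate = 16000
stft_freqs = stft_bin_frequencies(n_fft=512, sample_rate=sample_rate)
cqt_freqs = cqt_bin_frequencies(f_min=32.7, n_bins=84, bins_per_octave=12)

# STFT: constant difference between neighbouring bins.
stft_spacing = np.diff(stft_freqs)
# CQT: constant ratio between neighbouring bins.
cqt_ratio = cqt_freqs[1:] / cqt_freqs[:-1]

# Number of bins that fall below 500 Hz in each representation.
stft_low = int(np.sum(stft_freqs < 500.0))
cqt_low = int(np.sum(cqt_freqs < 500.0))

print("STFT bin spacing (Hz):", stft_spacing[0])
print("CQT neighbouring-bin ratio:", cqt_ratio[0])
print("CQT Q factor:", cqt_q_factor(12))
print("Bins below 500 Hz (STFT, CQT):", stft_low, cqt_low)
```

Under these assumed settings the CQT places more bins in the low-frequency region than the STFT does, which is the property the abstract relies on when arguing that separation quality at low frequencies matters perceptually.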

IITH Creators: Kodukula, Sri Rama Murty (ORCiD: https://orcid.org/0000-0002-6355-5287)
Item Type: Conference or Workshop Item (Other)
Additional Information: ISSN: 2308457X
Uncontrolled Keywords: Complex-valued ICA, Constant Q transform, Non-uniform logarithmic binning
Subjects: Electrical Engineering > Wireless Communication
Electrical Engineering > Process Control
Electrical Engineering > Power System
Electrical Engineering > Automation & Control Systems
Electrical Engineering > Electrical and Electronic
Electrical Engineering > Instruments and Instrumentation
Divisions: Department of Electrical Engineering
Depositing User: . LibTrainee 2021
Date Deposited: 27 Jul 2022 06:46
Last Modified: 27 Jul 2022 06:46
URI: http://raiith.iith.ac.in/id/eprint/7787
Publisher URL: http://doi.org/10.21437/Interspeech.2018-1732