Abstract
In this paper we investigate the use of observation weights and contextual time-frequency information for clustering-based blind source separation. Previous clustering-based approaches have successfully used clustering techniques to estimate time-frequency separation masks; however, these approaches generally disregard the structured nature of speech signals. Motivated by the homogenous behavior of speech signals, we propose to modify the established fuzzy c-means algorithm to bias the clustering results in favor of cluster membership homogeneity within localized neighborhoods in the time-frequency space. This problem can be solved by using a two-stage algorithm: firstly, the estimation of data weights to indicate the reliability of each data point, and secondly, the integration of local contextual information into the cluster update equations from neighboring time-frequency slots. The proposed algorithm is evaluated in a three-fold manner using simulated, real recordings and public benchmark data; notable improvement in source separation performance over previous clustering approaches was achieved. © 2014 IEEE.
Original language | English |
---|---|
Title of host publication | IEEE International Conference on Acoustics, Speech and Signal Processing |
Place of Publication | Florence, Italy |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 157-160 |
ISBN (Print) | 9781479949755 |
DOIs | |
Publication status | Published - 2014 |
Event | 2014 IEEE Workshop on Statistical Signal Processing - Gold Coast, Australia Duration: 29 Jun 2014 → 2 Jul 2014 |
Conference
Conference | 2014 IEEE Workshop on Statistical Signal Processing |
---|---|
Country/Territory | Australia |
City | Gold Coast |
Period | 29/06/14 → 2/07/14 |