Abstract
Sparse representations of signals have received a great deal of attention in recent years, and the sparse representation classifier has very lately appeared in a speaker recognition system. This approach represents the (sparse) GMM mean supervector of an unknown speaker as a linear combination of an over-complete dictionary of GMM supervectors of many speaker models, and l(1)-norm minimization results in a non-zero coefficient corresponding to the unknown speaker class index. Here this approach is tested on large databases, introducing channel-/session-variability compensation, and fused with a GMM-SVM system. Evaluations on the NIST 2001 SRE and NIST 2006 SRE database show that when the outputs of the MFCC UBM-GMM based classifier (for NIST 2001 SRE) or MFCC GMM-SVM based classifier (for NIST 2006 SRE) are fused with the MFCC GMM-Sparse Representation Classifier (GMM-SRC) based classifier, an absolute gain of 1.27% and 0.25% in EER can be achieved respectively.
Original language | English |
---|---|
Title of host publication | 2011 IEEE International Conference On Acoustics, Speech, And Signal Processing |
Place of Publication | United States |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 4548-4551 |
Volume | 1 |
ISBN (Print) | 9781457705397 |
Publication status | Published - 2011 |
Event | 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2011 - Prague, Czech Republic Duration: 22 May 2011 → 27 May 2011 |
Conference
Conference | 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2011 |
---|---|
Abbreviated title | ICASSP 2011 |
Country/Territory | Czech Republic |
City | Prague |
Period | 22/05/11 → 27/05/11 |