Abstract
© 2015 IEEE. Motivated by the recent progresses in the use of deep learning techniques for acoustic speech recognition, we present in this paper a visual deep bottleneck feature (DBNF) learning scheme using a stacked auto-encoder combined with other techniques. Experimental results show that our proposed deep feature learning scheme yields approximately 24% relative improvement for visual speech accuracy. To the best of our knowledge, this is the first study which uses deep bottleneck feature on visual speech recognition. Our work firstly shows that the deep bottleneck visual feature is able to achieve a significant accuracy improvement on visual speech recognition.
Original language | English |
---|---|
Title of host publication | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 1518-1522 |
Volume | 2015-August |
ISBN (Print) | 9781467369978 |
DOIs | |
Publication status | Published - 2015 |
Event | Extracting deep bottleneck features for visual speech recognition - South Brisbane, Queensland Duration: 1 Jan 2015 → … |
Conference
Conference | Extracting deep bottleneck features for visual speech recognition |
---|---|
Period | 1/01/15 → … |