Extracting deep bottleneck features for visual speech recognition

Research output: Chapter in Book/Conference paper › Conference paper › peer-review

13 Citations (Scopus)


© 2015 IEEE. Motivated by recent progress in the use of deep learning techniques for acoustic speech recognition, we present a visual deep bottleneck feature (DBNF) learning scheme that uses a stacked auto-encoder combined with other techniques. Experimental results show that the proposed deep feature learning scheme yields approximately 24% relative improvement in visual speech recognition accuracy. To the best of our knowledge, this is the first study to use deep bottleneck features for visual speech recognition, and the first to show that deep bottleneck visual features can achieve a significant accuracy improvement on this task.
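As a rough illustration of what a bottleneck feature is, the sketch below passes one input vector through a small stack of layers and reads out the activations at the narrowest layer. It is not the authors' implementation: the layer sizes, the sigmoid activation, the random (untrained) weights, and the helper names `make_layer`, `forward`, and `bottleneck_features` are all assumptions made for this example, and the abstract does not specify the network configuration or the "other techniques" used.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_layer(n_in, n_out, rng):
    """Random weights and zero biases; stand-ins for trained parameters."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def forward(layer, x):
    """Apply one fully connected sigmoid layer to the vector x."""
    weights, biases = layer
    return [
        sigmoid(sum(w * v for w, v in zip(row, x)) + b)
        for row, b in zip(weights, biases)
    ]


def bottleneck_features(layers, x, bottleneck_index):
    """Run x through the stack up to and including the bottleneck layer."""
    h = x
    for layer in layers[: bottleneck_index + 1]:
        h = forward(layer, h)
    return h


rng = random.Random(0)

# Hypothetical sizes: 64-d input -> 32 -> 8 (bottleneck) -> 32 -> 64.
sizes = [64, 32, 8, 32, 64]
layers = [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]

# A made-up 64-d vector standing in for one frame of visual input.
frame = [rng.random() for _ in range(sizes[0])]

features = bottleneck_features(layers, frame, bottleneck_index=1)
print(len(features))  # 8
```

In a trained system the weights would come from training the stacked auto-encoder, and the low-dimensional `features` vector, rather than the raw input, would be passed to the recogniser.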
Original language: English
Title of host publication: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Publisher: IEEE, Institute of Electrical and Electronics Engineers
ISBN (Print): 9781467369978
Publication status: Published - 2015
Event: Extracting deep bottleneck features for visual speech recognition - South Brisbane, Queensland
Duration: 1 Jan 2015 → …


Conference: Extracting deep bottleneck features for visual speech recognition
Period: 1/01/15 → …

