A deep neural network for audio-visual person recognition

Research output: Chapter in Book/Conference paperConference paper

6 Citations (Scopus)

Abstract

© 2015 IEEE. This paper presents applications of special types of deep neural networks (DNNs) for audio-visual biometrics. A common example is the DBN-DNN that uses the generative weights of deep belief networks (DBNs) to initialize the feature detecting layers of deterministic feed forward DNNs. In this paper, we propose the DBM-DNN that uses the generative weights of deep Boltzmann machines (DBMs) for initialization of DNNs. Then, a softmax layer is added on top and the DNNs are trained discriminatively. Our experimental results show that lower error rates can be achieved using the DBM-DNN compared to the support vector machine (SVM), linear regression-based classifier (LRC) and the DBN-DNN. Experiments were carried out on two publicly available audio-visual datasets: the VidTIMIT and MOBIO.
Original languageEnglish
Title of host publicationBiometrics Theory, Applications and Systems (BTAS), 2015 IEEE 7th International Conference
Place of PublicationUSA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages1-6
VolumeN/A
ISBN (Print)9781479987764
DOIs
Publication statusPublished - 2015
EventBiometrics Theory, Applications and Systems (BTAS) 2015 - Virginia, United States
Duration: 8 Sep 201511 Sep 2015

Conference

ConferenceBiometrics Theory, Applications and Systems (BTAS) 2015
CountryUnited States
CityVirginia
Period8/09/1511/09/15

Fingerprint Dive into the research topics of 'A deep neural network for audio-visual person recognition'. Together they form a unique fingerprint.

Cite this