TY - JOUR
T1 - Convolutional neural network with second-order pooling for underwater target classification
AU - Cao, Xu
AU - Togneri, Roberto
AU - Zhang, Xiaomin
AU - Yu, Yang
PY - 2019/4/15
Y1 - 2019/4/15
N2 - Underwater target classification using passive sonar remains a critical issue due to the changeable ocean environment. Convolutional Neural Networks (CNNs) have shown success in learning invariant features using local filtering and max pooling. In this paper, we propose a novel classification framework which combines the CNN architecture with the second-order pooling (SOP) to capture the temporal correlations from the timefrequency (T-F) representation of the radiated acoustic signals. The convolutional layers are used to learn the local features with a set of kernel filters from the T-F inputs which are extracted by the constant-Q transform (CQT). Instead of using max pooling, the proposed SOP operator is designed to learn the co-occurrences of different CNN filters using the temporal feature trajectory of CNN features for each frequency subband. To preserve the frequency distinctions, the correlated features of each frequency subband are retained. The pooling results are normalized with signed square-root and l2 normalization, and then input into the softmax classifier. The whole network can be trained in an end-to-end fashion. To explore the generalization ability to unseen conditions, the proposed CNN model is evaluated on the real radiated acoustic signals recorded at new sea depths. The experimental results demonstrate that the proposed method yields an 8% improvement in classification accuracy over the state-of-the-art deep learning methods.
AB - Underwater target classification using passive sonar remains a critical issue due to the changeable ocean environment. Convolutional Neural Networks (CNNs) have shown success in learning invariant features using local filtering and max pooling. In this paper, we propose a novel classification framework which combines the CNN architecture with the second-order pooling (SOP) to capture the temporal correlations from the timefrequency (T-F) representation of the radiated acoustic signals. The convolutional layers are used to learn the local features with a set of kernel filters from the T-F inputs which are extracted by the constant-Q transform (CQT). Instead of using max pooling, the proposed SOP operator is designed to learn the co-occurrences of different CNN filters using the temporal feature trajectory of CNN features for each frequency subband. To preserve the frequency distinctions, the correlated features of each frequency subband are retained. The pooling results are normalized with signed square-root and l2 normalization, and then input into the softmax classifier. The whole network can be trained in an end-to-end fashion. To explore the generalization ability to unseen conditions, the proposed CNN model is evaluated on the real radiated acoustic signals recorded at new sea depths. The experimental results demonstrate that the proposed method yields an 8% improvement in classification accuracy over the state-of-the-art deep learning methods.
KW - Acoustics
KW - Computer architecture
KW - constant-Q transform
KW - Convolutional neural networks
KW - convolutional neural networks
KW - Correlation
KW - Feature extraction
KW - second-order pooling
KW - Time-frequency analysis
KW - Transforms
KW - underwater target classification
UR - http://www.scopus.com/inward/record.url?scp=85058627798&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2018.2886368
DO - 10.1109/JSEN.2018.2886368
M3 - Article
AN - SCOPUS:85058627798
SN - 1530-437X
VL - 19
SP - 3058
EP - 3066
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 8
M1 - 8573835
ER -