Low resource named entity recognition using contextual word representation and neural cross-lingual knowledge transfer

Soyeon Caren Han, Yingru Lin, Siqu Long, Josiah Poon

Research output: Chapter in Book/Conference paperConference paperpeer-review

Abstract

Low resource Named Entity Recognition can be solved by transferring knowledge from a high to a low-resource language with shared multilingual embedding spaces. In this paper, we focus on the extreme low-resource NER scenario of unsupervised cross-lingual knowledge transfer, where no labelled training data or parallel corpus is available. We apply word-alignment with the contextualised word embedding and propose an efficient cross-lingual centroid-based space translation mechanism for contextual embedding. We found that the proposed alignment mechanism works well between different languages, compared to current state-of-the-art models. Moreover, word order differences is another problem to be resolved in cross-lingual NER. We alleviate this issue by incorporating a transformer, which relies entirely on an attention mechanism to draw global dependency between input and output. Our method was evaluated against state-of-the-art results, and it indicate that our approach was better in terms of the performance and the amount of resources.

Original languageEnglish
Title of host publicationNeural Information Processing - 26th International Conference, ICONIP 2019, Proceedings
EditorsTom Gedeon, Kok Wai Wong, Minho Lee
PublisherSpringer
Pages299-311
Number of pages13
ISBN (Print)9783030367077
DOIs
Publication statusPublished - 2019
Externally publishedYes
Event26th International Conference on Neural Information Processing, ICONIP 2019 - Sydney, Australia
Duration: 12 Dec 201915 Dec 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11953 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference26th International Conference on Neural Information Processing, ICONIP 2019
Country/TerritoryAustralia
CitySydney
Period12/12/1915/12/19

Fingerprint

Dive into the research topics of 'Low resource named entity recognition using contextual word representation and neural cross-lingual knowledge transfer'. Together they form a unique fingerprint.

Cite this