Named entity correction in neural machine translation using the attention alignment map

Jangwon Lee, Jungi Lee, Minho Lee, Gil Jin Jang

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Neural machine translation (NMT) methods based on various artificial neural network models have shown remarkable performance in diverse tasks and have become mainstream for machine translation currently. Despite the recent successes of NMT applications, a predefined vocabulary is still required, meaning that it cannot cope with out-of-vocabulary (OOV) or rarely occurring words. In this paper, we propose a postprocessing method for correcting machine translation outputs using a named entity recognition (NER) model to overcome the problem of OOV words in NMT tasks. We use attention alignment mapping (AAM) between the named entities of input and output sentences, and mistranslated named entities are corrected using word look-up tables. The proposed method corrects named entities only, so it does not require retraining of existing NMT models. We carried out translation experiments on a Chinese-to-Korean translation task for Korean historical documents, and the evaluation results demonstrated that the proposed method improved the bilingual evaluation understudy (BLEU) score by 3.70 from the baseline.

Original languageEnglish
Article number7026
JournalApplied Sciences (Switzerland)
Volume11
Issue number15
DOIs
StatePublished - 1 Aug 2021

Keywords

  • Named entity recognition
  • Natural language processing
  • Neural machine translation
  • Neural networks
  • Recurrent neural networks

Fingerprint

Dive into the research topics of 'Named entity correction in neural machine translation using the attention alignment map'. Together they form a unique fingerprint.

Cite this