Exploring cross-lingual transfer learning with unsupervised machine translation

Chao Wang; Judith Gaspers; Quynh Ngoc Thi Do; Hui Jiang

Publication

Exploring cross-lingual transfer learning with unsupervised machine translation

By Chao Wang, Judith Gaspers, Quynh Ngoc Thi Do, Hui Jiang

2021

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

In Natural Language Understanding (NLU), to facilitate Cross-Lingual Transfer Learning (CLTL), especially CLTL between distant languages, we integrate CLTL with Machine Translation (MT), and thereby propose a novel CLTL model named Translation Aided Language Learner (TALL). TALL is constructed as a standard transformer, where the encoder is a pre-trained multilingual language model. The training of TALL includes an MT-oriented pre-training and an NLU-oriented fine-tuning. To make use of unannotated data, we implement the recently proposed Unsupervised Machine Translation (UMT) technique in the MT-oriented pre-training of TALL. The experimental results show that the application of UMT enables TALL to consistently achieve better CLTL performance than our baseline model, which is the pre-trained multilingual language model serving as the encoder of TALL, without using more annotated data, and the performance gain is relatively prominent in the case of distant languages.

Exploring cross-lingual transfer learning with unsupervised machine translation

Latest news

Work with us