Multi-task pre-finetuning for zero-shot cross lingual transfer

Moukthika Yerramilli; Pritam Varma; Anurag Dwarakanath

Publication

Multi-task pre-finetuning for zero-shot cross lingual transfer

By Moukthika Yerramilli, Pritam Varma, Anurag Dwarakanath

2021

Download Copy BibTeX

Share

Download

Copy BibTeX

Share

Building machine learning models for low resource languages is extremely challenging due to the lack of available training data (either un-annotated or annotated). To support such scenarios, zero-shot cross lingual transfer is used where the machine learning model is trained on a resource rich language and is directly tested on the resource poor language. In this paper, we present a technique which improves the performance of zero-shot cross lingual transfer. Our method performs multi-task pre-finetuning on a resource rich language using a multilingual pre-trained model. The pre-finetuned model is then tested in a zero-shot manner on the resource poor languages. We test the performance of our method on 8 languages and for two tasks, namely, Intent Classification (IC) & Named Entity Recognition (NER) using the MultiAtis++ dataset. The results show that our method improves IC performance in 7 out of 8 languages and NER performance in 4 languages. Our method also leads to faster convergence during finetuning. The usage of pre-finetuning demonstrates a data efficient way for supporting new languages and geographies across the world.

Multi-task pre-finetuning for zero-shot cross lingual transfer

Latest news

Work with us