Controlling formality in machine translation

Transfer learning using limited contrastive data improves formality accuracy without compromising performance.

By Maria Nădejde, Benjamin Hsu

December 19, 2022

Neural machine translation (NMT) systems typically return a single translation for each input text segment. This means that when the input segment is ambiguous, the model must choose a translation from among various valid options, without regard to the intended use case or target audience. For example, a translator taking English inputs will often need to choose among multiple levels of formality or grammatical register — the tu and vous of French, for instance, or the tú and usted of Spanish — in the output.

In the past, training NMT models with formality control has relied on large labeled datasets. Creating high-quality labeled translations for many diverse languages is time-consuming and expensive, and these earlier efforts were limited to specific languages.

Culture and context

Without formality control, a model left to choose among valid translation options can produce translations that are inconsistent or that come across as rude or jarring in certain scenarios (e.g., business, customer service, gaming chat) or to speakers from certain cultures. For example, when asking “Are you sure?” in German, a customer support agent would use the formal register (“Sind Sie sich sicher?”), while players in a game chat would address each other in the informal register (“Bist du dir sicher?”).

Translation formality.png — Valid translation options in German, with formal and informal register.

How formality is expressed grammatically and lexically can vary widely by language. In many Indo-European languages (e.g., German, Hindi, Italian, and Spanish), the formal and informal registers are distinguished by the second-person pronouns and/or corresponding verb agreement. Japanese and Korean have more extensive means of expressing polite, respectful, and humble speech, including morphological markings on the main verb and on some nouns and adjectives; specific lexical choices; and longer sentences.

Learning to control formality with limited data

For the initial release of CoCoA-MT, our dataset of contrastive formal and informal translations, we focused on six language pairs (English → {German, Spanish, French, Hindi, Italian, Japanese}) across three spoken-language domains: customer support chat, topical chat, and telephone conversations. We asked professional translators to generate formal and informal translations from English source segments. The informal translations were produced by post-editing the formal ones, with translators instructed to make only the minimal necessary changes (e.g., changing verb inflections, swapping pronouns). Translators additionally annotated the phrases that indicate formality level, and we used those annotations to develop a segment-level metric for measuring formality accuracy.

Lego example.png — Contrastive reference translations labeled with different formality levels. Phrase-level formality markers in the target languages are annotated with *[F]text[/F]*.
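As a rough illustration of how phrase-level annotations can yield a segment-level score, here is a minimal Python sketch. It assumes the `[F]…[/F]` marker format from the caption above and a simple matching rule: a hypothesis counts as formal if it contains an annotated phrase from the formal reference and none from the informal one, and vice versa. The function names and the exact rule are illustrative assumptions, not necessarily the metric defined in the paper, and the word-boundary matching would need adapting for languages written without spaces, such as Japanese.

```python
import re

# Assumed annotation format, taken from the figure caption: [F]phrase[/F].
MARKER = re.compile(r"\[F\](.*?)\[/F\]")


def marked_phrases(annotated):
    """Return the lower-cased phrases wrapped in [F]...[/F] markers."""
    return {phrase.strip().lower() for phrase in MARKER.findall(annotated)}


def contains_phrase(text, phrase):
    """True if `phrase` occurs in `text` as a whole word or word sequence."""
    pattern = r"(?<!\w)" + re.escape(phrase) + r"(?!\w)"
    return re.search(pattern, text) is not None


def classify(hypothesis, formal_ref, informal_ref):
    """Label a hypothesis 'formal', 'informal', or 'unknown' by phrase matching."""
    text = hypothesis.lower()
    formal_hit = any(contains_phrase(text, p) for p in marked_phrases(formal_ref))
    informal_hit = any(contains_phrase(text, p) for p in marked_phrases(informal_ref))
    if formal_hit and not informal_hit:
        return "formal"
    if informal_hit and not formal_hit:
        return "informal"
    return "unknown"


def formality_accuracy(examples, target):
    """Fraction of hypotheses whose label equals the requested formality level.

    `examples` is a list of (hypothesis, formal_ref, informal_ref) triples.
    """
    if not examples:
        return 0.0
    labels = [classify(hyp, formal, informal) for hyp, formal, informal in examples]
    return sum(label == target for label in labels) / len(labels)
```

Hypotheses that match neither reference, or both, are labeled `unknown` and count against accuracy in this sketch. That is one possible design choice; a different metric could exclude them from the denominator.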

To leverage a small amount of labeled contrastive data, we propose framing formality control as a transfer learning problem. Our method begins with a generic NMT model, which is fine-tuned on contrastive examples from the CoCoA-MT dataset.

Training methodology.png — A schematic of the proposed method for training formality-controlled neural-machine-translation systems.

In our paper, we show that this fine-tuning strategy can successfully control the formality of a generic NMT system without losing generic quality. We also show that the formality-controlled system is effective in out-of-domain settings (i.e., settings not matching the training domain).

Fine-tuning results.png — Fine-tuning with an equal mixture of contrastive examples from CoCoA-MT and unlabeled parallel data increased accuracy on formal and informal segments across all three domains, including on the held-out (call-center) domain. At the same time, generic accuracy on the MuST-C dataset remained steady.
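The equal mixture described in the caption above could be assembled along the following lines. This is a minimal sketch under stated assumptions: the control tokens `<formal>` and `<informal>` prepended to the source are a common way to condition an NMT model but are hypothetical here, since this article does not say how the formality label is passed to the model. The function names and data layout are likewise illustrative.

```python
import random


def tag_source(source, formality):
    """Prepend a hypothetical control token such as <formal> to the source."""
    return f"<{formality}> {source}"


def build_finetuning_set(contrastive, generic, seed=0):
    """Mix tagged contrastive pairs with an equal number of untagged generic pairs.

    contrastive: list of (source, formal_target, informal_target) triples
    generic:     list of (source, target) pairs without formality labels
    """
    rng = random.Random(seed)

    # Each contrastive triple yields two labeled training pairs.
    labeled = []
    for source, formal, informal in contrastive:
        labeled.append((tag_source(source, "formal"), formal))
        labeled.append((tag_source(source, "informal"), informal))

    # Equal mixture: sample as many generic pairs as there are labeled ones
    # (fewer only if the generic pool is too small).
    count = min(len(labeled), len(generic))
    unlabeled = rng.sample(generic, count)

    mixed = labeled + unlabeled
    rng.shuffle(mixed)
    return mixed
```

Keeping untagged generic pairs in the mix is what lets the fine-tuned model continue to see ordinary translation examples, which is consistent with the steady generic accuracy reported above.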

What’s next?

We’ve seen that contrastive labeled data and transfer learning make it possible to train formality-controlled models with a limited amount of data while preserving generic quality and generalizing to unseen domains. But challenges remain, especially as we look to expand formality customization to all 75 languages supported by Amazon Translate. For now, Amazon Translate customers can take advantage of this latest research and control the formality level when translating into French, German, Hindi, Italian, Japanese, Spain Spanish, Mexican Spanish, Korean, Dutch, Canadian French, and Portugal Portuguese. Researchers can look forward to IWSLT 2023, where we will organize a shared task on formality control in collaboration with the University of Maryland, College Park.
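For readers who want to try the formality setting, the sketch below shows one way to request it through the AWS SDK for Python. It relies on the `translate_text` operation of the boto3 Translate client and its `Settings={"Formality": ...}` parameter. The wrapper function name is our own, and the client is passed in as an argument so that the snippet can be read and tested without AWS credentials.

```python
def translate_with_formality(client, text, target_language, formality="FORMAL"):
    """Translate English text with a requested formality level.

    `client` is expected to behave like boto3.client("translate").
    """
    if formality not in ("FORMAL", "INFORMAL"):
        raise ValueError("formality must be 'FORMAL' or 'INFORMAL'")
    response = client.translate_text(
        Text=text,
        SourceLanguageCode="en",
        TargetLanguageCode=target_language,
        Settings={"Formality": formality},
    )
    return response["TranslatedText"]


# Usage (requires the boto3 package and configured AWS credentials):
#   import boto3
#   client = boto3.client("translate")
#   print(translate_with_formality(client, "Are you sure?", "de", "INFORMAL"))
```

The formality setting applies only to target languages that support it, so it is worth checking the current Amazon Translate documentation for the supported list.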

Acknowledgments: Anna Currey

About the Authors

Maria Nădejde

Maria Nădejde is a senior applied scientist with Amazon Translate.

Benjamin Hsu
