Prompt perturbation consistency learning for robust language models

Yao Qiang; Nandi Subhrangshu; Ninareh Mehrabi; Greg Ver Steeg; Anoop Kumar; Anna Rumshisky; Aram Galstyan

2024

Large language models (LLMs) have demonstrated impressive performance on a number of natural language processing tasks, such as question answering and text summarization. However, their performance on sequence labeling tasks such as intent classification and slot filling (IC-SF), a central component of personal assistant systems, lags significantly behind that of discriminative models. Furthermore, there is a lack of substantive research on the robustness of LLMs to various perturbations in the input prompts. The contributions of this paper are three-fold. First, we show that fine-tuning sufficiently large LLMs can produce IC-SF performance comparable to discriminative models. Next, we systematically analyze the performance deterioration of those fine-tuned models due to three distinct yet relevant types of input perturbations: oronyms, synonyms, and paraphrasing. Finally, we propose an efficient mitigation approach, prompt perturbation consistency learning (PPCL), which works by regularizing the divergence between losses from clean and perturbed samples. Our experiments show that PPCL can recover on average 59% and 69% of the performance drop for the IC and SF tasks, respectively. Furthermore, PPCL outperforms the data augmentation approach while using ten times fewer augmented data samples.
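The idea of penalizing disagreement between clean and perturbed inputs can be illustrated with a minimal sketch. The abstract does not specify the exact objective, so everything below is an assumption made for illustration: the consistency term is taken to be the Jensen-Shannon divergence between the two predicted distributions, it is added to the two cross-entropy losses with a hypothetical `weight`, and `recovered_fraction` is one plausible reading of "recovering a share of the performance drop". None of the function names come from the paper.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    # Negative log-likelihood of the gold label.
    return -math.log(probs[label])

def kl(p, q):
    # KL(p || q); terms with p_i == 0 contribute nothing.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    # Symmetric divergence; zero when the two distributions agree.
    mid = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, mid) + 0.5 * kl(q, mid)

def consistency_loss(clean_logits, perturbed_logits, label, weight=1.0):
    # Assumed objective: task loss on both inputs plus a weighted
    # penalty on how far the two predictions drift apart.
    p_clean = softmax(clean_logits)
    p_pert = softmax(perturbed_logits)
    task = cross_entropy(p_clean, label) + cross_entropy(p_pert, label)
    return task + weight * js_divergence(p_clean, p_pert)

def recovered_fraction(clean_score, perturbed_score, mitigated_score):
    # Assumed definition: share of the clean-to-perturbed drop that is
    # won back after mitigation.
    drop = clean_score - perturbed_score
    if drop <= 0:
        raise ValueError("no performance drop to recover")
    return (mitigated_score - perturbed_score) / drop
```

Under this reading, a model scoring 90 on clean prompts, 70 on perturbed prompts, and 80 on perturbed prompts after mitigation would have recovered half of the drop. These numbers are made up for the example and are not results from the paper.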
