Uniform training and marginal decoding for multi-reference question-answer generation
2023
Question generation is an important task that helps to improve question answering performance and augment search interfaces with suggested questions. While multiple approaches have been proposed for this task, none addresses the goal of generating a diverse set of questions given the same input context. The main reason for this is the lack of multi-reference datasets for training such models. We propose to bridge this gap by seeding a baseline question generation model with named entities as candidate answers, which allows us to automatically synthesize an unlimited number of question-answer pairs. We then propose an approach designed to leverage such multi-reference annotations and demonstrate its advantages over the standard training and decoding strategies used in question generation. An experimental evaluation on both synthetic and manually annotated data shows that our approach can be used to create a single generative model that produces a diverse set of question-answer pairs per input sentence.
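As a rough illustration of the data-synthesis step described in the abstract, the sketch below extracts named entities from a sentence with spaCy and prompts an off-the-shelf answer-aware question generation model once per entity, yielding one question-answer pair per entity. The checkpoint name, the <hl> highlight convention, and the "generate question:" prompt prefix are assumptions tied to that particular public model, not the paper's own system or training setup.

```python
# Minimal sketch: named entities as candidate answers, one generated
# question per entity. Requires `pip install spacy transformers torch`
# and `python -m spacy download en_core_web_sm`.
import spacy
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Assumed answer-aware QG checkpoint; any answer-conditioned
# seq2seq QG model with a known input format would do.
QG_MODEL = "valhalla/t5-base-qg-hl"

nlp = spacy.load("en_core_web_sm")
tokenizer = AutoTokenizer.from_pretrained(QG_MODEL)
model = AutoModelForSeq2SeqLM.from_pretrained(QG_MODEL)

def synthesize_qa_pairs(sentence: str) -> list[tuple[str, str]]:
    """Return (question, answer) pairs, one per named entity in `sentence`."""
    pairs = []
    for ent in nlp(sentence).ents:
        # Mark the candidate answer span; the <hl> tokens and prompt
        # prefix follow the convention of the assumed checkpoint above.
        highlighted = (
            sentence[: ent.start_char]
            + "<hl> " + ent.text + " <hl>"
            + sentence[ent.end_char :]
        )
        inputs = tokenizer("generate question: " + highlighted,
                           return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=48)
        question = tokenizer.decode(output_ids[0], skip_special_tokens=True)
        pairs.append((question, ent.text))
    return pairs

print(synthesize_qa_pairs(
    "Amazon was founded by Jeff Bezos in Seattle in 1994."))
```

Because every sentence with several named entities yields several distinct question-answer pairs, running this over a large corpus produces the kind of multi-reference training data the abstract describes; the paper's uniform training and marginal decoding strategies for exploiting such data are not shown here.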