AVA: an automatic eValuation approach for question answering systems

Thuy Vu; Alessandro Moschitti

2021

We introduce AVA, an automatic evaluation approach for Question Answering which, given a set of questions associated with gold-standard answers (references), can estimate system Accuracy. AVA uses Transformer-based language models to encode the question, answer, and reference texts. This makes it possible to assess answer correctness effectively by measuring the similarity between the reference and an automatic answer, biased towards the question semantics. To design, train, and test AVA, we built multiple large training, development, and test sets on public and industrial benchmarks. Our solutions achieve up to 74.7% F1 score in predicting human judgment for single answers. Additionally, AVA can be used to evaluate overall system Accuracy with an error lower than 7% at 95% confidence when measured on several QA systems.
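The abstract reports two kinds of figures: an F1 score for predicting human judgment on single answers, and an error on overall system Accuracy. The sketch below illustrates how such quantities could be computed from per-answer correct/incorrect labels. It is a minimal illustration, not the authors' code: the function names (`f1_score`, `system_accuracy`, `accuracy_error`) and the toy labels are hypothetical, and the paper's confidence-interval procedure is not reproduced here.

```python
from typing import Sequence


def f1_score(human: Sequence[bool], predicted: Sequence[bool]) -> float:
    """F1 of the evaluator's 'correct' predictions against human judgments."""
    tp = sum(h and p for h, p in zip(human, predicted))
    fp = sum((not h) and p for h, p in zip(human, predicted))
    fn = sum(h and (not p) for h, p in zip(human, predicted))
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def system_accuracy(judgments: Sequence[bool]) -> float:
    """Fraction of questions whose answer was judged correct."""
    if not judgments:
        raise ValueError("need at least one judgment")
    return sum(judgments) / len(judgments)


def accuracy_error(human: Sequence[bool], predicted: Sequence[bool]) -> float:
    """Absolute gap between human-measured and automatically estimated Accuracy."""
    return abs(system_accuracy(human) - system_accuracy(predicted))


# Toy labels (hypothetical): one entry per question answered by a QA system.
human_labels = [True, True, True, False]
evaluator_labels = [True, True, False, False]

single_answer_f1 = f1_score(human_labels, evaluator_labels)
system_level_gap = accuracy_error(human_labels, evaluator_labels)
```

Note that the two measures can diverge: an evaluator may mislabel individual answers yet still estimate system-level Accuracy closely if its false positives and false negatives roughly cancel out.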
