Publication:
Evaluating for Diversity in Question Generation over Text
M. Schlichtkrull, Weiwei Cheng • @arXiv • 17 August 2020
TLDR: It is shown through automatic and human evaluation that the proposed variational encoder-decoder model improves diversity without loss of quality, and how the evaluation scheme reflects this improvement.
Citations: 5
Abstract: Generating diverse and relevant questions over text is a task with widespread applications. We argue that commonly-used evaluation metrics such as BLEU and METEOR are not suitable for this task due to the inherent diversity of reference questions, and propose a scheme for extending conventional metrics to reflect diversity. We furthermore propose a variational encoder-decoder model for this task. We show through automatic and human evaluation that our variational model improves diversity without loss of quality, and demonstrate how our evaluation scheme reflects this improvement.
Related Fields of Study
loading
Citations
Sort by
Previous
Next
Showing results 1 to 0 of 0
Previous
Next
References
Sort by
Previous
Next
Showing results 1 to 0 of 0
Previous
Next