Towards Coherent and Engaging Spoken Dialog Response Generation Using Automatic Conversation Evaluators

AI-generated keywords: Dialog System Coherence Engagement Reranking Loss Function

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Existing end-to-end open domain dialog systems trained with maximum likelihood objective have limitations
  • Lack of generalizability and generic response problem are common issues in these systems
  • Authors propose a system that evaluates chatbot responses for coherence and engagement at each turn
  • Turn-level dialog quality feedback is highly correlated with human evaluation
  • Feedback is used to mitigate problems and improve the overall quality of the dialog system
  • Two mechanisms presented: reranking and direct modification of loss function during training
  • Studies show that incorporating explicit feedback improves response generation models
  • Models incorporating both reranking and direct modification produce more engaging and coherent responses compared to traditional models
  • Improvement observed through automatic evaluation metrics and human evaluation
  • Proposed approach enhances end-to-end open domain dialog systems by incorporating explicit feedback on coherence and engagement at each turn
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Sanghyun Yi, Rahul Goel, Chandra Khatri, Tagyoung Chung, Behnam Hedayatnia, Anu Venkatesh, Raefer Gabriel, Dilek Hakkani-Tur

Abstract: Encoder-decoder based neural architectures serve as the basis of state-of-the-art approaches in end-to-end open domain dialog systems. Since most of such systems are trained with a maximum likelihood(MLE) objective they suffer from issues such as lack of generalizability and the generic response problem, i.e., a system response that can be an answer to a large number of user utterances, e.g., "Maybe, I don't know." Having explicit feedback on the relevance and interestingness of a system response at each turn can be a useful signal for mitigating such issues and improving system quality by selecting responses from different approaches. Towards this goal, we present a system that evaluates chatbot responses at each dialog turn for coherence and engagement. Our system provides explicit turn-level dialog quality feedback, which we show to be highly correlated with human evaluation. To show that incorporating this feedback in the neural response generation models improves dialog quality, we present two different and complementary mechanisms to incorporate explicit feedback into a neural response generation model: reranking and direct modification of the loss function during training. Our studies show that a response generation model that incorporates these combined feedback mechanisms produce more engaging and coherent responses in an open-domain spoken dialog setting, significantly improving the response quality using both automatic and human evaluation.

Submitted to arXiv on 30 Apr. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1904.13015v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper titled "Towards Coherent and Engaging Spoken Dialog Response Generation Using Automatic Conversation Evaluators" discusses the limitations of existing end-to-end open domain dialog systems, which are primarily trained with a maximum likelihood objective. These systems often suffer from lack of generalizability and the generic response problem, where the system generates responses that can be applied to a wide range of user utterances without providing specific or engaging answers. To address these issues, the authors propose a system that evaluates chatbot responses at each turn for coherence and engagement. This system provides explicit turn-level dialog quality feedback, which has been found to be highly correlated with human evaluation. The goal is to use this feedback as a signal for mitigating the aforementioned problems and improving the overall quality of the dialog system by selecting responses from different approaches. To demonstrate the effectiveness of incorporating explicit feedback into neural response generation models, two mechanisms are presented: reranking and direct modification of the loss function during training. These mechanisms aim to improve dialog quality by incorporating the feedback into the model's decision-making process. The authors conducted studies to evaluate their proposed approach in an open-domain spoken dialog setting. The results show that response generation models that incorporate both reranking and direct modification produce more engaging and coherent responses compared to traditional models trained with maximum likelihood objective alone. This improvement is observed through both automatic evaluation metrics and human evaluation. Overall, this paper presents a novel approach to enhance end-to-end open domain dialog systems by incorporating explicit feedback on coherence and engagement at each turn. The proposed mechanisms effectively improve response quality, addressing issues related to lack of generalizability and generic responses commonly seen in such systems.
Created on 26 Dec. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.