Learning From Free-Text Human Feedback -- Collect New Datasets Or Extend Existing Ones?

AI-generated keywords: Dialog Systems Free-Text Human Feedback Synthetic Dialog Generation Annotated Datasets Conversational AI

AI-generated Key Points

  • Researchers Dominic Petrak, Nafise Sadat Moosavi, Ye Tian, Nikolai Rozanov, and Iryna Gurevych focus on learning from free-text human feedback for dialog systems.
  • They address the scarcity of annotated data in conversational AI by using synthetic dialog generation to augment existing datasets with necessary annotations.
  • The authors investigate dialog datasets such as MultiWoZ, SGD, BABI, PersonaChat, Wizards-of-Wikipedia, and the human-bot split of the Self-Feeding Chatbot to assess their approach's feasibility.
  • Through observations and analysis of free-text human feedback in dialogs, they develop new taxonomies for annotating this type of data and examine its impact on response generation for language generation models like GPT-2, LLAMA, and Flan-T5.
  • The research identifies error types, user response types, and their relationships within these datasets.
  • Accepted for presentation at EMNLP 2023, this research has the potential to enhance conversational AI technology significantly by improving existing datasets with free-text human feedback annotations.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dominic Petrak, Nafise Sadat Moosavi, Ye Tian, Nikolai Rozanov, Iryna Gurevych

Accepted to be presented at EMNLP 2023
License: CC BY-NC-SA 4.0

Abstract: Learning from free-text human feedback is essential for dialog systems, but annotated data is scarce and usually covers only a small fraction of error types known in conversational AI. Instead of collecting and annotating new datasets from scratch, recent advances in synthetic dialog generation could be used to augment existing dialog datasets with the necessary annotations. However, to assess the feasibility of such an effort, it is important to know the types and frequency of free-text human feedback included in these datasets. In this work, we investigate this question for a variety of commonly used dialog datasets, including MultiWoZ, SGD, BABI, PersonaChat, Wizards-of-Wikipedia, and the human-bot split of the Self-Feeding Chatbot. Using our observations, we derive new taxonomies for the annotation of free-text human feedback in dialogs and investigate the impact of including such data in response generation for three SOTA language generation models, including GPT-2, LLAMA, and Flan-T5. Our findings provide new insights into the composition of the datasets examined, including error types, user response types, and the relations between them.

Submitted to arXiv on 24 Oct. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.15758v1

Dominic Petrak, Nafise Sadat Moosavi, Ye Tian, Nikolai Rozanov, and Iryna Gurevych explore the importance of learning from free-text human feedback for dialog systems. Their research focuses on addressing the scarcity of annotated data in conversational AI and proposes a solution using synthetic dialog generation to augment existing datasets with necessary annotations. The authors investigate commonly used dialog datasets such as MultiWoZ, SGD, BABI, PersonaChat, Wizards-of-Wikipedia, and the human-bot split of the Self-Feeding Chatbot to assess the feasibility of their approach. Through their observations and analysis of free-text human feedback in dialogs, they develop new taxonomies for annotating this type of data and examine its impact on response generation for state-of-the-art language generation models like GPT-2, LLAMA, and Flan-T5. This work provides valuable insights into the composition of these datasets by identifying error types, user response types, and their relationships. Accepted for presentation at EMNLP 2023, this research has the potential to significantly contribute to advancements in conversational AI technology through enhancing existing datasets with free-text human feedback annotations.
Created on 26 Nov. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.