Low-Resource Adaptation of Open-Domain Generative Chatbots

AI-generated keywords: Chatbot Model Framework Conversational Abilities Evaluation

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Increasing the size of chatbot models improves performance
  • Limitations in latency and connectivity when deploying digital assistants on devices
  • Need to reduce the size of chatbot models for compatibility with user devices
  • Low-parameter models can retain conversational abilities while improving in a specific domain
  • Proposed generic framework addresses various question types, tracks references, and eliminates inconsistent and toxic responses
  • Framework seamlessly transitions between casual chatting and performing transactional tasks
  • Evaluation of framework using automatic (Perplexity) and human (SSA - Sensibleness and Specificity Average) metrics
  • Achieved comparable performance while reducing model parameters by 90%
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Greyson Gerhard-Young, Raviteja Anantha, Srinivas Chappidi, Björn Hoffmeister

Preview draft

Abstract: Recent work building open-domain chatbots has demonstrated that increasing model size improves performance. On the other hand, latency and connectivity considerations dictate the move of digital assistants on the device. Giving a digital assistant like Siri, Alexa, or Google Assistant the ability to discuss just about anything leads to the need for reducing the chatbot model size such that it fits on the user's device. We demonstrate that low parameter models can simultaneously retain their general knowledge conversational abilities while improving in a specific domain. Additionally, we propose a generic framework that accounts for variety in question types, tracks reference throughout multi-turn conversations, and removes inconsistent and potentially toxic responses. Our framework seamlessly transitions between chatting and performing transactional tasks, which will ultimately make interactions with digital assistants more human-like. We evaluate our framework on 1 internal and 4 public benchmark datasets using both automatic (Perplexity) and human (SSA - Sensibleness and Specificity Average) evaluation metrics and establish comparable performance while reducing model parameters by 90%.

Submitted to arXiv on 13 Aug. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2108.06329v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In recent work on open-domain chatbots, it has been shown that increasing the size of the model improves its performance. However, there are limitations in terms of latency and connectivity when it comes to deploying digital assistants on devices. This necessitates reducing the size of chatbot models so that they can fit on user devices like Siri, Alexa, or Google Assistant. In this study, the authors demonstrate that low-parameter models can retain their conversational abilities while also improving in a specific domain. They propose a generic framework that addresses various question types, tracks references in multi-turn conversations and eliminates inconsistent and potentially toxic responses. The framework seamlessly transitions between casual chatting and performing transactional tasks making interactions with digital assistants more human-like. The authors evaluate their framework on both internal and public benchmark datasets using automatic (Perplexity) and human (SSA - Sensibleness and Specificity Average) evaluation metrics. They achieve comparable performance while reducing model parameters by 90%.
Created on 26 Dec. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.