Knowledge Infused Decoding

AI-generated keywords: Knowledge Infused Decoding, Generative Language Models, Pre-trained LMs, Natural Language Generation, External Knowledge

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors: Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah
  • Introduces Knowledge Infused Decoding (KID) algorithm for generative language models
  • Addresses limitations of pre-trained LMs in recalling factually correct knowledge within specific contexts
  • KID maintains a local knowledge memory that interacts with a dynamically created external knowledge trie and is continuously updated, via reinforcement learning, as a knowledge-aware constraint on decoding
  • Evaluated on six diverse knowledge-intensive NLG tasks with strong performance in few-shot scenarios
  • Human evaluation confirms that KID generates more relevant and factual language than multiple baseline models
  • Code for implementing KID available on GitHub at https://github.com/microsoft/KID
  • Presented at ICLR 2022 and contributes insights into improving generative LMs through dynamic external knowledge infusion during decoding

Authors: Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah

In ICLR 2022
License: CC BY-NC-ND 4.0

Abstract: Pre-trained language models (LMs) have been shown to memorize a substantial amount of knowledge from the pre-training corpora; however, they are still limited in recalling factually correct knowledge given a certain context. Hence, they tend to suffer from counterfactual or hallucinatory generation when used in knowledge-intensive natural language generation (NLG) tasks. Recent remedies to this problem focus on modifying either the pre-training or task fine-tuning objectives to incorporate knowledge, which normally require additional costly training or architecture modification of LMs for practical applications. We present Knowledge Infused Decoding (KID) -- a novel decoding algorithm for generative LMs, which dynamically infuses external knowledge into each step of the LM decoding. Specifically, we maintain a local knowledge memory based on the current context, interacting with a dynamically created external knowledge trie, and continuously update the local memory as a knowledge-aware constraint to guide decoding via reinforcement learning. On six diverse knowledge-intensive NLG tasks, task-agnostic LMs (e.g., GPT-2 and BART) armed with KID outperform many task-optimized state-of-the-art models, and show particularly strong performance in few-shot scenarios over seven related knowledge-infusion techniques. Human evaluation confirms KID's ability to generate more relevant and factual language for the input context when compared with multiple baselines. Finally, KID also alleviates exposure bias and provides stable generation quality when generating longer sequences. Code for KID is available at https://github.com/microsoft/KID.
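
The abstract describes a local memory that queries an external knowledge trie and uses the result to constrain each decoding step. The following is a minimal sketch of that general idea only, not the authors' implementation: the `KnowledgeTrie` class, the `biased_greedy_step` function, the toy triples, and the scores are all hypothetical. Where the paper updates the constraint via reinforcement learning, this sketch substitutes a fixed additive bonus on suggested tokens to stay short.

```python
from collections import deque


class KnowledgeTrie:
    """Toy prefix tree over token sequences (hypothetical, for illustration)."""

    def __init__(self):
        self.root = {}

    def insert(self, tokens):
        # Walk down the tree, creating child nodes as needed.
        node = self.root
        for tok in tokens:
            node = node.setdefault(tok, {})

    def next_tokens(self, prefix):
        """Return the set of tokens that can follow `prefix` in the trie."""
        node = self.root
        for tok in prefix:
            if tok not in node:
                return set()
            node = node[tok]
        return set(node)


def biased_greedy_step(scores, memory, trie, bonus=1.0):
    """Pick the next token after boosting tokens the trie suggests.

    scores: dict mapping token -> LM score (a stand-in for logits)
    memory: recent context tokens (stand-in for a local knowledge memory)
    bonus:  fixed additive boost (assumption; the paper uses RL instead)
    """
    suggested = set()
    mem = list(memory)
    # Query the trie with every suffix of the memory.
    for start in range(len(mem)):
        suggested |= trie.next_tokens(mem[start:])
    adjusted = {
        tok: score + (bonus if tok in suggested else 0.0)
        for tok, score in scores.items()
    }
    return max(adjusted, key=adjusted.get)


# Toy knowledge, stored as (subject, relation, object) token sequences.
trie = KnowledgeTrie()
trie.insert(["paris", "capital_of", "france"])
trie.insert(["paris", "located_in", "europe"])

memory = deque(["paris", "capital_of"], maxlen=2)
scores = {"france": 0.4, "texas": 0.9, "europe": 0.3}
next_token = biased_greedy_step(scores, memory, trie, bonus=1.0)
```

In this toy run the unconstrained scores favor `texas`, while the trie lookup on the memory boosts `france` enough to be selected. A learned, context-dependent constraint such as the one the abstract describes would replace the fixed `bonus`.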

Submitted to arXiv on 06 Apr. 2022

Results of the summarizing process for the arXiv paper: 2204.03084v1

In their paper "Knowledge Infused Decoding," authors Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, and Ahmed Hassan Awadallah introduce Knowledge Infused Decoding (KID), a novel decoding algorithm for generative language models (LMs). The study addresses the limited ability of pre-trained LMs to recall factually correct knowledge in a given context, which often leads to counterfactual or hallucinatory generation in knowledge-intensive natural language generation (NLG) tasks. Existing remedies typically modify the pre-training or task fine-tuning objectives to incorporate knowledge, which requires additional costly training or architectural changes.

KID instead infuses external knowledge into each decoding step by maintaining a local knowledge memory based on the current context. This memory interacts with a dynamically created external knowledge trie and is continuously updated as a knowledge-aware constraint that guides decoding via reinforcement learning. KID was evaluated on six diverse knowledge-intensive NLG tasks. In these tasks, task-agnostic LMs such as GPT-2 and BART equipped with KID outperformed many task-optimized state-of-the-art models. Particularly strong performance was observed in few-shot scenarios compared to seven related knowledge-infusion techniques. Human evaluation confirmed that KID generates more relevant and factual language for the input context than multiple baseline models. Additionally, KID alleviates exposure bias and provides stable generation quality when generating longer sequences. The code for KID is openly available on GitHub at https://github.com/microsoft/KID. This research was presented at ICLR 2022.
Created on 23 Oct. 2024
