Fine-grained Hallucination Detection and Editing for Language Models

AI-generated keywords: Hallucination Detection

AI-generated Key Points

  • Authors address the issue of hallucinations in large language models (LMs) that generate factually incorrect statements
  • Introduce a novel task of automatic fine-grained hallucination detection to address the issue
  • Present a comprehensive taxonomy categorizing hallucinations into six hierarchically defined types for more nuanced error detection and correction
  • Propose FAVA (Fine-grained Automatic VAlidation model) to detect and correct fine-grained hallucinations, outperforming ChatGPT in both automatic and human evaluations
  • Research contributes to improving factuality in LM-generated text by enhancing FActScores by 5-10%
  • Offer a detailed and comprehensive method for detecting and correcting hallucinations in language models at finer levels of granularity
  • Use carefully designed synthetic data for accurate evaluation and improvement of fine-grained hallucination detection methods
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi

License: CC BY 4.0

Abstract: Large language models (LMs) are prone to generate diverse factually incorrect statements, which are widely called hallucinations. Current approaches predominantly focus on coarse-grained automatic hallucination detection or editing, overlooking nuanced error levels. In this paper, we propose a novel task -- automatic fine-grained hallucination detection -- and present a comprehensive taxonomy encompassing six hierarchically defined types of hallucination. To facilitate evaluation, we introduce a new benchmark that includes fine-grained human judgments on two LM outputs across various domains. Our analysis reveals that ChatGPT and Llama 2-Chat exhibit hallucinations in 60% and 75% of their outputs, respectively, and a majority of these hallucinations fall into categories that have been underexplored. As an initial step to address this, we train FAVA, a retrieval-augmented LM by carefully designing synthetic data generations to detect and correct fine-grained hallucinations. On our benchmark, our automatic and human evaluations show that FAVA significantly outperforms ChatGPT on fine-grained hallucination detection by a large margin though a large room for future improvement still exists. FAVA's suggested edits also improve the factuality of LM-generated text, resulting in 5-10% FActScore improvements.

Submitted to arXiv on 12 Jan. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2401.06855v1

In their paper "Fine-grained Hallucination Detection and Editing for Language Models," Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, and Hannaneh Hajishirzi address the issue of hallucinations in large language models (LMs). These LMs often generate factually incorrect statements, known as hallucinations, which can hinder their practical use. <br><br> The authors introduce a novel task of automatic fine-grained hallucination detection to address the issue of hallucinations in large language models.<br><br> While existing approaches focus on coarse-grained automatic hallucination detection or editing, this paper presents a comprehensive taxonomy that categorizes hallucinations into six hierarchically defined types. This allows for a more nuanced approach to detecting and correcting errors at finer levels of granularity. <br><br> To evaluate the effectiveness of their approach, the researchers introduce a new benchmark that includes fine-grained human judgments on outputs from two LMs across various domains. Their analysis reveals that popular models like ChatGPT and Llama 2-Chat exhibit hallucinations in a significant percentage of their outputs. Moreover, many of these hallucinations fall into categories that have not been extensively explored. <br><br> In response to these findings, the authors propose FAVA (<b>Fine-grained Automatic VAlidation model</b>), a retrieval-augmented LM trained to detect and correct fine-grained hallucinations. Through carefully designed synthetic data generations, FAVA significantly outperforms ChatGPT in fine-grained hallucination detection according to both automatic and human evaluations. <br><br> This research contributes to addressing the challenge of hallucinations in language generation models by introducing a more nuanced approach to detecting and correcting errors at finer levels of granularity.<br><br> While there is room for further improvement, FAVA's suggested edits also enhance the factuality of LM-generated text by improving FActScores by 5-10%. <br><br> The authors' approach offers a more detailed and comprehensive method for detecting and correcting hallucinations in language models, leading to improved factuality in generated text.<br><br> Through their proposed model, FAVA, the authors not only address the issue of hallucinations but also improve the overall factuality of language model-generated text.<br><br> The use of carefully designed synthetic data allows for more accurate evaluation and improvement of fine-grained hallucination detection methods. Overall, this research contributes to addressing the challenge of hallucinations in language generation models by introducing a more nuanced approach to detecting and correcting errors at finer levels of granularity.
Created on 12 Sep. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.