LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback
AI-generated Key Points
- Novel bootstrapping framework using self-generated feedback to improve reasoning abilities of Large Language Models (LLMs) in detecting betrayal and deception in Diplomacy games
- Framework comprises three stages: suggestion, feedback collection, and modification
- LLM-generated feedback showed superior quality and significantly enhanced model's ability to detect lies, achieving a 39% improvement in lying-F1 score without additional training data
- LLM-generated feedback was longer and more informative compared to human feedback, outperforming human feedback by 29% in lying-F1 score while being more cost-effective
- Additional human study conducted to identify common errors made by LLMs like GPT-3 and consider ethics considerations
- Leveraging LLM-generated feedback can enhance model performance and offer an economical alternative for improving lie detection capabilities in natural language tasks
Authors: Tanushree Banerjee, Richard Zhu, Runzhe Yang, Karthik Narasimhan
Abstract: Large Language Models (LLMs) excel at generating human-like dialogues and comprehending text. However, understanding the subtleties of complex exchanges in language remains a challenge. We propose a bootstrapping framework that leverages self-generated feedback to enhance LLM reasoning capabilities for lie detection. The framework consists of three stages: suggestion, feedback collection, and modification. In the suggestion stage, a cost-effective language model generates initial predictions based on game state and dialogue. The feedback-collection stage involves a language model providing feedback on these predictions. In the modification stage, a more advanced language model refines the initial predictions using the auto-generated feedback. We investigate the application of the proposed framework for detecting betrayal and deception in Diplomacy games, and compare it with feedback from professional human players. The LLM-generated feedback exhibits superior quality and significantly enhances the performance of the model. Our approach achieves a 39% improvement over the zero-shot baseline in lying-F1 without the need for any training data, rivaling state-of-the-art supervised learning results.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.