Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying

AI-generated keywords: Large Language Models Logical and Mathematical Reasoning Argumentation Theory Critical Questions Toulmin's Model

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Federico Castagna, Isabel Sassoon, and Simon Parsons address the challenge faced by Large Language Models (LLMs) in logical and mathematical reasoning tasks despite advancements in AI research.
  • LLMs excel at identifying data patterns but struggle with generalizing and solving reasoning problems beyond their training data.
  • The authors propose leveraging critical questions from argumentation theory, specifically drawing on Toulmin's model of argumentation, to enhance LLMs' ability to identify logical errors and improve performance.
  • Ensuring that conclusions drawn by LLMs are valid based on accepted premises is emphasized to mirror sound argumentative procedures.
  • The proposed approach involves guiding models through a reasoning pipeline to assess and correct logical mistakes before generating responses to user prompts.
  • The study demonstrates improved performance compared to baseline models and Chain-of-Thought (CoT) implementations through the integration of critical questioning techniques.
  • Extensive evaluation across various LLMs using MT-Bench Reasoning and Math tasks validates the effectiveness of integrating critical questioning techniques into the reasoning process of LLMs.
  • This study presents a promising strategy for enhancing the reasoning capabilities of advanced language models in AI research.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Federico Castagna, Isabel Sassoon, Simon Parsons

License: CC BY-NC-ND 4.0

Abstract: Studies have underscored how, regardless of the recent breakthrough and swift advances in AI research, even state-of-the-art Large Language models (LLMs) continue to struggle when performing logical and mathematical reasoning. The results seem to suggest that LLMs still work as (highly advanced) data pattern identifiers, scoring poorly when attempting to generalise and solve reasoning problems the models have never previously seen or that are not close to samples presented in their training data. To address this compelling concern, this paper makes use of the notion of critical questions from the literature on argumentation theory, focusing in particular on Toulmin's model of argumentation. We show that employing these critical questions can improve the reasoning capabilities of LLMs. By probing the rationale behind the models' reasoning process, the LLM can assess whether some logical mistake is occurring and correct it before providing the final reply to the user prompt. The underlying idea is drawn from the gold standard of any valid argumentative procedure: the conclusion is valid if it is entailed by accepted premises. Or, to paraphrase such Aristotelian principle in a real-world approximation, characterised by incomplete information and presumptive logic, the conclusion is valid if not proved otherwise. This approach successfully steers the models' output through a reasoning pipeline, resulting in better performance against the baseline and its Chain-of-Thought (CoT) implementation. To this end, an extensive evaluation of the proposed approach on the MT-Bench Reasoning and Math tasks across a range of LLMs is provided.

Submitted to arXiv on 19 Dec. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2412.15177v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying," authors Federico Castagna, Isabel Sassoon, and Simon Parsons address the persistent challenge faced by Large Language Models (LLMs) in logical and mathematical reasoning tasks despite significant advancements in AI research. They highlight that LLMs excel as data pattern identifiers but struggle when tasked with generalizing and solving reasoning problems beyond their training data. To tackle this issue, the authors propose leveraging critical questions from argumentation theory, specifically drawing on Toulmin's model of argumentation. By incorporating these critical questions into the reasoning process of LLMs, they aim to enhance their ability to identify logical errors and improve overall performance. The authors emphasize the importance of ensuring that conclusions drawn by LLMs are valid based on accepted premises, mirroring the principles of sound argumentative procedures. This approach involves guiding the models through a reasoning pipeline where they can assess and correct any logical mistakes before generating responses to user prompts. Through this method, the authors demonstrate improved performance compared to baseline models and Chain-of-Thought (CoT) implementations. To validate their proposed approach, an extensive evaluation is conducted across various LLMs using MT-Bench Reasoning and Math tasks. The results showcase the effectiveness of integrating critical questioning techniques into the reasoning process of LLMs, ultimately enhancing their ability to tackle complex logical and mathematical challenges. Overall, this study sheds light on a promising strategy for bolstering the reasoning capabilities of advanced language models in AI research.
Created on 20 Dec. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.