ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis

AI-generated keywords: ProAI

AI-generated Key Points

  • ProAI is a goal-oriented and proactive conversational AI framework designed to enhance diagnostic capabilities
  • It takes a proactive approach by asking relevant questions and steering conversations towards specific objectives
  • Simulation of patient interactions using an LLM agent with clinically informed symptomatology and behavior
  • Evaluation of diagnostic accuracy using Critical Node Recall (CN-Recall) and Differential Diagnosis Accuracy (DDx-ACC) metrics
  • User experience evaluation based on Helpfulness and Empathy dimensions
  • Doctor evaluation based on Specialty and Precision metrics related to clinical quality, coherence, adherence to guidelines, accuracy, and specificity of diagnoses
  • Future considerations include expanding evaluation to include a broader range of psychiatric disorders, conducting clinical trials with real patients, and automating knowledge graph construction for different diagnostic domains

Authors: Yuqi Wu, Guangya Wan, Jingjing Li, Shengming Zhao, Lingfeng Ma, Tianyi Ye, Ion Pop, Yanbo Zhang, Jie Chen

21 pages, 8 figures
License: CC BY-NC-SA 4.0

Abstract: Most LLM-driven conversational AI systems operate reactively, responding to user prompts without guiding the interaction. However, many real-world applications, such as psychiatric diagnosis, consulting, and interviews, require AI to take a proactive role, asking the right questions and steering conversations toward specific objectives. Using mental health differential diagnosis as an application context, we introduce ProAI, a goal-oriented, proactive conversational AI framework. ProAI integrates structured knowledge-guided memory, multi-agent proactive reasoning, and a multi-faceted evaluation strategy, enabling LLMs to engage in clinician-style diagnostic reasoning rather than simple response generation. Through simulated patient interactions, user experience assessment, and professional clinical validation, we demonstrate that ProAI achieves up to 83.3% accuracy in mental disorder differential diagnosis while maintaining professional and empathetic interaction standards. These results highlight the potential for more reliable, adaptive, and goal-driven AI diagnostic assistants, advancing LLMs beyond reactive dialogue systems.

Submitted to arXiv on 28 Feb. 2025

Results of the summarizing process for the arXiv paper: 2502.20689v1

In this study, we introduce ProAI, a goal-oriented and proactive conversational AI framework designed to enhance the diagnostic capabilities of LLM-driven conversational systems. Most existing conversational AI systems operate reactively, responding to user prompts without actively guiding the interaction. ProAI instead takes a proactive approach, asking relevant questions and steering conversations toward specific objectives.

Drawing inspiration from Wu et al. (2023, 2024), we simulate patient interactions using an LLM agent that embodies specific mental disorders with clinically informed symptomatology and behavior. Through multi-round conversations, ProAI engages in diagnostic reasoning to identify the patient's condition. Diagnostic accuracy is evaluated with two metrics: Critical Node Recall (CN-Recall), which measures how thoroughly the system covers the essential criteria nodes, and Differential Diagnosis Accuracy (DDx-ACC), which measures its ability to reach the correct diagnostic conclusion while ruling out alternative conditions.

User experience evaluation is crucial for ensuring a positive patient experience during clinical interviews. We measure two key dimensions: Helpfulness, which reflects the effectiveness of the agent's medical consultation, and Empathy, which reflects its ability to demonstrate understanding and build rapport with patients. Additionally, doctor evaluation checks that diagnostic decisions rest on rigorous medical reasoning, using Specialty and Precision metrics that cover clinical quality, coherence, adherence to guidelines, and the accuracy and specificity of diagnoses.

While ProAI demonstrates strong performance in mental health differential diagnosis, several limitations need consideration. Future work should expand the evaluation to a broader range of psychiatric disorders and to tasks beyond mental health diagnosis. Clinical trials with real patients would further validate the system's practical utility, and automating knowledge graph construction for different diagnostic domains could streamline system development.

Overall, our study highlights the potential for more reliable, adaptive, and goal-driven AI diagnostic assistants that advance LLMs beyond reactive dialogue systems. Combining different LLMs in a hybrid configuration such as "Two Agents Mixed" can achieve a better balance between accuracy and user experience in clinical AI systems through thoughtful design and strategic model selection.
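
The summary names CN-Recall and DDx-ACC but does not give their formulas. As a rough illustration only, the sketch below assumes CN-Recall is the share of required critical criteria nodes that the interview actually covered, and DDx-ACC is the share of simulated cases whose final diagnosis matches the ground-truth label. The function names, node labels, and diagnosis labels are hypothetical and are not taken from the paper.

```python
from typing import Iterable, Sequence


def cn_recall(covered_nodes: Iterable[str], critical_nodes: Iterable[str]) -> float:
    """Assumed definition: fraction of critical criteria nodes covered in the interview."""
    critical = set(critical_nodes)
    if not critical:
        raise ValueError("critical_nodes must not be empty")
    # Nodes covered that are not critical do not affect the score.
    return len(critical & set(covered_nodes)) / len(critical)


def ddx_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Assumed definition: fraction of cases where the predicted diagnosis is correct."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("at least one case is required")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical example: one interview and three simulated cases.
required = ["low_mood", "anhedonia", "sleep_change", "duration_two_weeks"]
asked = ["low_mood", "sleep_change", "appetite_change"]
recall = cn_recall(asked, required)

predictions = ["MDD", "GAD", "MDD"]
ground_truth = ["MDD", "GAD", "Bipolar"]
accuracy = ddx_accuracy(predictions, ground_truth)

print(f"CN-Recall: {recall:.2f}")
print(f"DDx-ACC: {accuracy:.2f}")
```

Under these assumed definitions, an agent can score well on one metric and poorly on the other: it may cover every critical node yet still pick the wrong diagnosis, which is one reason to report both.
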
Created on 13 Apr. 2025
