ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis

AI-generated keywords: ProAI

AI-generated Key Points

ProAI is a goal-oriented and proactive conversational AI framework designed to enhance diagnostic capabilities
It takes a proactive approach by asking relevant questions and steering conversations towards specific objectives
Simulation of patient interactions using an LLM agent with clinically informed symptomatology and behavior
Evaluation of diagnostic accuracy using Critical Node Recall (CN-Recall) and Differential Diagnosis Accuracy (DDx-ACC) metrics
User experience evaluation based on Helpfulness and Empathy dimensions
Doctor evaluation based on Specialty and Precision metrics related to clinical quality, coherence, adherence to guidelines, accuracy, and specificity of diagnoses
Future considerations include expanding evaluation to include a broader range of psychiatric disorders, conducting clinical trials with real patients, and automating knowledge graph construction for different diagnostic domains

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yuqi Wu, Guangya Wan, Jingjing Li, Shengming Zhao, Lingfeng Ma, Tianyi Ye, Ion Pop, Yanbo Zhang, Jie Chen

arXiv: 2502.20689v1 - DOI (cs.AI)

21 pages, 8 figures

License: CC BY-NC-SA 4.0

Abstract: Most LLM-driven conversational AI systems operate reactively, responding to user prompts without guiding the interaction. Most LLM-driven conversational AI systems operate reactively, responding to user prompts without guiding the interaction. However, many real-world applications-such as psychiatric diagnosis, consulting, and interviews-require AI to take a proactive role, asking the right questions and steering conversations toward specific objectives. Using mental health differential diagnosis as an application context, we introduce ProAI, a goal-oriented, proactive conversational AI framework. ProAI integrates structured knowledge-guided memory, multi-agent proactive reasoning, and a multi-faceted evaluation strategy, enabling LLMs to engage in clinician-style diagnostic reasoning rather than simple response generation. Through simulated patient interactions, user experience assessment, and professional clinical validation, we demonstrate that ProAI achieves up to 83.3% accuracy in mental disorder differential diagnosis while maintaining professional and empathetic interaction standards. These results highlight the potential for more reliable, adaptive, and goal-driven AI diagnostic assistants, advancing LLMs beyond reactive dialogue systems.

Submitted to arXiv on 28 Feb. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2502.20689v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

, , , , In this study, we introduce ProAI, a goal-oriented and proactive conversational AI framework designed to enhance the diagnostic capabilities of language model-driven conversational systems. Most existing conversational AI systems operate reactively, responding to user prompts without actively guiding the interaction. However, ProAI takes a proactive approach by asking relevant questions and steering conversations towards specific objectives. Drawing inspiration from Wu et al. (2023, 2024), we simulate patient interactions using an LLM agent that embodies specific mental disorders with clinically informed symptomatology and behavior. Through multi-round conversations, ProAI engages in diagnostic reasoning to accurately identify the patient's condition. The system's diagnostic accuracy is evaluated using Critical Node Recall (CN-Recall) and Differential Diagnosis Accuracy (DDx-ACC) metrics, which assess its thoroughness in assessing essential criteria nodes and ability to reach correct diagnostic conclusions while ruling out alternative conditions. User experience evaluation is crucial for ensuring a positive patient experience during clinical interviews. We measure two key dimensions - Helpfulness and Empathy - to evaluate the effectiveness of the agent's medical consultation and its ability to demonstrate understanding and build rapport with patients. Additionally, doctor evaluation ensures that diagnostic decisions are based on rigorous medical reasoning by assessing Specialty and Precision metrics related to clinical quality, coherence, adherence to guidelines, accuracy, and specificity of diagnoses. While ProAI demonstrates strong performance in mental health differential diagnosis, several limitations need consideration. Future work should expand evaluation to include a broader range of psychiatric disorders and tasks beyond mental health diagnosis. Clinical trials with real patients would further validate the system's practical utility. Automation of knowledge graph construction for different diagnostic domains could streamline system development. Overall, our study highlights the potential for more reliable, adaptive, and goal-driven AI diagnostic assistants by advancing LLMs beyond reactive dialogue systems. By combining different LLMs in a hybrid system approach like "Two Agents Mixed," we can achieve a better balance between accuracy and user experience in clinical AI systems through thoughtful design and strategic model selection.

- ProAI is a goal-oriented and proactive conversational AI framework designed to enhance diagnostic capabilities
- It takes a proactive approach by asking relevant questions and steering conversations towards specific objectives
- Simulation of patient interactions using an LLM agent with clinically informed symptomatology and behavior
- Evaluation of diagnostic accuracy using Critical Node Recall (CN-Recall) and Differential Diagnosis Accuracy (DDx-ACC) metrics
- User experience evaluation based on Helpfulness and Empathy dimensions
- Doctor evaluation based on Specialty and Precision metrics related to clinical quality, coherence, adherence to guidelines, accuracy, and specificity of diagnoses
- Future considerations include expanding evaluation to include a broader range of psychiatric disorders, conducting clinical trials with real patients, and automating knowledge graph construction for different diagnostic domains

Summary- ProAI is a smart computer program that helps doctors figure out what might be wrong with a person's health. - It asks important questions and guides conversations to find answers. - It pretends to talk to patients like a real doctor would, using special information about symptoms and behaviors. - Doctors check how accurate ProAI is by looking at certain numbers like CN-Recall and DDx-ACC. - People also rate how helpful and caring ProAI is, while doctors look at its performance based on clinical quality. Definitions1. Conversational AI: A computer program that can talk with people in a way that feels natural, like having a conversation. 2. Diagnostic: Figuring out what is causing a problem or illness in someone's body. 3. Symptomatology: The study of symptoms or signs of an illness or condition. 4. Metrics: Measurements used to evaluate the performance or effectiveness of something. 5. Empathy: Understanding and sharing the feelings of others, showing care and concern for their well-being.

Introduction

Conversational AI has rapidly evolved in recent years, with the introduction of language model-driven conversational systems. These systems use large pre-trained models to generate human-like responses and engage in natural conversations with users. However, most existing conversational AI operates reactively, responding to user prompts without actively guiding the interaction towards a specific goal. In this research paper, we introduce ProAI - a proactive and goal-oriented conversational AI framework designed to enhance diagnostic capabilities. The system simulates patient interactions using an LLM (Language Model) agent that embodies specific mental disorders with clinically informed symptomatology and behavior. Through multi-round conversations, ProAI engages in diagnostic reasoning to accurately identify the patient's condition.

Evaluation Metrics

To evaluate the performance of ProAI, we use two main metrics - Critical Node Recall (CN-Recall) and Differential Diagnosis Accuracy (DDx-ACC). CN-Recall measures the system's thoroughness in assessing essential criteria nodes during diagnosis. DDx-ACC evaluates its ability to reach correct diagnostic conclusions while ruling out alternative conditions. User experience is also crucial for ensuring a positive patient experience during clinical interviews. We measure two key dimensions - Helpfulness and Empathy - to evaluate the effectiveness of ProAI's medical consultation and its ability to demonstrate understanding and build rapport with patients. Additionally, doctor evaluation ensures that diagnostic decisions are based on rigorous medical reasoning by assessing Specialty and Precision metrics related to clinical quality, coherence, adherence to guidelines, accuracy, and specificity of diagnoses.

Limitations & Future Work

While our study demonstrates strong performance in mental health differential diagnosis using ProAI, there are several limitations that need consideration. Firstly, future work should expand evaluation to include a broader range of psychiatric disorders beyond those simulated in this study. Moreover, conducting clinical trials with real patients would further validate the practical utility of ProAI. This would also help in gathering feedback and improving the system's performance based on real-world interactions. Another area for future work is the automation of knowledge graph construction for different diagnostic domains. This could streamline system development and make it easier to adapt ProAI for various medical specialties.

Conclusion

In conclusion, our study highlights the potential for more reliable, adaptive, and goal-driven AI diagnostic assistants by advancing LLMs beyond reactive dialogue systems. By combining different LLMs in a hybrid system approach like "Two Agents Mixed," we can achieve a better balance between accuracy and user experience in clinical AI systems through thoughtful design and strategic model selection. ProAI has shown promising results in mental health differential diagnosis, but there is still room for improvement and expansion into other medical domains. With further research and development, ProAI has the potential to revolutionize clinical interviews by providing accurate diagnoses while also creating a positive patient experience through proactive conversation guidance and empathy-building capabilities.

Created on 13 Apr. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

64.6%

Capabilities of Gemini Models in Medicine

cs.AI

62.7%

Knowledge Graph Based Agent for Complex, Knowledge-Intensive QA in Medicine

cs.AI

62.5%

IoT-Based Preventive Mental Health Using Knowledge Graphs and Standards for B…

cs.AI

59.9%

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligenc…

cs.AI

58.9%

Towards a Psychological Generalist AI: A Survey of Current Applications of La…

cs.AI

58.9%

Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

cs.AI

58.9%

AgentKit: Flow Engineering with Graphs, not Coding

cs.AI

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.