Learning Co-Speech Gesture for Multimodal Aphasia Type Detection

AI-generated keywords: Aphasia Language disorder Brain damage Multimodal graph neural network Co-speech gestures

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Aphasia is a language disorder resulting from brain damage
  • Accurate identification of aphasia types is crucial for effective treatment
  • Limited focus on developing methods to detect different types of aphasia
  • Multimodal graph neural network proposed for aphasia type detection using speech and gesture patterns
  • Model learns correlation between speech and gesture modalities for each aphasia type
  • Generates textual representations sensitive to gesture information, leading to accurate detection of aphasia types
  • Extensive experiments conducted, achieving state-of-the-art results with an F1 score of 84.2%
  • Gesture features more effective than acoustic features in detecting aphasia types
  • Codes provided for reproducibility
  • Novel approach leveraging co-speech gestures for detecting different types of aphasia
  • Emphasizes importance of incorporating gesture information in identifying aphasia types
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Daeun Lee, Sejung Son, Hyolim Jeon, Seungbae Kim, Jinyoung Han

EMNLP 2023 accepted

Abstract: Aphasia, a language disorder resulting from brain damage, requires accurate identification of specific aphasia types, such as Broca's and Wernicke's aphasia, for effective treatment. However, little attention has been paid to developing methods to detect different types of aphasia. Recognizing the importance of analyzing co-speech gestures for distinguish aphasia types, we propose a multimodal graph neural network for aphasia type detection using speech and corresponding gesture patterns. By learning the correlation between the speech and gesture modalities for each aphasia type, our model can generate textual representations sensitive to gesture information, leading to accurate aphasia type detection. Extensive experiments demonstrate the superiority of our approach over existing methods, achieving state-of-the-art results (F1 84.2\%). We also show that gesture features outperform acoustic features, highlighting the significance of gesture expression in detecting aphasia types. We provide the codes for reproducibility purposes.

Submitted to arXiv on 18 Oct. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2310.11710v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Aphasia is a language disorder that results from brain damage. Accurate identification of specific aphasia types is crucial for effective treatment. However, there has been limited focus on developing methods to detect different types of aphasia. Recognizing the significance of analyzing co-speech gestures in distinguishing aphasia types, the authors propose a multimodal graph neural network for aphasia type detection using speech and corresponding gesture patterns. The proposed model learns the correlation between speech and gesture modalities for each aphasia type, enabling it to generate textual representations that are sensitive to gesture information. This leads to accurate detection of aphasia types. The authors conducted extensive experiments to evaluate their approach and found that it outperforms existing methods, achieving state-of-the-art results with an F1 score of 84.2%. Furthermore, the authors demonstrate that gesture features are more effective than acoustic features in detecting aphasia types, highlighting the significance of gesture expression in this context. To facilitate reproducibility, the authors provide the codes associated with their work. In summary, this study presents a novel approach for detecting different types of aphasia by leveraging co-speech gestures. The proposed multimodal graph neural network achieves superior performance compared to existing methods and emphasizes the importance of incorporating gesture information in identifying aphasia types.
Created on 13 Jan. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.