MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task

AI-generated keywords: MUSER Stress Detection Multimodal Features Affective Computing Human-Computer Interaction

AI-generated Key Points

  • Automatic detection of human stress is crucial for AI agents involved in affective computing and human-computer interaction.
  • Stress and emotion are both human affective states, with stress having significant implications on the regulation and expression of emotion.
  • MUSER is a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy that explores the inter-dependence between stress and emotion.
  • The method was evaluated on the Multimodal Stressed Emotion (MuSE) dataset, which includes both stress and emotion labels, making it an ideal benchmark for an in-depth analysis of their inter-dependence.
  • MUSER makes four main contributions: demonstrating the inter-dependence between stress and emotion via quantitative analyses on linguistic and acoustic features; establishing a state-of-the art stress detection model with a transformer structure as well as a novel speed-based dynamic sampling strategy for multi-task learning; achieving superior results on the MuSE dataset via multi-task training with both stress and emotion labels; showing that their speed-based dynamic sampling significantly outperforms other widely used methods.
  • Previous studies have explored unimodal approaches such as textual modality or acoustic features for unimodal stress detection, but multimodal features usually result in better performances.
  • MUSER provides an effective solution for detecting human stress using multiple modalities.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yiqun Yao, Michalis Papakostas, Mihai Burzo, Mohamed Abouelenien, Rada Mihalcea

NAACL 2021 accepted
License: CC BY 4.0

Abstract: The capability to automatically detect human stress can benefit artificial intelligent agents involved in affective computing and human-computer interaction. Stress and emotion are both human affective states, and stress has proven to have important implications on the regulation and expression of emotion. Although a series of methods have been established for multimodal stress detection, limited steps have been taken to explore the underlying inter-dependence between stress and emotion. In this work, we investigate the value of emotion recognition as an auxiliary task to improve stress detection. We propose MUSER -- a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy. Evaluations on the Multimodal Stressed Emotion (MuSE) dataset show that our model is effective for stress detection with both internal and external auxiliary tasks, and achieves state-of-the-art results.

Submitted to arXiv on 17 May. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2105.08146v1

The ability to automatically detect human stress is crucial for artificial intelligent agents involved in affective computing and human-computer interaction. Stress and emotion are both human affective states, with stress having significant implications on the regulation and expression of emotion. While several methods have been established for multimodal stress detection, limited steps have been taken to explore the underlying inter-dependence between stress and emotion. To address this gap, a team of researchers proposed MUSER - a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy. The team investigated the value of emotion recognition as an auxiliary task to improve stress detection. Their method was evaluated on the Multimodal Stressed Emotion (MuSE) dataset, which includes both stress and emotion labels, making it an ideal benchmark for an in-depth analysis of their inter-dependence. To test the generalization ability of their method, they also used an external emotion dataset for the auxiliary task - OMG-Emotion dataset. Their paper makes four main contributions: firstly, they demonstrate the inter-dependence between stress and emotion via quantitative analyses on linguistic and acoustic features; secondly, they establish a state-of-the art stress detection model with a transformer structure as well as a novel speed-based dynamic sampling strategy for multi-task learning; thirdly, they achieve superior results on the MuSE dataset via multi-task training with both stress and emotion labels; finally, experimental results show that their speed-based dynamic sampling significantly outperforms other widely used methods. Previous studies have explored unimodal approaches such as textual modality or acoustic features for unimodal stress detection. However, these approaches only have access to partial information about the expression of stress while multiple modalities can potentially be informative at the same time. As demonstrated by previous work on human sentiment and emotion prediction, multimodal features usually result in better performances. In conclusion, MUSER provides an effective solution for detecting human stress using multiple modalities. Their approach shows promising results and can be used in various applications such as affective computing, human computer interaction ,and mental health monitoring .
Created on 09 May. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.