StutterNet: Stuttering Detection Using Time Delay Neural Network

AI-generated keywords: StutterNet, Deep Learning, TDNN, UCLASS, Disfluencies

AI-generated Key Points

The license of the paper does not allow us to build upon its content, so the key points are generated from the paper metadata rather than the full article.

  • StutterNet is a novel approach to detecting stuttering using deep learning techniques
  • It relies solely on the acoustic signal, unlike most existing methods that use automatic speech recognition (ASR) combined with language models for stuttering detection
  • The system uses a time-delay neural network (TDNN) that captures contextual aspects of disfluent utterances
  • StutterNet outperforms the state-of-the-art residual neural network based method when evaluated on the UCLASS stuttering dataset consisting of over 100 speakers
  • StutterNet has substantially fewer trainable parameters, owing to the parameter sharing scheme of TDNN, which makes it an efficient tool for detecting stuttering in real-world scenarios
  • StutterNet advances stuttering detection by applying deep learning directly to the acoustic signal
  • It has the potential to improve the accuracy and efficiency of stuttering detection in a wide range of applications
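To make the idea of a TDNN capturing context concrete, here is a minimal pure-Python sketch of a single time-delay layer. It is not the StutterNet implementation, which is not described in the metadata; the function name `tdnn_layer`, the context offsets, and the ReLU activation are assumptions chosen for illustration. Each output frame is computed from a small window of input frames around it, and the same weights are reused at every time step.

```python
def tdnn_layer(frames, weights, bias, offsets):
    """Apply one illustrative time-delay layer (hypothetical helper).

    frames:  list of T feature vectors, each of length d_in
    weights: one d_out x d_in matrix per context offset
    bias:    list of d_out bias values
    offsets: context offsets relative to the current frame, e.g. [-1, 0, 1]

    Outputs are produced only where the full context window fits.
    """
    t_start = max(0, -min(offsets))
    t_end = len(frames) - max(0, max(offsets))
    d_out = len(bias)
    outputs = []
    for t in range(t_start, t_end):
        out = list(bias)
        for k, off in enumerate(offsets):
            x = frames[t + off]
            for j in range(d_out):
                # The same weights[k] are used at every time step t.
                out[j] += sum(w * v for w, v in zip(weights[k][j], x))
        outputs.append([max(0.0, o) for o in out])  # ReLU (assumed activation)
    return outputs


# Toy input: five one-dimensional frames, context of one frame on each side.
frames = [[1.0], [2.0], [3.0], [4.0], [5.0]]
weights = [[[1.0]], [[1.0]], [[1.0]]]
print(tdnn_layer(frames, weights, [0.0], [-1, 0, 1]))  # [[6.0], [9.0], [12.0]]
```

With all weights set to one, each output is simply the sum of a frame and its two neighbours, which shows how temporal context enters the computation.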

Authors: Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Accepted in EUSIPCO 2021: European Signal Processing Conference

Abstract: This paper introduces StutterNet, a novel deep learning based stuttering detection capable of detecting and identifying various types of disfluencies. Most of the existing work in this domain uses automatic speech recognition (ASR) combined with language models for stuttering detection. Compared to the existing work, which depends on the ASR module, our method relies solely on the acoustic signal. We use a time-delay neural network (TDNN) suitable for capturing contextual aspects of the disfluent utterances. We evaluate our system on the UCLASS stuttering dataset consisting of more than 100 speakers. Our method achieves promising results and outperforms the state-of-the-art residual neural network based method. The number of trainable parameters of the proposed method is also substantially less due to the parameter sharing scheme of TDNN.
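The effect of parameter sharing on model size can be sketched with a simple count. The comparison below is an illustration only and is not the comparison reported in the paper, which is against a residual network; the helper names and the layer sizes (20 input features, 64 output units, a context of 5 frames, 100 frames per utterance) are hypothetical. A time-delay layer reuses one set of weights at every time step, so its parameter count does not depend on the utterance length, whereas a layer with separate weights for every frame position grows with it.

```python
def tdnn_layer_params(d_in, d_out, context_size):
    """Parameters of one time-delay layer: one d_out x d_in matrix per
    context offset plus a bias, shared across all time steps."""
    return context_size * d_in * d_out + d_out


def unshared_layer_params(d_in, d_out, num_frames):
    """Parameters of a fully connected layer that maps all num_frames * d_in
    inputs to num_frames * d_out outputs, with no sharing across time."""
    return (num_frames * d_in) * (num_frames * d_out) + num_frames * d_out


# Hypothetical sizes, chosen only for illustration.
print(tdnn_layer_params(20, 64, 5))        # 6464
print(unshared_layer_params(20, 64, 100))  # 12806400
```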

Submitted to arXiv on 12 May 2021

Results of the summarizing process for the arXiv paper: 2105.05599v2

StutterNet is a novel approach to detecting stuttering using deep learning techniques. Unlike most existing methods in this domain, which rely on automatic speech recognition (ASR) combined with language models, StutterNet relies solely on the acoustic signal. The system uses a time-delay neural network (TDNN) that captures contextual aspects of disfluent utterances. The proposed method achieves promising results and outperforms the state-of-the-art residual neural network based method when evaluated on the UCLASS stuttering dataset, which consists of over 100 speakers. Additionally, StutterNet has substantially fewer trainable parameters, owing to the parameter sharing scheme of TDNN, which makes it an efficient tool for detecting stuttering in real-world scenarios. Overall, StutterNet advances stuttering detection by applying deep learning directly to the acoustic signal, and it has the potential to improve the accuracy and efficiency of stuttering detection in a wide range of applications.
Created on 20 May 2023
