Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries

AI-generated keywords: Sign language recognition Few-shot learning Online dictionaries Training dataset Localization

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Matyáš Boháček and Marek Hrúz address challenges in current sign language recognition models:
  • Models require large training datasets of laboratory-like videos, which are difficult and costly to collect.
  • Limited availability of publicly accessible systems, especially for less-populated sign languages.
  • Proposal to overcome limitations and democratize technology:
  • Utilize online text-to-video dictionaries containing annotated data on various attributes and sign languages.
  • Introduce UWB-SL-Wild few-shot dataset sourced from dictionary-scraped videos to reflect actual distribution of online sign language data.
  • Approach presented in the study:
  • Select glosses overlapping with existing datasets like WLASL100 and ASLLVD for transfer learning experiments.
  • Novel approach to training sign language recognition models in a few-shot scenario.
  • Results of the proposed method:
  • State-of-the-art results on ASLLVD-Skeleton and ASLLVD-Skeleton-20 datasets with top-1 accuracy rates of $30.97%$ and $95.45%$, respectively.
  • Contribution to advancing sign language recognition technology:
  • Addressing challenges related to training data availability.
  • Making technology more inclusive across diverse linguistic communities.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Matyáš Boháček, Marek Hrúz

6 pages, 2 figures, IEEE Face & Gestures 2023
License: CC BY-NC-ND 4.0

Abstract: Today's sign language recognition models require large training corpora of laboratory-like videos, whose collection involves an extensive workforce and financial resources. As a result, only a handful of such systems are publicly available, not to mention their limited localization capabilities for less-populated sign languages. Utilizing online text-to-video dictionaries, which inherently hold annotated data of various attributes and sign languages, and training models in a few-shot fashion hence poses a promising path for the democratization of this technology. In this work, we collect and open-source the UWB-SL-Wild few-shot dataset, the first of its kind training resource consisting of dictionary-scraped videos. This dataset represents the actual distribution and characteristics of available online sign language data. We select glosses that directly overlap with the already existing datasets WLASL100 and ASLLVD and share their class mappings to allow for transfer learning experiments. Apart from providing baseline results on a pose-based architecture, we introduce a novel approach to training sign language recognition models in a few-shot scenario, resulting in state-of-the-art results on ASLLVD-Skeleton and ASLLVD-Skeleton-20 datasets with top-1 accuracy of $30.97~\%$ and $95.45~\%$, respectively.

Submitted to arXiv on 10 Jan. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2301.03769v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries," authors Matyáš Boháček and Marek Hrúz address the challenges faced by current sign language recognition models. These models typically require large training datasets of laboratory-like videos, which can be difficult and costly to collect. This leads to a limited availability of publicly accessible systems, especially for less-populated sign languages. To overcome these limitations and democratize the technology, the authors propose utilizing online text-to-video dictionaries that contain annotated data on various attributes and sign languages. In this study, the researchers introduce the UWB-SL-Wild few-shot dataset sourced from dictionary-scraped videos. This dataset reflects the actual distribution and characteristics of online sign language data, providing a valuable resource for training recognition models in a few-shot fashion. By selecting glosses that overlap with existing datasets like WLASL100 and ASLLVD, the authors enable transfer learning experiments and facilitate comparisons between different datasets. Additionally, the paper presents a novel approach to training sign language recognition models in a few-shot scenario. The proposed method yields state-of-the-art results on ASLLVD-Skeleton and ASLLVD-Skeleton-20 datasets with impressive top-1 accuracy rates of $30.97~\%$ and $95.45~\%$, respectively. These results demonstrate the effectiveness of leveraging online dictionaries for training sign language recognition models and highlight the potential for broader accessibility and localization capabilities in this field. Overall, this work significantly contributes to advancing sign language recognition technology by addressing challenges related to training data availability and making it more inclusive across diverse linguistic communities. has been greatly improved through , utilizing as a valuable . This approach also has the potential to enhance capabilities and make sign language recognition more accessible for all.
Created on 15 May. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.