Efficient Few-Shot Learning Without Prompts

AI-generated keywords: SetFit, Few-shot learning, Sentence Transformers, PEFT, PET

AI-generated Key Points

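  • Recent few-shot methods such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET) are difficult to employ because they are subject to high variability from manually crafted prompts
  • These methods typically require billion-parameter language models to achieve high accuracy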
  • Authors propose a new framework called SetFit for efficient and prompt-free few-shot fine-tuning of Sentence Transformers (ST)
  • SetFit uses contrastive Siamese fine-tuning on pretrained ST to generate rich text embeddings
  • A classification head is then trained on these embeddings; no prompts or verbalizers are required
  • SetFit achieves high accuracy with orders of magnitude fewer parameters than existing techniques
  • SetFit is an order of magnitude faster to train compared to PEFT and PET methods
  • SetFit can be applied in multilingual settings by switching the ST body
  • Paper provides insights into related approaches (ADAPET, PERFECT) and highlights differences from SetFit
  • Performance of different PLM backbones is compared and discussed in the paper
  • Code for SetFit is available on GitHub along with datasets provided by authors
  • Overall, SetFit offers an efficient and effective solution for few-shot learning without relying on manually crafted prompts or billion-parameter language models.

Authors: Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

License: CC BY 4.0

Abstract: Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high variability from manually crafted prompts, and typically require billion-parameter language models to achieve high accuracy. To address these shortcomings, we propose SetFit (Sentence Transformer Fine-tuning), an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers (ST). SetFit works by first fine-tuning a pretrained ST on a small number of text pairs, in a contrastive Siamese manner. The resulting model is then used to generate rich text embeddings, which are used to train a classification head. This simple framework requires no prompts or verbalizers, and achieves high accuracy with orders of magnitude less parameters than existing techniques. Our experiments show that SetFit obtains comparable results with PEFT and PET techniques, while being an order of magnitude faster to train. We also show that SetFit can be applied in multilingual settings by simply switching the ST body. Our code is available at https://github.com/huggingface/setfit and our datasets at https://huggingface.co/setfit .
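
The first stage described in the abstract, fine-tuning on "a small number of text pairs, in a contrastive Siamese manner", depends on turning a handful of labeled examples into pairs. The sketch below is one minimal way to build such pairs, assuming that same-class pairs get a target of 1.0 and different-class pairs a target of 0.0. The function name `generate_pairs`, the balancing step, and the toy sentences are illustrative assumptions, not taken from the paper, whose exact sampling scheme may differ.

```python
import itertools
import random


def generate_pairs(examples, seed=0):
    """Build contrastive training pairs from few-shot examples.

    examples: list of (text, label) tuples.
    Returns a shuffled list of (text_a, text_b, target) tuples, where
    target is 1.0 for a same-class pair and 0.0 for a different-class pair.
    Assumption: negatives are subsampled to match the number of positives.
    """
    rng = random.Random(seed)
    positives, negatives = [], []
    for (text_a, label_a), (text_b, label_b) in itertools.combinations(examples, 2):
        if label_a == label_b:
            positives.append((text_a, text_b, 1.0))
        else:
            negatives.append((text_a, text_b, 0.0))
    # Keep the two pair types balanced so neither dominates training.
    keep = min(len(positives), len(negatives))
    negatives = rng.sample(negatives, keep)
    pairs = positives + negatives
    rng.shuffle(pairs)
    return pairs


# Hypothetical few-shot data: two examples per class.
few_shot = [
    ("great film", "pos"),
    ("loved it", "pos"),
    ("terrible plot", "neg"),
    ("waste of time", "neg"),
]
pairs = generate_pairs(few_shot)
for text_a, text_b, target in pairs:
    print(f"{target:.1f}  {text_a!r} / {text_b!r}")
```

Even four labeled sentences yield several training pairs, which is one reason pair-based fine-tuning can work with very little labeled data. In a real setup these pairs would be fed to a Sentence Transformer with a similarity-based loss; that step is omitted here.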

Submitted to arXiv on 22 Sep. 2022


Results of the summarizing process for the arXiv paper: 2209.11055v1

The paper discusses the limitations of recent few-shot learning methods such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), which are subject to high variability from manually crafted prompts and typically require billion-parameter language models to achieve high accuracy. To address these challenges, the authors propose SetFit (Sentence Transformer Fine-tuning), a framework for efficient and prompt-free few-shot fine-tuning of Sentence Transformers (ST). SetFit works in two stages. First, a pretrained ST is fine-tuned on a small number of text pairs in a contrastive Siamese manner. Second, the fine-tuned model generates rich text embeddings, which are used to train a classification head. SetFit requires no prompts or verbalizers and achieves high accuracy with orders of magnitude fewer parameters than existing techniques. The authors' experiments show that SetFit achieves results comparable to PEFT and PET methods while being an order of magnitude faster to train. SetFit can also be applied in multilingual settings by simply switching the ST body. The paper also discusses related approaches such as ADAPET and PERFECT, highlighting their strengths and their differences from SetFit, and compares the performance of different PLM backbones. The code for SetFit is available on GitHub, along with datasets provided by the authors. Overall, SetFit offers an efficient and effective solution for few-shot learning without relying on manually crafted prompts or billion-parameter language models.
Created on 24 Jul. 2023
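
The second stage in the summary above, training a classification head on the generated embeddings, can be illustrated with a small logistic-regression head written in plain Python. This is a sketch under stated assumptions: the two-dimensional vectors stand in for real Sentence Transformer embeddings, and `train_head`, `predict`, the learning rate, and the epoch count are all hypothetical choices made for the example, not the authors' implementation.

```python
import math


def train_head(embeddings, labels, epochs=200, lr=0.5):
    """Fit a binary logistic-regression head with plain SGD.

    embeddings: list of equal-length float vectors (stand-ins for ST output).
    labels: list of 0/1 class labels.
    Returns (weights, bias).
    """
    weights = [0.0] * len(embeddings[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(embeddings, labels):
            z = sum(w * xi for w, xi in zip(weights, x)) + bias
            prob = 1.0 / (1.0 + math.exp(-z))
            error = prob - y  # gradient of the log loss with respect to z
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
            bias -= lr * error
    return weights, bias


def predict(weights, bias, x):
    """Return class 1 if the linear score is positive, else class 0."""
    z = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if z > 0 else 0


# Toy "embeddings": class 1 clusters near (1, 0), class 0 near (0, 1).
toy_embeddings = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
toy_labels = [1, 1, 0, 0]

weights, bias = train_head(toy_embeddings, toy_labels)
predictions = [predict(weights, bias, x) for x in toy_embeddings]
print(predictions)
```

The head only sees fixed-size vectors, so it has no notion of prompts or verbalizers. The same reasoning suggests why switching the ST body, for example to a multilingual one, leaves this stage unchanged.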


