CosFace: Large Margin Cosine Loss for Deep Face Recognition

AI-generated keywords: Face recognition, Deep convolutional neural networks, Large margin cosine loss, Discrimination power, State-of-the-art performance

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Face recognition advancements facilitated by deep convolutional neural networks (CNN)
  • Limitations of traditional softmax loss function in discriminating facial features
  • Earlier alternative loss functions such as center loss, large margin softmax loss, and angular softmax loss
  • Proposal of a novel approach called large margin cosine loss (LMCL) to enhance feature discrimination
  • Reformulation of softmax loss as cosine loss by L2 normalizing feature vectors and weight vectors
  • Introduction of a cosine margin term to enhance decision margins in angular space
  • Leveraging normalization and maximizing cosine decision margins to achieve minimum intra-class variance and maximum inter-class variance
  • Evaluation of the model trained with LMCL (CosFace) through experiments on popular face recognition datasets such as MegaFace Challenge, YouTube Faces (YTF), and Labeled Faces in the Wild (LFW)
  • Reported state-of-the-art performance of CosFace on these benchmark datasets
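
The normalization step in the key points above can be illustrated with a small NumPy sketch. It is not taken from the paper: the function name `cosine_logits`, the array sizes, and the random data are all assumptions made for illustration. It shows that once both features and class weights are L2-normalized, rescaling a feature vector no longer changes the logits, which is one way to read "removing radial variation".

```python
import numpy as np

def cosine_logits(features, weights):
    """Cosine of the angle between each feature row and each class-weight row."""
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    return f @ w.T

rng = np.random.default_rng(0)
features = rng.normal(size=(4, 8))  # 4 samples, 8-dim embeddings (made-up sizes)
weights = rng.normal(size=(3, 8))   # 3 classes (made-up size)

# Plain inner-product logits depend on the length of the feature vector ...
plain = features @ weights.T
scaled_plain = (5.0 * features) @ weights.T

# ... while cosine logits depend only on its direction.
cos = cosine_logits(features, weights)
scaled_cos = cosine_logits(5.0 * features, weights)

print(np.allclose(plain, scaled_plain), np.allclose(cos, scaled_cos))
```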

Authors: Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Zhifeng Li, Dihong Gong, Jingchao Zhou, Wei Liu

Abstract: Face recognition has achieved revolutionary advancement owing to the advancement of the deep convolutional neural network (CNN). The central task of face recognition, including face verification and identification, involves face feature discrimination. However, traditional softmax loss of deep CNN usually lacks the power of discrimination. To address this problem, recently several loss functions such as center loss \cite{centerloss}, large margin softmax loss \cite{lsoftmax}, and angular softmax loss \cite{sphereface} have been proposed. All these improvement algorithms share the same idea: maximizing inter-class variance and minimizing intra-class variance. In this paper, we design a novel loss function, namely large margin cosine loss (LMCL), to realize this idea from a different perspective. More specifically, we reformulate the softmax loss as cosine loss by L2 normalizing both features and weight vectors to remove radial variation, based on which a cosine margin term $m$ is introduced to further maximize decision margin in angular space. As a result, minimum intra-class variance and maximum inter-class variance are achieved by normalization and cosine decision margin maximization. We refer to our model trained with LMCL as CosFace. To test our approach, extensive experimental evaluations are conducted on the most popular public-domain face recognition datasets such as MegaFace Challenge, YouTube Faces (YTF) and Labeled Faces in the Wild (LFW). We achieve the state-of-the-art performance on these benchmark experiments, which confirms the effectiveness of our approach.
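
The reformulation described in the abstract can be written out as a formula. What follows is a sketch of the commonly cited form of the loss, not a quotation from this page: the scale factor $s$, the batch size $N$, the class index $j$, and the label notation $y_i$ are not mentioned in the abstract and are assumptions here. Only the margin $m$ and the L2 normalization of features and weights come from the text above.

```latex
L_{\mathrm{lmc}} = \frac{1}{N} \sum_{i=1}^{N} -\log
  \frac{e^{s(\cos\theta_{y_i,i} - m)}}
       {e^{s(\cos\theta_{y_i,i} - m)} + \sum_{j \neq y_i} e^{s\cos\theta_{j,i}}},
\qquad
\cos\theta_{j,i} = \frac{W_j^{\top} x_i}{\lVert W_j \rVert \, \lVert x_i \rVert}
```

Setting $m = 0$ recovers a softmax loss on scaled cosine similarities; a positive $m$ requires the target-class cosine to exceed every other class cosine by at least $m$ before the sample is treated as correctly classified.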

Submitted to arXiv on 29 Jan. 2018


Results of the summarizing process for the arXiv paper: 1801.09414v1

This paper's license does not allow us to build upon its content, so the summary below was generated from the paper's metadata rather than the full article.

In their paper titled "CosFace: Large Margin Cosine Loss for Deep Face Recognition," authors Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Zhifeng Li, Dihong Gong, Jingchao Zhou, and Wei Liu start from the advances in face recognition made possible by deep convolutional neural networks (CNNs). Face recognition tasks such as verification and identification depend on discriminating facial features, yet the traditional softmax loss used in deep CNNs often lacks discriminative power. To address this limitation, earlier work introduced alternative loss functions such as center loss, large margin softmax loss, and angular softmax loss, all of which aim to maximize inter-class variance while minimizing intra-class variance. This study proposes large margin cosine loss (LMCL) to pursue the same goal from a different perspective. LMCL reformulates the softmax loss as a cosine loss by L2 normalizing both feature vectors and weight vectors, which removes radial variation. A cosine margin term m is then introduced to further enlarge the decision margin in angular space. According to the authors, normalization combined with cosine margin maximization yields minimum intra-class variance and maximum inter-class variance. The model trained with LMCL, referred to as CosFace, is evaluated through extensive experiments on popular face recognition datasets such as MegaFace Challenge, YouTube Faces (YTF), and Labeled Faces in the Wild (LFW). The authors report state-of-the-art performance on these benchmarks and take this as confirmation that improved feature discrimination makes the approach effective.
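
The steps in the summary above (L2-normalize features and weights, subtract a margin from the target-class cosine, apply softmax cross-entropy) can be sketched in NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `lmcl_loss`, the scale factor `s`, and the default values of `s` and `m` are illustrative choices that do not come from this page.

```python
import numpy as np

def lmcl_loss(features, weights, labels, s=30.0, m=0.35):
    """Mean large-margin cosine loss over a batch (illustrative sketch).

    features: (N, D) embeddings, weights: (C, D) class weights,
    labels: (N,) integer class ids. s and m are assumed values.
    """
    # L2-normalize both sides so the logits are pure cosines.
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    cos = f @ w.T  # shape (N, C)

    # Subtract the margin m from the target-class cosine only.
    rows = np.arange(len(labels))
    margin = np.zeros_like(cos)
    margin[rows, labels] = m
    logits = s * (cos - margin)

    # Numerically stable log-softmax, then cross-entropy on the target class.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_prob[rows, labels].mean()

# Made-up data: 6 samples, 16-dim embeddings, 4 classes.
rng = np.random.default_rng(0)
x = rng.normal(size=(6, 16))
W = rng.normal(size=(4, 16))
y = np.array([0, 1, 2, 3, 0, 1])

print(lmcl_loss(x, W, y, m=0.0), lmcl_loss(x, W, y, m=0.35))
```

On the same inputs the loss with a positive margin is larger than with `m=0.0`, because the target-class logit is lowered while the others are unchanged; during training this is what pushes the target cosine further above the rest.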
Created on 26 Sep. 2025

