A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and Evaluations

AI-generated keywords: Large Vision-Language Models LVLM safety attacks defenses evaluation methods

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Large Vision-Language Models (LVLMs) safety is a paramount concern for researchers and practitioners.
  • The survey by Mang Ye et al. focuses on attacks, defenses, and evaluation methods related to LVLM safety.
  • A unified framework is introduced to address vulnerabilities in LVLMs and strategies to mitigate them.
  • The authors present a classification framework that distinguishes between inference and training phases with nuanced subcategories.
  • Existing limitations in LVLM safety research are highlighted, along with future directions for fortifying model robustness.
  • Safety evaluations on the LVLM Deepseek Janus-Pro offer strategic recommendations for advancing LVLM safety measures.
  • The survey serves as a cornerstone for future research efforts and emphasizes the importance of security and ethical integrity in model development.
  • A public repository has been established by the authors to compile and update advancements in LVLM safety: https://github.com/XuankunRong/Awesome-LVLM-Safety.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mang Ye, Xuankun Rong, Wenke Huang, Bo Du, Nenghai Yu, Dacheng Tao

22 pages, 2 figures

Abstract: With the rapid advancement of Large Vision-Language Models (LVLMs), ensuring their safety has emerged as a crucial area of research. This survey provides a comprehensive analysis of LVLM safety, covering key aspects such as attacks, defenses, and evaluation methods. We introduce a unified framework that integrates these interrelated components, offering a holistic perspective on the vulnerabilities of LVLMs and the corresponding mitigation strategies. Through an analysis of the LVLM lifecycle, we introduce a classification framework that distinguishes between inference and training phases, with further subcategories to provide deeper insights. Furthermore, we highlight limitations in existing research and outline future directions aimed at strengthening the robustness of LVLMs. As part of our research, we conduct a set of safety evaluations on the latest LVLM, Deepseek Janus-Pro, and provide a theoretical analysis of the results. Our findings provide strategic recommendations for advancing LVLM safety and ensuring their secure and reliable deployment in high-stakes, real-world applications. This survey aims to serve as a cornerstone for future research, facilitating the development of models that not only push the boundaries of multimodal intelligence but also adhere to the highest standards of security and ethical integrity. Furthermore, to aid the growing research in this field, we have created a public repository to continuously compile and update the latest work on LVLM safety: https://github.com/XuankunRong/Awesome-LVLM-Safety .

Submitted to arXiv on 14 Feb. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2502.14881v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the rapidly evolving landscape of Large Vision-Language Models (LVLMs), ensuring their safety has become a paramount concern for researchers and practitioners alike. This comprehensive survey, authored by Mang Ye, Xuankun Rong, Wenke Huang, Bo Du, Nenghai Yu, and Dacheng Tao, delves deep into the realm of LVLM safety with a focus on attacks, defenses, and evaluation methods. The survey introduces a unified framework that intricately weaves together these critical components to provide a holistic view of the vulnerabilities inherent in LVLMs and the corresponding strategies to mitigate them. By dissecting the lifecycle of LVLMs, the authors present a classification framework that delineates between inference and training phases while offering nuanced subcategories for deeper insights. Moreover, the survey sheds light on existing limitations in LVLM safety research and outlines future directions aimed at fortifying the robustness of these models. Through a series of safety evaluations conducted on the cutting-edge LVLM known as Deepseek Janus-Pro, the authors offer strategic recommendations to advance LVLM safety measures and ensure their secure deployment in high-stakes real-world applications. This seminal work not only serves as a cornerstone for future research endeavors but also paves the way for developing models that not only push boundaries of multimodal intelligence but also adhere to stringent standards of security and ethical integrity. To further support ongoing research in this domain, the authors have established a public repository aimed at continuously compiling and updating latest advancements in LVLM safety: https://github.com/XuankunRong/Awesome-LVLM-Safety. With 22 pages and 2 figures encapsulating their findings,this survey stands as an invaluable resource for researchers, industry professionals,and policymakers seeking to navigate intricate landscape of LVLM safety with precision and foresight.
Created on 04 Mar. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.