MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark

AI-generated keywords: Multi-target multi-camera tracking; real-world dynamics; diverse camera configurations; MTMMC dataset; challenging test-bed

AI-generated Key Points

  • Multi-target multi-camera tracking is crucial in visual surveillance, crowd behavior analysis, and anomaly detection.
  • Existing datasets have limitations in modeling real-world dynamics and diverse camera configurations.
  • MTMMC addresses this with a large-scale, real-world dataset captured by 16 multi-modal cameras in campus and factory environments.
  • The dataset provides spatially aligned and temporally synchronized RGB and thermal camera streams as an additional input modality to enhance tracking accuracy.
  • MTMMC benefits fields like person detection, re-identification, and multiple object tracking.
  • Baselines and new learning setups are provided on this dataset for future studies.
  • The datasets, models, and test server will be made publicly available; users must agree to the terms of the Use Agreement before access.

Authors: Sanghyun Woo, Kwanyong Park, Inkyu Shin, Myungchul Kim, In So Kweon

Accepted at CVPR 2024
License: CC BY-NC-SA 4.0

Abstract: Multi-target multi-camera tracking is a crucial task that involves identifying and tracking individuals over time using video streams from multiple cameras. This task has practical applications in various fields, such as visual surveillance, crowd behavior analysis, and anomaly detection. However, due to the difficulty and cost of collecting and labeling data, existing datasets for this task are either synthetically generated or artificially constructed within a controlled camera network setting, which limits their ability to model real-world dynamics and generalize to diverse camera configurations. To address this issue, we present MTMMC, a real-world, large-scale dataset that includes long video sequences captured by 16 multi-modal cameras in two different environments - campus and factory - across various time, weather, and season conditions. This dataset provides a challenging test-bed for studying multi-camera tracking under diverse real-world complexities and includes an additional input modality of spatially aligned and temporally synchronized RGB and thermal cameras, which enhances the accuracy of multi-camera tracking. MTMMC is a super-set of existing datasets, benefiting independent fields such as person detection, re-identification, and multiple object tracking. We provide baselines and new learning setups on this dataset and set the reference scores for future studies. The datasets, models, and test server will be made publicly available.

Submitted to arXiv on 29 Mar. 2024

Results of the summarizing process for the arXiv paper: 2403.20225v1

Multi-target multi-camera tracking is a critical task in fields such as visual surveillance, crowd behavior analysis, and anomaly detection. Existing datasets for this task are either synthetic or artificially constructed, which limits how well they model real-world dynamics and generalize to diverse camera configurations. To address this, the authors introduce MTMMC, a large-scale dataset captured by 16 multi-modal cameras in campus and factory environments under different time, weather, and season conditions. It provides a challenging test-bed for studying multi-camera tracking under diverse real-world complexities. It adds spatially aligned and temporally synchronized RGB and thermal cameras as an additional input modality, which the authors report enhances the accuracy of multi-camera tracking. MTMMC is a super-set of existing datasets and benefits independent fields such as person detection, re-identification, and multiple object tracking. The authors provide baselines and new learning setups on this dataset and set reference scores for future studies. The datasets, models, and test server will be made publicly available; users must agree to the terms and conditions of the Multi-Target Multi-Modal Camera Tracking Dataset Use Agreement before accessing the dataset.
Created on 21 Apr. 2024

