Language Prompt for Autonomous Driving

AI-generated keywords: NuPrompt PromptTrack Language Prompts Autonomous Driving Trajectory

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Growing trend in computer vision community to capture objects based on natural language prompts
  • Lack of paired prompt-instance data for driving scenarios
  • Introduction of NuPrompt - object-centric language prompt set for driving scenes in 3D, multi-view, and multi-frame space
  • NuPrompt consists of 35,367 language descriptions referring to an average of 5.3 object tracks
  • Introduction of PromptTrack - a baseline model based on Transformer architecture
  • Experimental results show impressive performance of PromptTrack on NuPrompt
  • Dataset and code will be made publicly available at GitHub repository (https://github.com/wudongming97/Prompt4Driving)
  • Research addresses bottleneck in utilizing language prompts for driving scenarios
  • Promising results from PromptTrack contribute to advancing research in autonomous driving.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dongming Wu, Wencheng Han, Tiancai Wang, Yingfei Liu, Xiangyu Zhang, Jianbing Shen

Abstract: A new trend in the computer vision community is to capture objects of interest following flexible human command represented by a natural language prompt. However, the progress of using language prompts in driving scenarios is stuck in a bottleneck due to the scarcity of paired prompt-instance data. To address this challenge, we propose the first object-centric language prompt set for driving scenes within 3D, multi-view, and multi-frame space, named NuPrompt. It expands Nuscenes dataset by constructing a total of 35,367 language descriptions, each referring to an average of 5.3 object tracks. Based on the object-text pairs from the new benchmark, we formulate a new prompt-based driving task, \ie, employing a language prompt to predict the described object trajectory across views and frames. Furthermore, we provide a simple end-to-end baseline model based on Transformer, named PromptTrack. Experiments show that our PromptTrack achieves impressive performance on NuPrompt. We hope this work can provide more new insights for the autonomous driving community. Dataset and Code will be made public at \href{https://github.com/wudongming97/Prompt4Driving}{https://github.com/wudongming97/Prompt4Driving}.

Submitted to arXiv on 08 Sep. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2309.04379v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the computer vision community, there is a growing trend to capture objects of interest based on natural language prompts given by humans. To overcome the challenge of lack of paired prompt-instance data for driving scenarios, the authors propose NuPrompt - the first object-centric language prompt set specifically designed for driving scenes in 3D, multi-view and multi-frame space. NuPrompt consists of 35,367 language descriptions which refer to an average of 5.3 object tracks providing a rich source of information for training and evaluation purposes. The authors introduce PromptTrack - a simple end-to-end baseline model based on Transformer architecture - to demonstrate the effectiveness of their approach. Experimental results show that PromptTrack achieves impressive performance on NuPrompt. The authors plan to make both the dataset and code publicly available at their GitHub repository (https://github.com/wudongming97/Prompt4Driving). This research addresses an important bottleneck in utilizing language prompts for driving scenarios by introducing NuPrompt and formulating a new prompt-based driving task with promising results from PromptTrack model contributing to advancing research in autonomous driving.
Created on 04 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.