Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing
AI-generated Key Points
- Autonomous driving in high-speed racing environments poses challenges for scene understanding due to rapid changes in the track environment
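- Traditional sequential network approaches, which run perception tasks one after another, may struggle to meet the real-time knowledge and decision-making demands of an agent covering large displacements in a short time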
- The Parallel Perception Network (PPN) architecture consists of two independent neural networks - segmentation and reconstruction networks - running in parallel on separate accelerated hardware
- PPN utilizes true hardware-enabled parallelism to match the high velocity of the autonomous agent, resulting in faster processing speeds
- The model is trained on a system with two NVIDIA T4 GPUs using a combination of loss functions, including edge preservation, and demonstrates a 2x speedup in model inference time compared to a sequential configuration
- GPUs serve as hardware accelerators during both training and inference, building on NVIDIA's CUDA platform
- Running each task on its own accelerator mitigates latency in real-time, multi-task perception for autonomous driving
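To illustrate why running two independent networks side by side can approach a 2x speedup over running them back to back, here is a minimal sketch. It is not the paper's implementation: `segmentation_stub` and `reconstruction_stub` are hypothetical placeholders, `time.sleep` stands in for inference on a separate device, and Python threads stand in for the two GPUs.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def segmentation_stub(bev, delay=0.1):
    """Hypothetical stand-in for the segmentation network on device 0."""
    time.sleep(delay)  # assumption: simulates inference latency
    return "segmentation"


def reconstruction_stub(bev, delay=0.1):
    """Hypothetical stand-in for the reconstruction network on device 1."""
    time.sleep(delay)  # assumption: simulates inference latency
    return "reconstruction"


def run_sequential(bev):
    """Run both tasks one after the other and return (outputs, seconds)."""
    start = time.perf_counter()
    outputs = (segmentation_stub(bev), reconstruction_stub(bev))
    return outputs, time.perf_counter() - start


def run_parallel(bev):
    """Run both tasks concurrently and return (outputs, seconds)."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=2) as pool:
        seg_future = pool.submit(segmentation_stub, bev)
        rec_future = pool.submit(reconstruction_stub, bev)
        outputs = (seg_future.result(), rec_future.result())
    return outputs, time.perf_counter() - start


if __name__ == "__main__":
    _, seq_seconds = run_sequential(None)
    _, par_seconds = run_parallel(None)
    print(f"sequential: {seq_seconds:.3f}s, parallel: {par_seconds:.3f}s")
```

When the two tasks take roughly equal time and do not share a device, the parallel wall-clock time is close to that of the slower task alone, which is where an ideal factor of about two comes from. Real speedups depend on the actual networks and hardware.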
Authors: Suwesh Prasad Sah
Abstract: Autonomous driving in high-speed racing, as opposed to urban environments, presents significant challenges in scene understanding due to rapid changes in the track environment. Traditional sequential network approaches may struggle to meet the real-time knowledge and decision-making demands of an autonomous agent covering large displacements in a short time. This paper proposes a novel baseline architecture for developing sophisticated models capable of true hardware-enabled parallelism, achieving neural processing speeds that mirror the agent's high velocity. The proposed model, the Parallel Perception Network (PPN), consists of two independent neural networks, a segmentation network and a reconstruction network, running in parallel on separate accelerated hardware. The model takes raw 3D point cloud data from the LiDAR sensor as input and converts it into a 2D Bird's Eye View Map on both devices. Each network independently extracts its input features along space and time dimensions and produces its output in parallel with the other. The proposed model is trained on a system with two NVIDIA T4 GPUs, using a combination of loss functions, including edge preservation, and demonstrates a 2x speedup in model inference time compared to a sequential configuration. Implementation is available at: https://github.com/suwesh/Parallel-Perception-Network. Learned parameters of the trained networks are provided at: https://huggingface.co/suwesh/ParallelPerceptionNetwork.
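The conversion from a 3D point cloud to a 2D Bird's Eye View map mentioned in the abstract can be sketched as a top-down occupancy grid. This is only an illustration under stated assumptions: the function name `point_cloud_to_bev`, the grid ranges, the cell size, and the binary occupancy encoding are all hypothetical, and the paper's actual BEV encoding may differ.

```python
from typing import Iterable, List, Tuple

Point = Tuple[float, float, float]  # (x, y, z) in metres


def point_cloud_to_bev(
    points: Iterable[Point],
    x_range: Tuple[float, float] = (-10.0, 10.0),  # assumed extent
    y_range: Tuple[float, float] = (-10.0, 10.0),  # assumed extent
    cell_size: float = 1.0,                        # assumed resolution
) -> List[List[int]]:
    """Project 3D points onto a 2D binary occupancy grid seen from above."""
    x_min, x_max = x_range
    y_min, y_max = y_range
    n_cols = int((x_max - x_min) / cell_size)
    n_rows = int((y_max - y_min) / cell_size)
    grid = [[0] * n_cols for _ in range(n_rows)]

    for x, y, _z in points:
        # Skip points that fall outside the mapped area.
        if not (x_min <= x < x_max and y_min <= y < y_max):
            continue
        # Drop the height and bin (x, y) into a cell; clamp against
        # floating-point rounding at the upper edge.
        col = min(int((x - x_min) / cell_size), n_cols - 1)
        row = min(int((y - y_min) / cell_size), n_rows - 1)
        grid[row][col] = 1
    return grid


if __name__ == "__main__":
    cloud = [(0.5, 0.5, 1.0), (-10.0, -10.0, 0.0), (25.0, 0.0, 0.0)]
    bev = point_cloud_to_bev(cloud)
    print(sum(map(sum, bev)), "occupied cells")
```

A binary grid discards height information; richer encodings (for example, per-cell height or point counts) are common alternatives, and nothing here indicates which one the paper uses.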