H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem

AI-generated keywords: H-TSP Hierarchical Reinforcement Learning Travelling Salesman Problem End-to-End Learning Framework Solution Quality

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors propose H-TSP framework for Large-Scale Travelling Salesman Problem (TSP)
  • H-TSP utilizes hierarchical reinforcement learning with upper-level and lower-level policies
  • Approach directly produces solutions without time-consuming search procedures
  • Extensive experiments show H-TSP achieves comparable solution quality with significant time reduction
  • First end-to-end deep reinforcement learning approach scaling to TSP instances up to 10,000 nodes
  • Holds promise for practical applications like on-call routing and ride-hailing services
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xuanhao Pan, Yan Jin, Yuandong Ding, Mingxiao Feng, Li Zhao, Lei Song, Jiang Bian

Accepted by AAAI 2023, February 2023

Abstract: We propose an end-to-end learning framework based on hierarchical reinforcement learning, called H-TSP, for addressing the large-scale Travelling Salesman Problem (TSP). The proposed H-TSP constructs a solution of a TSP instance starting from the scratch relying on two components: the upper-level policy chooses a small subset of nodes (up to 200 in our experiment) from all nodes that are to be traversed, while the lower-level policy takes the chosen nodes as input and outputs a tour connecting them to the existing partial route (initially only containing the depot). After jointly training the upper-level and lower-level policies, our approach can directly generate solutions for the given TSP instances without relying on any time-consuming search procedures. To demonstrate effectiveness of the proposed approach, we have conducted extensive experiments on randomly generated TSP instances with different numbers of nodes. We show that H-TSP can achieve comparable results (gap 3.42% vs. 7.32%) as SOTA search-based approaches, and more importantly, we reduce the time consumption up to two orders of magnitude (3.32s vs. 395.85s). To the best of our knowledge, H-TSP is the first end-to-end deep reinforcement learning approach that can scale to TSP instances of up to 10000 nodes. Although there are still gaps to SOTA results with respect to solution quality, we believe that H-TSP will be useful for practical applications, particularly those that are time-sensitive e.g., on-call routing and ride hailing service.

Submitted to arXiv on 19 Apr. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2304.09395v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem," authors Xuanhao Pan, Yan Jin, Yuandong Ding, Mingxiao Feng, Li Zhao, Lei Song, and Jiang Bian propose an innovative end-to-end learning framework based on hierarchical reinforcement learning to tackle the challenging large-scale Travelling Salesman Problem (TSP). The H-TSP framework is designed to efficiently construct solutions for TSP instances by utilizing two key components: an upper-level policy that selects a small subset of nodes from the total set to be traversed (up to 200 nodes in experiments), and a lower-level policy that generates a tour connecting these chosen nodes to the existing partial route. By training these policies jointly, the approach can directly produce solutions for TSP instances without resorting to time-consuming search procedures. To validate the effectiveness of their proposed approach, the authors conducted extensive experiments on randomly generated TSP instances with varying numbers of nodes. The results demonstrate that H-TSP achieves comparable solution quality (with only a 3.42% performance gap compared to state-of-the-art search-based methods) while significantly reducing time consumption by up to two orders of magnitude (3.32 seconds versus 395.85 seconds). Notably, H-TSP stands out as the first end-to-end deep reinforcement learning approach capable of scaling to TSP instances containing up to 10,000 nodes. Despite some remaining gaps in solution quality compared to existing approaches, the authors believe that H-TSP holds promise for practical applications where time sensitivity is crucial—such as on-call routing and ride-hailing services. Accepted for presentation at AAAI 2023 in February 2023, this research represents a significant advancement in addressing large-scale TSPs through innovative hierarchical reinforcement learning techniques.
Created on 27 Feb. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.