Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems

AI-generated keywords: Job Routing Heterogeneous Queueing Systems Reinforcement Learning Policy Gradient-based Algorithm ACHQ

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Neharika Jali, Guannan Qu, Weina Wang, and Gauri Joshi present a study on efficiently routing jobs in heterogeneous queueing systems.
  • They propose ACHQ, an efficient policy gradient-based algorithm for determining optimal policies in multi-server systems.
  • ACHQ utilizes a low-dimensional soft threshold policy parameterization to exploit the underlying queueing structure.
  • The authors provide guarantees of stationary-point convergence for the general case and demonstrate convergence to an approximate global optimum for the special case of two servers.
  • Through simulations, they show that ACHQ can improve expected response time by up to approximately 30% compared to a greedy policy that routes jobs to the fastest available server.
  • This research has been accepted for presentation at AISTATS 2024 and offers valuable insights into optimizing job routing in heterogeneous queueing systems using innovative reinforcement learning techniques.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Neharika Jali, Guannan Qu, Weina Wang, Gauri Joshi

Accepted to AISTATS 2024

Abstract: We consider the problem of efficiently routing jobs that arrive into a central queue to a system of heterogeneous servers. Unlike homogeneous systems, a threshold policy, that routes jobs to the slow server(s) when the queue length exceeds a certain threshold, is known to be optimal for the one-fast-one-slow two-server system. But an optimal policy for the multi-server system is unknown and non-trivial to find. While Reinforcement Learning (RL) has been recognized to have great potential for learning policies in such cases, our problem has an exponentially large state space size, rendering standard RL inefficient. In this work, we propose ACHQ, an efficient policy gradient based algorithm with a low dimensional soft threshold policy parameterization that leverages the underlying queueing structure. We provide stationary-point convergence guarantees for the general case and despite the low-dimensional parameterization prove that ACHQ converges to an approximate global optimum for the special case of two servers. Simulations demonstrate an improvement in expected response time of up to ~30% over the greedy policy that routes to the fastest available server.

Submitted to arXiv on 02 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.01147v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Authors Neharika Jali, Guannan Qu, Weina Wang, and Gauri Joshi present a study on efficiently routing jobs in heterogeneous queueing systems. The authors address the challenge of determining an optimal policy for multi-server systems by proposing ACHQ - an efficient policy gradient-based algorithm that utilizes a low-dimensional soft threshold policy parameterization to exploit the underlying queueing structure. They provide guarantees of stationary-point convergence for the general case and demonstrate that despite its low-dimensional parameterization, ACHQ converges to an approximate global optimum for the special case of two servers. Through simulations, they show that ACHQ can improve expected response time by up to approximately 30% compared to a greedy policy that routes jobs to the fastest available server. This research has been accepted for presentation at AISTATS 2024 and offers valuable insights into optimizing job routing in heterogeneous queueing systems using innovative reinforcement learning techniques.
Created on 04 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.