Dynamic Pricing on E-commerce Platform with Deep Reinforcement Learning

AI-generated keywords: Dynamic Pricing E-commerce Deep Reinforcement Learning Markov Decision Process Revenue Conversion Rates

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors introduce a framework for dynamic pricing in E-commerce using deep reinforcement learning (DRL)
  • Model dynamic pricing as a Markov Decision Process (MDP) with four groups of business data representing different states
  • Three key contributions compared to existing DRL-based dynamic pricing algorithms:
  • Extend discrete set problem to continuous price set for more flexibility and accuracy
  • Introduce novel metric called Difference of Revenue Conversion Rates (DRCR) as reward function
  • Address cold-start issue of MDP through pre-training with historical sales data
  • Offline assessments on real datasets from Alibaba Inc. and online field experiments on Tmall.com show superiority of DRCR over traditional revenue-based metrics
  • Continuous price sets outperform discrete sets in extensive field experiments on 1000 stock keeping units (SKUs)
  • Framework surpasses manual pricing strategies implemented by operational experts, showcasing superior performance in online retail environments
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jiaxi Liu, Yidong Zhang, Xiaoqing Wang, Yuming Deng, Xingyu Wu

9 pages, 7 figures

Abstract: In this paper we present an end-to-end framework for addressing the problem of dynamic pricing on E-commerce platform using methods based on deep reinforcement learning (DRL). By using four groups of different business data to represent the states of each time period, we model the dynamic pricing problem as a Markov Decision Process (MDP). Compared with the state-of-the-art DRL-based dynamic pricing algorithms, our approaches make the following three contributions. First, we extend the discrete set problem to the continuous price set. Second, instead of using revenue as the reward function directly, we define a new function named difference of revenue conversion rates (DRCR). Third, the cold-start problem of MDP is tackled by pre-training and evaluation using some carefully chosen historical sales data. Our approaches are evaluated by both offline evaluation method using real dataset of Alibaba Inc., and online field experiments on Tmall.com, a major online shopping website owned by Alibaba Inc.. In particular, experiment results suggest that DRCR is a more appropriate reward function than revenue, which is widely used by current literature. In the end, field experiments, which last for months on 1000 stock keeping units (SKUs) of products demonstrate that continuous price sets have better performance than discrete sets and show that our approaches significantly outperformed the manual pricing by operation experts.

Submitted to arXiv on 05 Dec. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1912.02572v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Dynamic Pricing on E-commerce Platform with Deep Reinforcement Learning," authors Jiaxi Liu, Yidong Zhang, Xiaoqing Wang, Yuming Deng, and Xingyu Wu introduce an innovative framework for addressing dynamic pricing challenges in E-commerce using deep reinforcement learning (DRL) techniques. The study focuses on modeling the dynamic pricing problem as a Markov Decision Process (MDP) by utilizing four distinct groups of business data to represent different states during each time period. The researchers propose three key contributions compared to existing DRL-based dynamic pricing algorithms. Firstly, they extend the traditional discrete set problem to a continuous price set, enhancing the flexibility and accuracy of pricing decisions. Secondly, instead of directly using revenue as the reward function, they introduce a novel metric called the difference of revenue conversion rates (DRCR), which proves to be more effective in optimizing pricing strategies. Thirdly, they address the cold-start issue of MDP by pre-training and evaluating models with carefully selected historical sales data. To evaluate their approach, the team conducts offline assessments using real datasets from Alibaba Inc. and online field experiments on Tmall.com, a prominent online shopping platform owned by Alibaba Inc. The results indicate that DRCR outperforms traditional revenue-based metrics commonly used in literature. Furthermore, extensive field experiments conducted over several months on 1000 stock keeping units (SKUs) demonstrate that continuous price sets yield superior performance compared to discrete sets. Ultimately, the researchers show that their framework significantly surpasses manual pricing strategies implemented by operational experts. Overall, this study provides valuable insights into leveraging deep reinforcement learning for dynamic pricing optimization in E-commerce settings and highlights the importance of innovative reward functions and continuous price sets for achieving superior performance in online retail environments.
Created on 12 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.