PACE: Improving Prompt with Actor-Critic Editing for Large Language Model

AI-generated keywords: PACE LLM Actor-Critic Performance Prompts

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper introduces a method called Prompt with Actor-Critic Editing (PACE) to enhance the performance of large language models (LLMs) by automatically editing prompts.
  • LLMs are affected by the quality of human-written prompts, and PACE addresses this issue by leveraging the actor-critic algorithm from reinforcement learning.
  • PACE treats LLMs as both actors and critics, refining prompts based on feedback from both actors performing the prompt and critics criticizing the response.
  • This process allows LLMs to align the prompt more effectively with a specific task by incorporating real responses and thinking from LLMs.
  • Extensive experiments on 24 instruction induction tasks and 21 big-bench tasks demonstrate that PACE significantly improves the relative performance of medium/low-quality human-written prompts by up to 98%.
  • PACE achieves comparable performance to high-quality prompts and exhibits notable efficacy in prompt generation.
  • Overall, PACE offers an automated approach for enhancing LLM performance by refining prompts, reducing the need for manual effort in improving prompt quality.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li

Abstract: Large language models (LLMs) have showcased remarkable potential across various tasks by conditioning on prompts. However, the quality of different human-written prompts leads to substantial discrepancies in LLMs' performance, and improving prompts usually necessitates considerable human effort and expertise. To this end, this paper proposes Prompt with Actor-Critic Editing (PACE) for LLMs to enable automatic prompt editing. Drawing inspiration from the actor-critic algorithm in reinforcement learning, PACE leverages LLMs as the dual roles of actors and critics, conceptualizing prompt as a type of policy. PACE refines prompt, taking into account the feedback from both actors performing prompt and critics criticizing response. This process helps LLMs better align prompt to a specific task, thanks to real responses and thinking from LLMs. We conduct extensive experiments on 24 instruction induction tasks and 21 big-bench tasks. Experimental results indicate that PACE elevates the relative performance of medium/low-quality human-written prompts by up to 98\%, which has comparable performance to high-quality human-written prompts. Moreover, PACE also exhibits notable efficacy for prompt generation.

Submitted to arXiv on 19 Aug. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2308.10088v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper titled "PACE: Improving Prompt with Actor-Critic Editing for Large Language Model" introduces a method called Prompt with Actor-Critic Editing (PACE) to enhance the performance of large language models (LLMs) by automatically editing prompts. LLMs have shown great potential in various tasks when conditioned on prompts, but the quality of human-written prompts can significantly affect their performance. To address this issue, PACE leverages the actor-critic algorithm from reinforcement learning and treats LLMs as both actors and critics. It considers prompts as a type of policy and refines them based on feedback from both actors performing the prompt and critics criticizing the response. This process allows LLMs to align the prompt more effectively with a specific task by incorporating real responses and thinking from LLMs. The researchers conducted extensive experiments on 24 instruction induction tasks and 21 big-bench tasks. The results demonstrate that PACE significantly improves the relative performance of medium/low-quality human-written prompts by up to 98%, achieving comparable performance to high-quality prompts. Additionally, PACE also exhibits notable efficacy in prompt generation. Overall, PACE offers an automated approach for enhancing LLM performance by refining prompts, reducing the need for manual effort in improving prompt quality.
Created on 28 Sep. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.