Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI Era

AI-generated keywords: Revenue Sharing AI Tools Data Providers Scoring System Healthy Ecosystem

AI-generated Key Points

  • AI tools such as ChatGPT and Bard require more and better quality data to continuously improve
  • Sharing revenue between AI tools and their data providers could transform the current hostile zero-sum game relationship into a collaborative and mutually beneficial one that drives forward AI technology and builds a healthy ecosystem
  • Current revenue-sharing business models do not work for AI tools in the forthcoming AI era since new metrics such as prompts and cost per prompt for generative AI tools will replace traditional website-based traffic and action metrics
  • A completely new revenue-sharing business model needs to establish a prompt-based scoring system to measure data engagement of each data provider
  • The proposed scoring system would encourage more data owners to participate in the revenue-sharing program, creating an environment where all parties benefit
  • This model must be almost independent of AI tools and easily explained to data providers, applying not only to large language models but also other types of multimodal AI tools like text-to-image generators or those used in healthcare.
  • Future work should focus on refining this model further so that it can be applied across different sectors beyond language models.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Dong Zhang

22 pages, 8 figures, 2 tables
License: CC BY 4.0

Abstract: With various AI tools such as ChatGPT becoming increasingly popular, we are entering a true AI era. We can foresee that exceptional AI tools will soon reap considerable profits. A crucial question arise: should AI tools share revenue with their training data providers in additional to traditional stakeholders and shareholders? The answer is Yes. Large AI tools, such as large language models, always require more and better quality data to continuously improve, but current copyright laws limit their access to various types of data. Sharing revenue between AI tools and their data providers could transform the current hostile zero-sum game relationship between AI tools and a majority of copyrighted data owners into a collaborative and mutually beneficial one, which is necessary to facilitate the development of a virtuous cycle among AI tools, their users and data providers that drives forward AI technology and builds a healthy AI ecosystem. However, current revenue-sharing business models do not work for AI tools in the forthcoming AI era, since the most widely used metrics for website-based traffic and action, such as clicks, will be replaced by new metrics such as prompts and cost per prompt for generative AI tools. A completely new revenue-sharing business model, which must be almost independent of AI tools and be easily explained to data providers, needs to establish a prompt-based scoring system to measure data engagement of each data provider. This paper systematically discusses how to build such a scoring system for all data providers for AI tools based on classification and content similarity models, and outlines the requirements for AI tools or third parties to build it. Sharing revenue with data providers using such a scoring system would encourage more data owners to participate in the revenue-sharing program. This will be a utilitarian AI era where all parties benefit.

Submitted to arXiv on 04 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.02555v1

As AI tools such as ChatGPT and Bard become increasingly popular, the question arises of whether they should share revenue with their data providers in addition to traditional stakeholders and shareholders. The answer is yes, as large AI tools always require more and better quality data to continuously improve. However, current copyright laws limit their access to various types of data. Sharing revenue between AI tools and their data providers could transform the current hostile zero-sum game relationship into a collaborative and mutually beneficial one that drives forward AI technology and builds a healthy ecosystem. Current revenue-sharing business models do not work for AI tools in the forthcoming AI era since the most widely used metrics for website-based traffic and action will be replaced by new metrics such as prompts and cost per prompt for generative AI tools. A completely new revenue-sharing business model needs to establish a prompt-based scoring system to measure data engagement of each data provider. This paper systematically discusses how to build such a scoring system based on classification and content similarity models, outlining the requirements for third parties or tool developers to build it. The proposed scoring system would encourage more data owners to participate in the revenue-sharing program, creating an environment where all parties benefit. However, this model must be almost independent of AI tools and easily explained to data providers. For example, it could apply not only to large language models but also other types of multimodal AI tools like text-to-image generators or those used in healthcare. In conclusion, sharing revenue with data providers using an effective scoring system would facilitate collaboration among all stakeholders involved in developing exceptional AI tools that reap considerable profits while building a healthy ecosystem. Future work should focus on refining this model further so that it can be applied across different sectors beyond language models.
Created on 12 Jun. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.