The Matthew Effect of AI Programming Assistants: A Hidden Bias in Software Evolution

AI-generated keywords: AI-assisted programming large language models software engineering algorithmic tasks framework selection

AI-generated Key Points

  • The impact of AI-assisted programming on software development practices is explored
  • Large language models (LLMs) influence coding paradigms through concepts like vibe coding and agentic coding
  • Extensive experiments conducted on algorithmic programming tasks and framework selection tasks to analyze the interaction between AI-driven programming and the software ecosystem
  • Discovery of a significant Matthew effect where LLM-generated code's success rate is correlated with the popularity of the programming language or framework being used, potentially reinforcing existing hierarchies
  • Detailed experimental infrastructure outlined for conducting tasks using proprietary LLM APIs and three specific AI programming tools
  • Methodology involves standardizing prompts across different technologies while maintaining consistent functional requirements
  • In-depth analysis of AI coding processes, including cleaning up non-executable content from generated responses to ensure only functional code remains
  • Approach to judging solutions generated by AI assistants using platforms like LeetCode highlighted
  • Study provides insights into hidden biases present in AI-driven programming assistance at both language and framework levels, opening discussions about their impact on software ecosystem trajectories, innovation, and diversity in software development practices.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Fei Gu, Zi Liang, Hongzong LI, Jiahao MA

License: CC BY 4.0

Abstract: AI-assisted programming is rapidly reshaping software development, with large language models (LLMs) enabling new paradigms such as vibe coding and agentic coding. While prior works have focused on prompt design and code generation quality, the broader impact of LLM-driven development on the iterative dynamics of software engineering remains underexplored. In this paper, we conduct large-scale experiments on thousands of algorithmic programming tasks and hundreds of framework selection tasks to systematically investigate how AI-assisted programming interacts with the software ecosystem. Our analysis reveals \textbf{a striking Matthew effect: the more popular a programming language or framework, the higher the success rate of LLM-generated code}. The phenomenon suggests that AI systems may reinforce existing popularity hierarchies, accelerating convergence around dominant tools while hindering diversity and innovation. We provide a quantitative characterization of this effect and discuss its implications for the future evolution of programming ecosystems.

Submitted to arXiv on 27 Sep. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2509.23261v1

The authors of this paper explore the impact of AI-assisted programming on software development practices. They discuss the emergence of large language models (LLMs) and their influence on coding paradigms through concepts like vibe coding and agentic coding. The study conducts extensive experiments on algorithmic programming tasks and framework selection tasks to analyze the interaction between AI-driven programming and the software ecosystem. One notable finding is the discovery of a significant Matthew effect, where LLM-generated code's success rate is directly correlated with the popularity of the programming language or framework being used. This highlights how AI systems may reinforce existing hierarchies and potentially stifle diversity and innovation within the programming community. The paper also outlines a detailed experimental infrastructure for conducting these tasks using proprietary LLM APIs and three specific AI programming tools. The methodology involves standardizing prompts across different technologies while maintaining consistent functional requirements. The authors also delve into an in-depth analysis of AI coding processes, discussing how they clean up non-executable content from generated responses to ensure only functional code remains. Additionally, they explain their approach to judging solutions generated by AI assistants using platforms like LeetCode. Overall, this comprehensive study provides valuable insights into hidden biases present in AI-driven programming assistance at both language and framework levels. By shedding light on these structural biases, it opens up discussions about how they shape software ecosystem trajectories and influence factors like innovation and diversity in software development practices.
Created on 10 Nov. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.