The Matthew Effect of AI Programming Assistants: A Hidden Bias in Software Evolution

AI-generated keywords: AI-assisted programming large language models software engineering algorithmic tasks framework selection

AI-generated Key Points

The impact of AI-assisted programming on software development practices is explored
Large language models (LLMs) influence coding paradigms through concepts like vibe coding and agentic coding
Extensive experiments conducted on algorithmic programming tasks and framework selection tasks to analyze the interaction between AI-driven programming and the software ecosystem
Discovery of a significant Matthew effect where LLM-generated code's success rate is correlated with the popularity of the programming language or framework being used, potentially reinforcing existing hierarchies
Detailed experimental infrastructure outlined for conducting tasks using proprietary LLM APIs and three specific AI programming tools
Methodology involves standardizing prompts across different technologies while maintaining consistent functional requirements
In-depth analysis of AI coding processes, including cleaning up non-executable content from generated responses to ensure only functional code remains
Approach to judging solutions generated by AI assistants using platforms like LeetCode highlighted
Study provides insights into hidden biases present in AI-driven programming assistance at both language and framework levels, opening discussions about their impact on software ecosystem trajectories, innovation, and diversity in software development practices.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Fei Gu, Zi Liang, Hongzong LI, Jiahao MA

arXiv: 2509.23261v1 - DOI (cs.SE)

License: CC BY 4.0

Abstract: AI-assisted programming is rapidly reshaping software development, with large language models (LLMs) enabling new paradigms such as vibe coding and agentic coding. While prior works have focused on prompt design and code generation quality, the broader impact of LLM-driven development on the iterative dynamics of software engineering remains underexplored. In this paper, we conduct large-scale experiments on thousands of algorithmic programming tasks and hundreds of framework selection tasks to systematically investigate how AI-assisted programming interacts with the software ecosystem. Our analysis reveals \textbf{a striking Matthew effect: the more popular a programming language or framework, the higher the success rate of LLM-generated code}. The phenomenon suggests that AI systems may reinforce existing popularity hierarchies, accelerating convergence around dominant tools while hindering diversity and innovation. We provide a quantitative characterization of this effect and discuss its implications for the future evolution of programming ecosystems.

Submitted to arXiv on 27 Sep. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2509.23261v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

The authors of this paper explore the impact of AI-assisted programming on software development practices. They discuss the emergence of large language models (LLMs) and their influence on coding paradigms through concepts like vibe coding and agentic coding. The study conducts extensive experiments on algorithmic programming tasks and framework selection tasks to analyze the interaction between AI-driven programming and the software ecosystem. One notable finding is the discovery of a significant Matthew effect, where LLM-generated code's success rate is directly correlated with the popularity of the programming language or framework being used. This highlights how AI systems may reinforce existing hierarchies and potentially stifle diversity and innovation within the programming community. The paper also outlines a detailed experimental infrastructure for conducting these tasks using proprietary LLM APIs and three specific AI programming tools. The methodology involves standardizing prompts across different technologies while maintaining consistent functional requirements. The authors also delve into an in-depth analysis of AI coding processes, discussing how they clean up non-executable content from generated responses to ensure only functional code remains. Additionally, they explain their approach to judging solutions generated by AI assistants using platforms like LeetCode. Overall, this comprehensive study provides valuable insights into hidden biases present in AI-driven programming assistance at both language and framework levels. By shedding light on these structural biases, it opens up discussions about how they shape software ecosystem trajectories and influence factors like innovation and diversity in software development practices.

- The impact of AI-assisted programming on software development practices is explored
- Large language models (LLMs) influence coding paradigms through concepts like vibe coding and agentic coding
- Extensive experiments conducted on algorithmic programming tasks and framework selection tasks to analyze the interaction between AI-driven programming and the software ecosystem
- Discovery of a significant Matthew effect where LLM-generated code's success rate is correlated with the popularity of the programming language or framework being used, potentially reinforcing existing hierarchies
- Detailed experimental infrastructure outlined for conducting tasks using proprietary LLM APIs and three specific AI programming tools
- Methodology involves standardizing prompts across different technologies while maintaining consistent functional requirements
- In-depth analysis of AI coding processes, including cleaning up non-executable content from generated responses to ensure only functional code remains
- Approach to judging solutions generated by AI assistants using platforms like LeetCode highlighted
- Study provides insights into hidden biases present in AI-driven programming assistance at both language and framework levels, opening discussions about their impact on software ecosystem trajectories, innovation, and diversity in software development practices.

Summary1. Scientists are studying how computers can help people write programs better. 2. Big computer programs can change how we write code by introducing new ideas like vibe coding and agentic coding. 3. They did many tests to see how well computer-generated code works with different programming tasks. 4. They found that popular programming languages make the computer code more successful, which can make some languages more important than others. 5. The researchers created a special way to test these computer programs using specific tools. Definitions- AI-assisted programming: Using computers to help write software. - Large language models (LLMs): Big computer programs that understand and generate human-like text. - Algorithmic programming tasks: Solving problems using step-by-step instructions for computers. - Framework selection tasks: Choosing the best set of tools for building software projects. - Matthew effect: When success leads to more success, creating advantages for already popular things. - Hierarchies: Systems where some things are ranked higher or lower than others based on importance or popularity. - Infrastructure: The basic physical and organizational structures needed for something to work properly. - Functional requirements: Specific features or capabilities that a software program must have to work correctly. - Non-executable content: Text that doesn't directly make the program run but is still part of the code's output. - AI assistants: Computer programs that help people solve problems or complete tasks using artificial intelligence technology. - Hidden biases: Unfair preferences or prejudices that affect decisions without being obvious

The Impact of AI-Assisted Programming on Software Development Practices

In recent years, the use of artificial intelligence (AI) in software development has gained significant attention. With the emergence of large language models (LLMs), there has been a shift towards AI-assisted programming, where developers can rely on machine-generated code to complete tasks. This trend has sparked discussions about its potential impact on traditional coding paradigms and the overall software ecosystem. A research paper titled "The Impact of AI-Assisted Programming on Software Development Practices" delves into this topic by conducting extensive experiments and analyses. The authors explore how LLMs are influencing coding practices through concepts like vibe coding and agentic coding. They also investigate the interaction between AI-driven programming and the software ecosystem by examining algorithmic programming tasks and framework selection tasks.

The Rise of Large Language Models (LLMs)

Large language models refer to advanced natural language processing (NLP) systems that can generate human-like text based on massive amounts of data. These models have significantly improved over time, with some being able to generate coherent paragraphs that are difficult to distinguish from those written by humans. One notable example is OpenAI's GPT-3 model, which contains 175 billion parameters and is trained on a diverse range of internet texts. This model has shown remarkable abilities in completing various NLP tasks, including generating code snippets for different programming languages.

Vibe Coding and Agentic Coding

Vibe coding refers to using an LLM as a creative partner during the development process. Developers can input prompts or ideas into an LLM system, which then generates possible solutions or suggestions for them to consider. On the other hand, agentic coding involves relying entirely on an LLM system for completing a task without any human intervention or input. In this case, developers act more as supervisors rather than actively participating in the coding process.

The Matthew Effect

One of the key findings of this research paper is the discovery of a significant Matthew effect in AI-assisted programming. The term "Matthew effect" refers to the phenomenon where success breeds success, and those who are already successful have an advantage over others. In the context of LLM-generated code, this means that its success rate is directly correlated with the popularity of the programming language or framework being used. This finding highlights how AI systems may reinforce existing hierarchies and potentially stifle diversity and innovation within the programming community.

Experimental Infrastructure and Methodology

To conduct their experiments, the authors developed a detailed experimental infrastructure using proprietary LLM APIs and three specific AI programming tools. They standardized prompts across different technologies while maintaining consistent functional requirements for each task. The methodology also involved cleaning up non-executable content from generated responses to ensure only functional code remains. Additionally, they explain their approach to judging solutions generated by AI assistants using platforms like LeetCode.

Hidden Biases in AI-Driven Programming Assistance

Through their analyses, the authors also shed light on hidden biases present in AI-driven programming assistance at both language and framework levels. These biases can stem from various factors such as data used to train LLMs or inherent biases within programming communities themselves. By bringing attention to these structural biases, this research opens up discussions about how they shape software ecosystem trajectories and influence factors like innovation and diversity in software development practices.

In Conclusion

The research paper "The Impact of AI-Assisted Programming on Software Development Practices" provides valuable insights into how LLMs are influencing coding paradigms and shaping software development practices. Through extensive experiments and analyses, it highlights potential issues such as hidden biases that need to be addressed for a more inclusive and diverse software ecosystem. As technology continues to advance, it is crucial to have these discussions and continuously evaluate the impact of AI on various industries, including software development. By understanding the potential consequences of relying on AI for programming tasks, we can work towards creating a more equitable and innovative future for all developers.

Created on 10 Nov. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

60.9%

An Empirical Study on Usage and Perceptions of LLMs in a Software Engineering…

cs.SE

59.6%

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

cs.SE

57.4%

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intel…

cs.SE

55.0%

Can AI Serve as a Substitute for Human Subjects in Software Engineering Resea…

cs.SE

53.8%

Automated Unit Test Improvement using Large Language Models at Meta

cs.SE

53.5%

Evaluating and Explaining Large Language Models for Code Using Syntactic Stru…

cs.SE

53.5%

Practices and Challenges of Using GitHub Copilot: An Empirical Study

cs.SE

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.