Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task

AI-generated keywords: Large Language Models Neural and Behavioral Effects Essay Writing Tasks EEG Analysis Natural Language Processing

AI-generated Key Points

  • Study investigated effects of Large Language Models (LLMs) in essay writing tasks on neural and behavioral levels
  • Participants divided into LLM, Search Engine, and Brain-only groups for sessions 1-3
  • EEG used to measure cognitive load during essay writing; NLP analyzed essays
  • Significant differences in brain connectivity among groups: Brain-only strong networks, Search Engine moderate engagement, LLM weak connectivity
  • LLM-to-Brain participants showed reduced alpha and beta connectivity in session 4
  • Brain-to-LLM users exhibited higher memory recall and activation of specific brain areas similar to Search Engine users
  • LLM group had lowest self-reported ownership of essays and struggled with quoting own work accurately
  • Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels compared to other groups
  • Study raises concerns about cognitive costs of relying on LLMs for educational tasks; emphasizes need for further research into AI's role in education
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Nataliya Kosmyna, Eugene Hauptmann, Ye Tong Yuan, Jessica Situ, Xian-Hao Liao, Ashly Vivian Beresnitzky, Iris Braunstein, Pattie Maes

206 pages, 92 figures, 4 tables and appendix
License: CC BY-NC-SA 4.0

Abstract: This study explores the neural and behavioral consequences of LLM-assisted essay writing. Participants were divided into three groups: LLM, Search Engine, and Brain-only (no tools). Each completed three sessions under the same condition. In a fourth session, LLM users were reassigned to Brain-only group (LLM-to-Brain), and Brain-only users were reassigned to LLM condition (Brain-to-LLM). A total of 54 participants took part in Sessions 1-3, with 18 completing session 4. We used electroencephalography (EEG) to assess cognitive load during essay writing, and analyzed essays using NLP, as well as scoring essays with the help from human teachers and an AI judge. Across groups, NERs, n-gram patterns, and topic ontology showed within-group homogeneity. EEG revealed significant differences in brain connectivity: Brain-only participants exhibited the strongest, most distributed networks; Search Engine users showed moderate engagement; and LLM users displayed the weakest connectivity. Cognitive activity scaled down in relation to external tool use. In session 4, LLM-to-Brain participants showed reduced alpha and beta connectivity, indicating under-engagement. Brain-to-LLM users exhibited higher memory recall and activation of occipito-parietal and prefrontal areas, similar to Search Engine users. Self-reported ownership of essays was the lowest in the LLM group and the highest in the Brain-only group. LLM users also struggled to accurately quote their own work. While LLMs offer immediate convenience, our findings highlight potential cognitive costs. Over four months, LLM users consistently underperformed at neural, linguistic, and behavioral levels. These results raise concerns about the long-term educational implications of LLM reliance and underscore the need for deeper inquiry into AI's role in learning.

Submitted to arXiv on 10 Jun. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2506.08872v1

This study investigated the effects of using Large Language Models (LLMs) in essay writing tasks on neural and behavioral levels. Participants were divided into three groups: LLM, Search Engine, and Brain-only (no tools), with each group completing three sessions under the same condition. A fourth session was conducted where participants from the LLM group switched to the Brain-only group (LLM-to-Brain), and participants from the Brain-only group switched to the LLM condition (Brain-to-LLM). A total of 54 participants took part in Sessions 1-3, with 18 completing session 4. The study utilized electroencephalography (EEG) to measure cognitive load during essay writing and analyzed essays using Natural Language Processing (NLP). Essays were also scored by human teachers and an AI judge. The results showed that within-group homogeneity was observed in Named Entity Recognition (NERs), n-gram patterns, and topic ontology. EEG analysis revealed significant differences in brain connectivity among the groups: Brain-only participants exhibited strong and distributed networks, Search Engine users showed moderate engagement, while LLM users displayed weak connectivity. In session 4, LLM-to-Brain participants exhibited reduced alpha and beta connectivity, indicating under-engagement. On the other hand, Brain-to-LLM users showed higher memory recall and activation of specific brain areas similar to Search Engine users. Self-reported ownership of essays was lowest in the LLM group and highest in the Brain-only group. Additionally, LLM users struggled with accurately quoting their own work. Over a four-month period, LLM users consistently underperformed at neural, linguistic, and behavioral levels compared to other groups. These findings suggest potential cognitive costs associated with relying on LLMs for educational tasks. The study raises concerns about the long-term implications of LLM usage in learning contexts and emphasizes the need for further research into AI's role in education. For more detailed information on the experimental design, participant protocols, prompts used during each session, as well as additional data summaries including specific EEG dDTF values, readers are encouraged to visit https://www.brainonllm.com/.
Created on 24 Jun. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.