Schrodinger's Memory: Large Language Models

AI-generated keywords: Memory Large Language Models Universal Approximation Theorem Schrödinger's memory Reasoning

AI-generated Key Points

  • Memory is a crucial aspect of human cognition and serves as the foundation for daily activities.
  • Large Language Models (LLMs) exhibit behavior similar to human memory, but the underlying mechanism in LLMs has not been thoroughly explored.
  • The paper uses the Universal Approximation Theorem (UAT) to explain LLMs' memory mechanism and conducts experiments to assess their memory abilities.
  • Introduces the concept of "Schrödinger's memory," suggesting that an LLM's memory only becomes observable when queried.
  • Comparisons between LLM memory and human memory highlight similarities and differences in operational mechanisms.
  • Poems are dynamically generated by LLMs based on input, similar to how human memories are recalled through specific prompts.
  • Both the brain and LLMs operate by dynamically fitting outputs based on inputs, indicating a shared fundamental mechanism of reasoning ability.
  • Research extends this concept to other cognitive abilities such as social skills and creativity, attributing them to reasoning based on existing knowledge and inputs.
  • Despite exhibiting reasoning capabilities and creativity aligned with linguistic conventions, LLMs may underperform in reasoning tasks due to factors like model size and data quality/quantity.
  • LLMs have become integral tools impacting various fields like machine translation, text summarization, and sentiment analysis.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Wei Wang, Qing Li

License: CC BY 4.0

Abstract: Memory is the foundation of all human activities; without memory, it would be nearly impossible for people to perform any task in daily life. With the development of Large Language Models (LLMs), their language capabilities are becoming increasingly comparable to those of humans. But do LLMs have memory? Based on current performance, LLMs do appear to exhibit memory. So, what is the underlying mechanism of this memory? Previous research has lacked a deep exploration of LLMs' memory capabilities and the underlying theory. In this paper, we use Universal Approximation Theorem (UAT) to explain the memory mechanism in LLMs. We also conduct experiments to verify the memory capabilities of various LLMs, proposing a new method to assess their abilities based on these memory ability. We argue that LLM memory operates like Schr\"odinger's memory, meaning that it only becomes observable when a specific memory is queried. We can only determine if the model retains a memory based on its output in response to the query; otherwise, it remains indeterminate. Finally, we expand on this concept by comparing the memory capabilities of the human brain and LLMs, highlighting the similarities and differences in their operational mechanisms.

Submitted to arXiv on 16 Sep. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2409.10482v3

Memory is a crucial aspect of human cognition and serves as the foundation for daily activities. With the rapid advancement of Large Language Models (LLMs), some models exhibit behavior similar to human memory. However, the underlying mechanism of memory in LLMs has not been thoroughly explored. This paper utilizes the Universal Approximation Theorem (UAT) to explain this mechanism and conducts experiments to assess LLMs' memory abilities. It introduces the concept of "Schrödinger's memory," suggesting that an LLM's memory only becomes observable when queried. By comparing LLM memory to human memory, similarities and differences in operational mechanisms are highlighted. The study delves into how poems are dynamically generated by LLMs based on input, similar to how human memories are recalled through specific prompts. It suggests that both the brain and LLMs operate by dynamically fitting outputs based on inputs, indicating a shared fundamental mechanism of reasoning ability. The research extends this concept to other cognitive abilities such as social skills and creativity, attributing them to the capacity for reasoning based on existing knowledge and inputs. Despite exhibiting reasoning capabilities and creativity in generating outputs aligned with linguistic conventions, LLMs may underperform in reasoning tasks due to factors like model size and data quality/quantity. However, they have become integral tools impacting various fields like machine translation, text summarization, and sentiment analysis. Understanding the intricate workings of memory in LLMs not only sheds light on their cognitive processes but also provides insights into the broader landscape of artificial intelligence research and its implications for society.
Created on 11 Jan. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.