KAN: Kolmogorov-Arnold Networks

AI-generated keywords: Kolmogorov-Arnold Networks

AI-generated Key Points

  • Kolmogorov-Arnold Networks (KANs) are innovative alternatives to traditional Multi-Layer Perceptrons (MLPs)
  • KANs feature learnable activation functions on edges for enhanced accuracy and interpretability
  • In mathematics, KANs have demonstrated the ability to rediscover known relations in an unsupervised mode
  • KANs show promise in physics applications such as Anderson localization
  • KANs aid in identifying models with mobility edges and contribute to resolving debates on localization in interacting systems
  • A new paradigm of "AI for Math" is proposed using KANs' unsupervised learning mode to discover additional relations beyond knot invariants
  • Through experimentation, it is evident that KANs offer a valuable tool for exploring complex mathematical relationships and advancing research in both mathematics and physics.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljačić, Thomas Y. Hou, Max Tegmark

48 pages, 20 figures. Codes are available at https://github.com/KindXiaoming/pykan
License: CC BY 4.0

Abstract: Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs). While MLPs have fixed activation functions on nodes ("neurons"), KANs have learnable activation functions on edges ("weights"). KANs have no linear weights at all -- every weight parameter is replaced by a univariate function parametrized as a spline. We show that this seemingly simple change makes KANs outperform MLPs in terms of accuracy and interpretability. For accuracy, much smaller KANs can achieve comparable or better accuracy than much larger MLPs in data fitting and PDE solving. Theoretically and empirically, KANs possess faster neural scaling laws than MLPs. For interpretability, KANs can be intuitively visualized and can easily interact with human users. Through two examples in mathematics and physics, KANs are shown to be useful collaborators helping scientists (re)discover mathematical and physical laws. In summary, KANs are promising alternatives for MLPs, opening opportunities for further improving today's deep learning models which rely heavily on MLPs.

Submitted to arXiv on 30 Apr. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2404.19756v1

Kolmogorov-Arnold Networks (KANs) are innovative alternatives to traditional Multi-Layer Perceptrons (MLPs), featuring learnable activation functions on edges for enhanced accuracy and interpretability. In mathematics, KANs have demonstrated the ability to rediscover known relations in an unsupervised mode, highlighting their reliability. They also show promise in physics applications such as Anderson localization, aiding in identifying models with mobility edges and contributing to resolving debates on localization in interacting systems. A new paradigm of "AI for Math" is proposed using KANs' unsupervised learning mode to discover additional relations beyond knot invariants. Through experimentation, it is evident that KANs offer a valuable tool for exploring complex mathematical relationships and advancing research in both mathematics and physics.
Created on 30 May. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.