Edge AI without Compromise: Efficient, Versatile and Accurate Neurocomputing in Resistive Random-Access Memory

AI-generated keywords: NeuRRAM Edge AI Cloud-level Artificial Intelligence Compute-in-Memory (CIM) Resistive Random-Access Memory (RRAM)

AI-generated Key Points

  • Edge hardware development is crucial for cloud-level AI functionalities at the edge of the internet
  • NeuRRAM is the first multimodal edge AI chip using RRAM CIM
  • NeuRRAM delivers high versatility and energy-efficiency 5-8 times better than prior art across various computational bit-precisions
  • Inference accuracy is comparable to software models with 4-bit weights on all measured standard AI benchmarks, including impressive results on MNIST image classification, CIFAR-10 image classification, and Google speech command recognition
  • NeuRRAM achieved a 70% reduction in image reconstruction error on a Bayesian image recovery task
  • These results pave the way towards building highly efficient and reconfigurable edge AI hardware platforms for more demanding and heterogeneous AI applications in the future.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Weier Wan (Stanford University), Rajkumar Kubendran (University of California San Diego), Clemens Schaefer (University of Notre Dame), S. Burc Eryilmaz (Stanford University), Wenqiang Zhang (Tsinghua University), Dabin Wu (Tsinghua University), Stephen Deiss (University of California San Diego), Priyanka Raina (Stanford University), He Qian (Tsinghua University), Bin Gao (Tsinghua University), Siddharth Joshi (University of Notre Dame), Huaqiang Wu (Tsinghua University), H. -S. Philip Wong (Stanford University), Gert Cauwenberghs (University of California San Diego)

34 pages, 14 figures, 1 table
License: CC BY 4.0

Abstract: Realizing today's cloud-level artificial intelligence functionalities directly on devices distributed at the edge of the internet calls for edge hardware capable of processing multiple modalities of sensory data (e.g. video, audio) at unprecedented energy-efficiency. AI hardware architectures today cannot meet the demand due to a fundamental "memory wall": data movement between separate compute and memory units consumes large energy and incurs long latency. Resistive random-access memory (RRAM) based compute-in-memory (CIM) architectures promise to bring orders of magnitude energy-efficiency improvement by performing computation directly within memory. However, conventional approaches to CIM hardware design limit its functional flexibility necessary for processing diverse AI workloads, and must overcome hardware imperfections that degrade inference accuracy. Such trade-offs between efficiency, versatility and accuracy cannot be addressed by isolated improvements on any single level of the design. By co-optimizing across all hierarchies of the design from algorithms and architecture to circuits and devices, we present NeuRRAM - the first multimodal edge AI chip using RRAM CIM to simultaneously deliver a high degree of versatility for diverse model architectures, record energy-efficiency $5\times$ - $8\times$ better than prior art across various computational bit-precisions, and inference accuracy comparable to software models with 4-bit weights on all measured standard AI benchmarks including accuracy of 99.0% on MNIST and 85.7% on CIFAR-10 image classification, 84.7% accuracy on Google speech command recognition, and a 70% reduction in image reconstruction error on a Bayesian image recovery task. This work paves a way towards building highly efficient and reconfigurable edge AI hardware platforms for the more demanding and heterogeneous AI applications of the future.

Submitted to arXiv on 17 Aug. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2108.07879v1

The development of edge hardware capable of processing multiple modalities of sensory data at unprecedented energy-efficiency is crucial for realizing cloud-level artificial intelligence functionalities directly on distributed devices at the edge of the internet. To address trade-offs between efficiency, versatility, and accuracy, researchers have co-optimized across all hierarchies of design from algorithms and architecture to circuits and devices to create NeuRRAM - the first multimodal edge AI chip using resistive random-access memory (RRAM) CIM. This innovative chip delivers a high degree of versatility for diverse model architectures while recording energy-efficiency 5-8 times better than prior art across various computational bit-precisions. Inference accuracy is comparable to software models with 4-bit weights on all measured standard AI benchmarks including an impressive 99.0% accuracy on MNIST image classification, 85.7% accuracy on CIFAR-10 image classification, and 84.7% accuracy on Google speech command recognition. Additionally, NeuRRAM achieved a 70% reduction in image reconstruction error on a Bayesian image recovery task. These results pave the way towards building highly efficient and reconfigurable edge AI hardware platforms for more demanding and heterogeneous AI applications in the future.
Created on 06 May. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.