DAppSCAN: Building Large-Scale Datasets for Smart Contract Weaknesses in DApp Projects
AI-generated Key Points
⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.
- DAppSCAN is a research project that evaluates tools designed to detect smart contract weaknesses specific to the Ethereum platform.
- The absence of a large, unbiased, real-world dataset has made it difficult to evaluate these tools.
- 22 participants spent 44 person-months analyzing 1,322 open-source audit reports from 30 security teams and identified 10,016 weaknesses.
- Two distinct datasets were developed: DAppSCAN-Source and DAppSCAN-Bytecode.
- The DAppSCAN-Source dataset comprises 25,077 Solidity files sourced from 1,139 real-world DApp projects and features 1,689 SWC vulnerabilities.
- A tool was developed to automatically identify dependency relationships within DApps and complete missing public libraries resulting in the second dataset (DAPPSCAN-Bytecode), which consists of 8,167 compiled smart contract bytecode with 895 SWC weaknesses.
- An empirical study was conducted based on the second dataset (DAPPSCAN-Bytecode) to assess the performance of five state-of-the-art smart contract vulnerability detection tools.
- The results revealed subpar performance for these tools in terms of both effectiveness and success detection rate.
- Future development should prioritize real-world datasets over simplistic toy contracts.
- Overall, DAppSCAN provides a valuable contribution to the field by creating large-scale datasets for smart contract weaknesses in real-world DApp projects.
Authors: Zibin Zheng, Jianzhong Su, Jiachi Chen, David Lo, Zhijie Zhong, Mingxi Ye
Abstract: The Smart Contract Weakness Classification Registry (SWC Registry) is a widely recognized list of smart contract weaknesses specific to the Ethereum platform. In recent years, significant research efforts have been dedicated to building tools to detect SWC weaknesses. However, evaluating these tools has proven challenging due to the absence of a large, unbiased, real-world dataset. To address this issue, we recruited 22 participants and spent 44 person-months analyzing 1,322 open-source audit reports from 30 security teams. In total, we identified 10,016 weaknesses and developed two distinct datasets, i.e., DAppSCAN-Source and DAppSCAN-Bytecode. The DAppSCAN-Source dataset comprises 25,077 Solidity files, featuring 1,689 SWC vulnerabilities sourced from 1,139 real-world DApp projects. The Solidity files in this dataset may not be directly compilable. To enable the dataset to be compilable, we developed a tool capable of automatically identifying dependency relationships within DApps and completing missing public libraries. By utilizing this tool, we created our DAPPSCAN-Bytecode dataset, which consists of 8,167 compiled smart contract bytecode with 895 SWC weaknesses. Based on the second dataset, we conducted an empirical study to assess the performance of five state-of-the-art smart contract vulnerability detection tools. The evaluation results revealed subpar performance for these tools in terms of both effectiveness and success detection rate, indicating that future development should prioritize real-world datasets over simplistic toy contracts.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
⚠The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.