Automatic Detection of Five API Documentation Smells: Practitioners' Perspectives

AI-generated keywords: API Documentation Smells Software Development Automated Detection Quality

AI-generated Key Points

⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

API documentation plays a crucial role in learning and utilizing an API, similar to source code
Poorly designed documentation can hinder the ease of reusing API features
'Smells' in API documentation are indicators of bad documentation styles that impede understanding and usability
The research by Khan et al. identifies five distinct types of API documentation smells
A survey with 21 professional software developers confirmed the prevalence of these smells in existing API documentation and their negative impact on productivity
The authors developed tools using rule-based techniques and deep learning classifiers to automatically detect and rectify these issues
Their best-performing classifier, BERT, achieved impressive F1-scores ranging from 0.75 to 0.97
This study addresses the lack of prior research on automatically detecting API documentation smells and aims to enhance developer experiences for more efficient software development practices

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Junaed Younus Khan, Md. Tawkat Islam Khondaker, Gias Uddin, Anindya Iqbal

2021 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)

arXiv: 2102.08486v1 - DOI (cs.SE)

License: NONEXCLUSIVE-DISTRIB 1.0

Abstract: The learning and usage of an API is supported by official documentation. Like source code, API documentation is itself a software product. Several research results show that bad design in API documentation can make the reuse of API features difficult. Indeed, similar to code smells or code antipatterns, poorly designed API documentation can also exhibit 'smells'. Such documentation smells can be described as bad documentation styles that do not necessarily produce an incorrect documentation but nevertheless make the documentation difficult to properly understand and to use. Recent research on API documentation has focused on finding content inaccuracies in API documentation and to complement API documentation with external resources (e.g., crowd-shared code examples). We are aware of no research that focused on the automatic detection of API documentation smells. This paper makes two contributions. First, we produce a catalog of five API documentation smells by consulting literature on API documentation presentation problems. We create a benchmark dataset of 1,000 API documentation units by exhaustively and manually validating the presence of the five smells in Java official API reference and instruction documentation. Second, we conduct a survey of 21 professional software developers to validate the catalog. The developers agreed that they frequently encounter all five smells in API official documentation and 95.2% of them reported that the presence of the documentation smells negatively affects their productivity. The participants wished for tool support to automatically detect and fix the smells in API official documentation. We develop a suite of rule-based, deep and shallow machine learning classifiers to automatically detect the smells. The best performing classifier BERT, a deep learning model, achieves F1-scores of 0.75 - 0.97.

Submitted to arXiv on 16 Feb. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2102.08486v1

⚠This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Comprehensive Summary
Key points
Layman's Summary
Blog article

In their paper titled "Automatic Detection of Five API Documentation Smells: Practitioners' Perspectives," authors Junaed Younus Khan, Md. Tawkat Islam Khondaker, Gias Uddin, and Anindya Iqbal delve into the crucial role that official documentation plays in the learning and utilization of an API. They highlight how API documentation, akin to source code, is a software product in itself and emphasize the detrimental impact that poorly designed documentation can have on the ease of reusing API features. Drawing parallels to code smells or antipatterns, the authors identify 'smells' in API documentation as indicators of bad documentation styles that hinder proper understanding and usability. The research conducted by Khan et al. focuses on addressing these documentation smells through the development of a catalog comprising five distinct types. To validate their findings, they meticulously examined 1,000 API documentation units within Java's official API reference and instruction documents. Additionally, a survey involving 21 professional software developers confirmed that these identified smells are prevalent in existing API documentation and significantly impede productivity. One notable aspect of this study is the absence of prior research on automatically detecting API documentation smells. To bridge this gap, the authors employ a combination of rule-based techniques and deep learning classifiers to create tools capable of identifying and rectifying these issues automatically. Their best-performing classifier, BERT - a deep learning model - achieves impressive F1-scores ranging from 0.75 to 0.97. Presented at the 2021 IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), this paper sheds light on an often overlooked aspect of software development - the quality of API documentation. By providing insights into common pitfalls and offering solutions for improvement through automated detection methods, Khan et al. 's work aims to enhance developer experiences and streamline the utilization of APIs for more efficient software development practices.

- API documentation plays a crucial role in learning and utilizing an API, similar to source code
- Poorly designed documentation can hinder the ease of reusing API features
- 'Smells' in API documentation are indicators of bad documentation styles that impede understanding and usability
- The research by Khan et al. identifies five distinct types of API documentation smells
- A survey with 21 professional software developers confirmed the prevalence of these smells in existing API documentation and their negative impact on productivity
- The authors developed tools using rule-based techniques and deep learning classifiers to automatically detect and rectify these issues
- Their best-performing classifier, BERT, achieved impressive F1-scores ranging from 0.75 to 0.97
- This study addresses the lack of prior research on automatically detecting API documentation smells and aims to enhance developer experiences for more efficient software development practices

SummaryAPI documentation is like a guide that helps us understand and use APIs. Bad documentation can make it hard to use API features. 'Smells' in documentation show when something is not explained well. Researchers found five types of these 'smells'. They made tools, like BERT, to find and fix these issues automatically. Definitions- API: A set of rules and tools for building software applications. - Documentation: Information or instructions that explain how something works. - Smells: Signs or hints that something may be wrong or poorly done. - Classifier: A tool that sorts things into different categories based on certain criteria. - F1-score: A measure of how accurate a classifier is at identifying things correctly.

Introduction

APIs (Application Programming Interfaces) are crucial components of modern software development, allowing developers to access pre-built functions and features without having to write them from scratch. However, the effectiveness of an API heavily relies on its documentation - a set of instructions and reference materials that guide developers in understanding and utilizing its capabilities. In their paper titled "Automatic Detection of Five API Documentation Smells: Practitioners' Perspectives," Junaed Younus Khan et al. delve into the importance of high-quality API documentation and how poorly designed documentation can hinder productivity.

The Role of API Documentation

Similar to source code, API documentation is a software product in itself. It serves as a communication tool between the creators of the API and its users - providing essential information such as usage guidelines, parameter descriptions, error handling procedures, etc. Good documentation not only helps developers understand how to use an API but also saves time by reducing trial-and-error attempts.

The Impact of Poorly Designed Documentation

Khan et al.'s research highlights the detrimental impact that bad documentation can have on the ease of reusing API features. They identify 'smells' in API documentation as indicators of poor design choices that impede proper understanding and usability. These smells can lead to confusion, errors, and ultimately decrease developer productivity.

Identifying Documentation Smells

To validate their findings, Khan et al. conducted a thorough analysis of 1,000 Java official API reference documents for common patterns or smells in existing documentation styles. Through this process, they identified five distinct types: 1) Incomplete Information: This smell refers to missing or insufficient information about an API's functionality or usage guidelines. 2) Misleading Information: Incorrect or misleading information provided in the documentation. 3) Inconsistent Information: Contradictory or conflicting information within the documentation. 4) Unnecessary Information: Irrelevant or redundant information that adds no value to the understanding of an API. 5) Inadequate Formatting: Poorly structured or formatted documentation that makes it challenging to read and comprehend.

Solutions for Improvement

The authors' aim is not only to identify these smells but also to provide solutions for improvement. To achieve this, they developed a catalog of these five smells, along with corresponding detection rules and automated tools.

Detection Techniques

One notable aspect of this study is the absence of prior research on automatically detecting API documentation smells. To bridge this gap, Khan et al. employ a combination of rule-based techniques and deep learning classifiers. These techniques analyze the text in API documents and flag potential instances of smells based on predefined rules.

The BERT Classifier

To further improve their detection methods, the authors also experimented with deep learning models - specifically BERT (Bidirectional Encoder Representations from Transformers). This model achieved impressive F1-scores ranging from 0.75 to 0.97, outperforming all other classifiers used in the study.

Validation through Survey

To validate their findings, Khan et al. conducted a survey involving 21 professional software developers who were asked to evaluate existing API documentation for common smells identified by the researchers. The results showed that these smells are prevalent in current API documentation and significantly hinder productivity.

Conclusion

Presented at the 2021 IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), Khan et al.'s paper sheds light on an often overlooked aspect of software development - the quality of API documentation. By providing insights into common pitfalls and offering solutions for improvement through automated detection methods, their work aims to enhance developer experiences and streamline the utilization of APIs for more efficient software development practices. This research serves as a valuable resource for API creators and developers alike, emphasizing the importance of well-designed documentation in maximizing the potential of APIs.

Created on 23 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

⚠The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

75.8%

Towards Automating Code Review Activities

cs.SE

74.9%

Code Quality Evaluation Methodology Using The ISO/IEC 9126 Standard

cs.SE

74.3%

Applying Machine Learning Analysis for Software Quality Test

cs.SE

73.4%

A Study of Documentation for Software Architecture

cs.SE

72.6%

Resist the Hype! Practical Recommendations to Cope With Résumé-Driven Develop…

cs.SE

72.3%

Assessing AI Detectors in Identifying AI-Generated Code: Implications for Edu…

cs.SE

72.2%

Automatic Code Documentation Generation Using GPT-3

cs.SE

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.