Machine Learning Practices Outside Big Tech: How Resource Constraints Challenge Responsible Development

AI-generated keywords: Machine Learning Selection Bias Responsible Development Explainable AI Human-AI Teaming

AI-generated Key Points

  • Machine learning (ML) practitioners from various backgrounds increasingly use ML methods
  • Studies often focus on Big Tech and academia, neglecting startups, non-tech companies, and the public sector
  • Selection bias excludes broader, lesser-resourced ML community facing challenges in deploying ML with limited resources and increased existential risk
  • Tensions identified in qualitative analysis of 17 interviews: privacy vs. ubiquity, resource management vs. performance optimization, access vs. monopolization
  • Challenges for responsible ML development in resource-constrained organizations include company expectations, bias, explainability, data literacy, model lifecycles, and privacy concerns
  • Mixed opinions on value of implementing ML technology among underrepresented organizations highlight need for further study to address challenges
  • Explainable AI community seeks meaningful mechanisms for understanding ML models; some advocate transparency through example testing
  • Desires for explanations vary among practitioners with concerns about uncertainty communication and overconfidence
  • Onboarding processes for ML products/tools should focus on continual evolution based on user interactions with the model to ensure effective human-AI teaming
  • Holistic understanding of limitations in responsible ML development is essential to guide future research agendas benefiting all stakeholders in the machine learning community
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Aspen Hopkins, Serena Booth

AAAI/ACM Conference on AI, Ethics, and Society 2021
License: CC BY 4.0

Abstract: Practitioners from diverse occupations and backgrounds are increasingly using machine learning (ML) methods. Nonetheless, studies on ML Practitioners typically draw populations from Big Tech and academia, as researchers have easier access to these communities. Through this selection bias, past research often excludes the broader, lesser-resourced ML community -- for example, practitioners working at startups, at non-tech companies, and in the public sector. These practitioners share many of the same ML development difficulties and ethical conundrums as their Big Tech counterparts; however, their experiences are subject to additional under-studied challenges stemming from deploying ML with limited resources, increased existential risk, and absent access to in-house research teams. We contribute a qualitative analysis of 17 interviews with stakeholders from organizations which are less represented in prior studies. We uncover a number of tensions which are introduced or exacerbated by these organizations' resource constraints -- tensions between privacy and ubiquity, resource management and performance optimization, and access and monopolization. Increased academic focus on these practitioners can facilitate a more holistic understanding of ML limitations, and so is useful for prescribing a research agenda to facilitate responsible ML development for all.

Submitted to arXiv on 06 Oct. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2110.02932v1

In the realm of machine learning (ML), practitioners from various backgrounds are increasingly utilizing ML methods. However, studies often focus on populations from Big Tech and academia, neglecting those in startups, non-tech companies, and the public sector. This selection bias excludes the broader, lesser-resourced ML community facing challenges in deploying ML with limited resources and increased existential risk. A qualitative analysis of 17 interviews with stakeholders from underrepresented organizations reveals tensions between privacy and ubiquity, resource management and performance optimization, and access and monopolization. The challenges of responsible ML development for resource-constrained organizations include company expectations, bias, explainability, data literacy, model lifecycles, and privacy concerns. Despite mixed opinions on the value of implementing ML technology among these organizations, there is a need for further study to address these challenges. The explainable AI community seeks meaningful mechanisms for understanding ML models while some practitioners advocate for transparency through example testing. However, desires for explanations vary among practitioners with concerns about uncertainty communication and overconfidence. Onboarding processes for ML products and tools remain underexplored but should focus on continual evolution based on user interactions with the model to ensure effective human-AI teaming. Overall,a more holistic understanding of limitations in responsible ML development is essential to guide future research agendas that benefit all stakeholders in the machine learning community.
Created on 11 Nov. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.