Subjectivity in the Creation of Machine Learning Models

Author:

Cummings Mary L.1,Li Songpo1

Affiliation:

1. Electrical and Computer Engineering, Duke University, Durham, NC

Abstract

Transportation analysts are inundated with requests to apply popular machine learning modeling techniques to datasets to uncover never-before-seen relationships that could potentially revolutionize safety, congestion, and mobility. However, the results from such models can be influenced not just by biases in underlying data, but also through practitioner-induced biases. To demonstrate the significant number of subjective judgments made in the development and interpretation of machine learning models, we developed Logistic Regression and Neural Network models for transportation-focused datasets including those looking at driving injury/fatalities and pedestrian fatalities. We then developed five different representations of feature importance for each dataset, including different feature interpretations commonly used in the machine learning community. Twelve distinct judgments were highlighted in the development and interpretation of these models, which produced inconsistent results. Such inconsistencies can lead to very different interpretations of the results, which can lead to errors of commission and omission, with significant cost and safety implications if policies are erroneously adapted from such outcomes.

Funder

US Department of Transportation University Transportation Center in partnership with the University of North Carolina Collaborative Science Center for Road Safety

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems and Management,Information Systems

Reference36 articles.

1. AI is sending people to jail—and getting it wrong;Hao K.;Technol. Rev. Retrieved,2019

2. IBM Watson, heal thyself: How IBM overpromised and underdelivered on AI health care

3. M. MacCarthy. 2020. AI needs more regulation not less. Brookings Institution Washington DC. https://www.brookings.edu/research/ai-needs-more-regulation-not-less/. M. MacCarthy. 2020. AI needs more regulation not less. Brookings Institution Washington DC. https://www.brookings.edu/research/ai-needs-more-regulation-not-less/.

4. J. Vincent. 2019. The Verge. Google and Microsoft warn investors that bad AI could harm their brand. Retrieved from https://www.theverge.com/2019/2/11/18220050/google-microsoft-ai-brand-damage-investors-10-k-filing. J. Vincent. 2019. The Verge. Google and Microsoft warn investors that bad AI could harm their brand. Retrieved from https://www.theverge.com/2019/2/11/18220050/google-microsoft-ai-brand-damage-investors-10-k-filing.

5. N. T. Lee P. Resnick and G. Barton. 2019. Algorithmic bias detection and mitigation: Best practices and policies to reduce consumer harms. Brookings Institution Washington DC. https://www.brookings.edu/research/algorithmic-bias-detection-and-mitigation-best-practices-and-policies-to-reduce-consumer-harms/. N. T. Lee P. Resnick and G. Barton. 2019. Algorithmic bias detection and mitigation: Best practices and policies to reduce consumer harms. Brookings Institution Washington DC. https://www.brookings.edu/research/algorithmic-bias-detection-and-mitigation-best-practices-and-policies-to-reduce-consumer-harms/.

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Towards Human-AI Teaming to Mitigate Alert Fatigue in Security Operations Centres;ACM Transactions on Internet Technology;2024-07-15

2. Impact of Data Heterogeneity on AI/ML Model Accuracy in Assisting Pneumonia Type Prediction;2024 IEEE International Conference on Industry 4.0, Artificial Intelligence, and Communications Technology (IAICT);2024-07-04

3. Challenges and opportunities when bringing machines onto the team: Human-AI teaming and flood evacuation decisions;Environmental Modelling & Software;2024-04

4. Conversation N: Visualization Installation Design Based on Voice Interaction;Communications in Computer and Information Science;2023-12-12

5. Will the Artificial Intelligence Ever Be Able to “Achieve” Buddhahood?;Perspectives on Artificial Intelligence in Times of Turbulence;2023-11-24

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3