权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

RI: Medium: Assessment of Machine Learning Algorithms in the Wild

RI：媒介：机器学习算法的实际评估

基本信息

批准号：
1900644
负责人：
Padhraic Smyth
金额：
$ 119.99万
依托单位：
University of California-Irvine
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2019
资助国家：
美国
起止时间：
2019-10-01 至 2024-09-30
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1900644&HistoricalAwards=false
关键词：
RI Medium Assessment Machine Learning

项目摘要

Machine learning is now a key aspect behind the smart software that is present in many aspects of our daily lives, including face recognition in cameras, chatbots that can answer questions, and speech recognition and language translation on our mobile phones. The prediction models that drive such software are created automatically by machine learning algorithms trained on large amounts of historical data. One issue with these models is that they are often black-box in nature and difficult for humans to understand. In particular, they can be overconfident in their predictions and are not always able to recognize their own limitations. As these types of machine learning models move into more critical tasks, such as autonomous driving and medical diagnosis, it is becoming increasingly important to understand the limitations of such models in real-world practical situations. This research project will address these issues by investigating new mathematical and algorithmic approaches that can improve our ability to assess the performance and confidence of black-box prediction models, particularly when the models are operating in new environments that they have not encountered before. The outcomes of this research will have the potential to significantly improve the reliability and usability of machine learning systems across a broad range of areas such as medicine, transportation, business, and consumer applications.This project focuses on an aspect of explainable AI concerned with enabling black-box machine learning models (specifically those based on classification and regression) to produce confidence statements about their predictions. This project is pursuing novel methods for understanding the predictions of these models to overcome the implicit overconfidence that otherwise black-and-white, in-or-out classification outcomes can imply. This research project will bring together expertise from cognitive science and computer science in the context of two broad themes. The first theme will focus on developing accurate and robust algorithms that can learn how much confidence to place in a black-box model's predictions. The researchers will investigate new Bayesian calibration methods and develop a broad framework for robust and accurate online assessment of the capabilities of black-box prediction models. The second theme will leverage the algorithmic advances from the first theme to develop new approaches to confidence assessment that can improve the effectiveness of the combined efforts of a black-box predictor and a human decision-maker. This work will in turn provide the basis for trading off prediction accuracy and human effort, and allow for development of techniques that leverage accurate confidence estimates to reduce algorithm aversion and increase trust on the part of the human. Engaging the interest of a broader community will also be a key aspect of the project, with a focus on workshops and hackathons involving under-represented community college students in Southern California, to address broad-ranging questions related to the use of artificial intelligence and machine learning techniques in our everyday world.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

机器学习现在是智能软件背后的一个关键方面，这些智能软件存在于我们日常生活的许多方面，包括相机中的面部识别、可以回答问题的聊天机器人，以及手机上的语音识别和语言翻译。驱动此类软件的预测模型是由经过大量历史数据训练的机器学习算法自动创建的。这些模型的一个问题是，它们在本质上往往是黑盒子，人类很难理解。特别是，他们可能对自己的预测过于自信，并不总是能够认识到自己的局限性。随着这些类型的机器学习模型进入更关键的任务，如自动驾驶和医疗诊断，了解这些模型在现实世界实际情况下的局限性变得越来越重要。该研究项目将通过研究新的数学和算法方法来解决这些问题，这些方法可以提高我们评估黑箱预测模型的性能和信心的能力，特别是当模型在以前从未遇到过的新环境中运行时。这项研究的结果将有可能显著提高机器学习系统在医学、交通、商业和消费者应用等广泛领域的可靠性和可用性。该项目侧重于可解释人工智能的一个方面，涉及使黑箱机器学习模型（特别是那些基于分类和回归的模型）能够产生关于其预测的置信度声明。该项目正在寻求新的方法来理解这些模型的预测，以克服隐性的过度自信，否则黑白分明的分类结果可能暗示。这个研究项目将在两个广泛的主题背景下汇集认知科学和计算机科学的专业知识。第一个主题将侧重于开发准确而稳健的算法，这些算法可以学习在黑盒模型的预测中放置多大的信心。研究人员将研究新的贝叶斯校准方法，并开发一个广泛的框架，用于对黑箱预测模型的能力进行稳健和准确的在线评估。第二个主题将利用第一个主题的算法进展来开发新的信心评估方法，以提高黑箱预测器和人类决策者共同努力的有效性。这项工作将反过来为权衡预测准确性和人类努力提供基础，并允许开发利用准确置信度估计来减少算法厌恶并增加人类信任的技术。吸引更广泛社区的兴趣也将是该项目的一个关键方面，重点是由南加州代表性不足的社区大学生参加的研讨会和黑客马拉松，以解决与在我们的日常世界中使用人工智能和机器学习技术相关的广泛问题。该奖项反映了美国国家科学基金会的法定使命，并通过使用基金会的知识价值和更广泛的影响审查标准进行评估，被认为值得支持。

项目成果

期刊论文数量（12）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Three Challenges for AI-Assisted Decision-Making

DOI：
10.1177/17456916231181102
发表时间：
2023-07-13
期刊：
PERSPECTIVES ON PSYCHOLOGICAL SCIENCE
影响因子：
12.6
作者：
Steyvers，Mark;Kumar，Aakriti
通讯作者：
Kumar，Aakriti

A Brief Tour of Deep Learning from a Statistical Perspective

从统计角度简要介绍深度学习

DOI：
10.1146/annurev-statistics-032921-013738
发表时间：
2023
期刊：
Annual Review of Statistics and Its Application
影响因子：
7.9
作者：
Nalisnick, Eric;Smyth, Padhraic;Tran, Dustin
通讯作者：
Tran, Dustin

AI-Assisted Decision-making: a Cognitive Modeling Approach to Infer Latent Reliance Strategies

人工智能辅助决策：推断潜在依赖策略的认知建模方法

DOI：
10.1007/s42113-022-00157-y
发表时间：
2022
期刊：
Computational Brain & Behavior
影响因子：
0
作者：
Tejeda, Heliodoro;Kumar, Aakriti;Smyth, Padhraic;Steyvers, Mark
通讯作者：
Steyvers, Mark

Capturing Humans’ Mental Models of AI: An Item Response Theory Approach

DOI：
10.1145/3593013.3594111
发表时间：
2023-05
期刊：
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency
影响因子：
0
作者：
M. Kelly;Aakriti Kumar;Padhraic Smyth;M. Steyvers
通讯作者：
M. Kelly;Aakriti Kumar;Padhraic Smyth;M. Steyvers

Combining human predictions with model probabilities via confusion matrices and calibration

通过混淆矩阵和校准将人类预测与模型概率相结合

DOI：
发表时间：
2021
期刊：
Advances in Neural Information Processing Systems (NeurIPS 2021
影响因子：
0
作者：
Kerrigan, Gavin;Smyth, Padhraic;Steyvers, Mark
通讯作者：
Steyvers, Mark

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Padhraic Smyth其他文献

Recursive Neural Networks for Coding Therapist and Patient Behavior in Motivational Interviewing

用于编码动机访谈中治疗师和患者行为的递归神经网络

DOI：
发表时间：
2015
期刊：
CLPsych@HLT-NAACL
影响因子：
0
作者：
Michael J. Tanana;Kevin A. Hallgren;Zac E. Imel;David C. Atkins;Padhraic Smyth;Vivek Srikumar
通讯作者：
Vivek Srikumar