Machine Learning Beyond Prediction - Extracting Insights and Guiding Actions
超越预测的机器学习 - 提取见解和指导行动
基本信息
- 批准号:RGPIN-2020-04333
- 负责人:
- 金额:$ 2.99万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2022
- 资助国家:加拿大
- 起止时间:2022-01-01 至 2023-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Owing to breakthroughs in supervised learning using deep neural networks, applications of machine learning (ML) have proliferated, spreading to countless industries and societal decisions. Amid this excitement, some fundamental obstacles are often ignored. While ML systems are typically trained to estimate conditional probabilities, in automated systems their primary purpose is to guide actions. The discrepancy between predictions and decisions is just one among many mismatches between the supervised learning formalism and real-world goals. For example, often the purpose of training a model is not simply to make a prediction but rather to extract qualitative insights, such as causal inference, data clustering or outlier detection.However, when machine learning tools are applied to extract any of those insights, strong assumptions (such as the data being generated by some parameterized probability distribution) are used, often implicitly. Just the same, machine learning is applied far beyond the strict confines of those assumptions. On the other end, for deriving hardness results, most of the theoretical analysis of required resources (be it computational time or training sample sizes) refer to worst-case scenarios, therefore being overly pessimistic. Under that view, large neural networks seem doomed to fail. This project addresses three areas of discrepancy between ML formalism and such real world goals. 1)Interpretability of ML-based tools: Traditional measures of success, such as statistical accuracy and computational efficiency, do not suffice for human consequential applications, where society expects accountability and interpretability. I will analyze formal notions of interpretability and investigate how such notions effect prediction accuracy and render models amenable to monitoring. I will develop theoretical principles under which today's deep learning tools can be leveraged to confer insights beyond their predictive accuracy. 2) Guided selection of clustering algorithms: In spite of the major practical importance of unsupervised learning, current practical implementations of such tasks are very rudimentary. There exists no methodical guidance for clustering tool selection for a given clustering task. I shall address this crucial lacuna by developing methods to guide task appropriate choices of clustering paradigms. 3) Alternatives to worst-case analysis of ML tasks: Many optimization problems that arise in machine learning are NP hard. For example, the training of even small neural networks. Just the same, such problems are being handled routinely on real data for many applications. Experimental evidence suggests that this success relies on some "tameness" of practically arising data. We propose to address this theory-practice discrepancy by distilling structural properties of inputs that can be assumed to hold for naturally arising input data, while giving rise to efficient algorithms for solving hard problems on such inputs.
由于深度神经网络在监督学习方面的突破,机器学习(ML)的应用已经激增,扩展到无数的行业和社会决策。在这种兴奋中,一些根本性的障碍往往被忽视。虽然ML系统通常被训练来估计条件概率,但在自动化系统中,它们的主要目的是指导行动。预测和决策之间的差异只是监督学习形式主义和现实世界目标之间的许多不匹配之一。例如,训练模型的目的通常不是简单地进行预测,而是提取定性的洞察力,例如因果推理、数据聚类或离群值检测。然而,当应用机器学习工具来提取这些洞察力中的任何一种时,通常隐含地使用强假设(例如由某些参数化的概率分布生成的数据)。同样,机器学习的应用远远超出了这些假设的严格限制。另一方面,为了得出困难结果,对所需资源(无论是计算时间还是训练样本大小)的大多数理论分析都提到了最坏的情况,因此过于悲观。在这种观点下,大型神经网络似乎注定要失败。这个项目解决了ML形式主义和这样的现实世界目标之间的三个方面的差异。1)基于ML的工具的可解释性:传统的成功衡量标准,如统计准确性和计算效率,不足以满足人类相应的应用,因为社会期望问责和可解释性。我将分析可解释性的正式概念,并调查这些概念如何影响预测精度和使模型易于监控。我将制定理论原则,在这些原则下,今天的深度学习工具可以被利用来提供超出其预测准确性的见解。2)有指导地选择聚类算法:尽管无监督学习具有重要的实际意义,但目前这类任务的实际实现非常初级。对于给定的集群任务,对于集群工具的选择没有系统的指导。我将通过开发指导任务适当选择集群范例的方法来解决这一关键缺陷。3)ML任务最坏情况分析的替代方法:机器学习中出现的许多优化问题都是NP困难的。例如,即使是小的神经网络的训练。同样,在许多应用程序中,这些问题都是在实际数据上进行常规处理的。实验证据表明,这一成功有赖于实际数据的某种“驯服”。我们建议通过提取输入的结构属性来解决这种理论-实践差异,这些结构属性可以被假设为适用于自然产生的输入数据,同时产生解决此类输入的困难问题的有效算法。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
BenDavid, Shai其他文献
BenDavid, Shai的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('BenDavid, Shai', 18)}}的其他基金
Machine Learning Beyond Prediction - Extracting Insights and Guiding Actions
超越预测的机器学习 - 提取见解和指导行动
- 批准号:
RGPIN-2020-04333 - 财政年份:2021
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Machine Learning Beyond Prediction - Extracting Insights and Guiding Actions
超越预测的机器学习 - 提取见解和指导行动
- 批准号:
RGPIN-2020-04333 - 财政年份:2020
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Utilizing unlabeled data for machine learning tasks - theoretical analysis
利用未标记数据进行机器学习任务 - 理论分析
- 批准号:
RGPIN-2015-04654 - 财政年份:2019
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Utilizing unlabeled data for machine learning tasks - theoretical analysis
利用未标记数据进行机器学习任务 - 理论分析
- 批准号:
RGPIN-2015-04654 - 财政年份:2018
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Utilizing unlabeled data for machine learning tasks - theoretical analysis
利用未标记数据进行机器学习任务 - 理论分析
- 批准号:
RGPIN-2015-04654 - 财政年份:2017
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Utilizing unlabeled data for machine learning tasks - theoretical analysis
利用未标记数据进行机器学习任务 - 理论分析
- 批准号:
RGPIN-2015-04654 - 财政年份:2016
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Utilizing unlabeled data for machine learning tasks - theoretical analysis
利用未标记数据进行机器学习任务 - 理论分析
- 批准号:
RGPIN-2015-04654 - 财政年份:2015
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Theoretical analysis of emerging machine learning paradigms
新兴机器学习范式的理论分析
- 批准号:
312393-2009 - 财政年份:2014
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Theoretical analysis of emerging machine learning paradigms
新兴机器学习范式的理论分析
- 批准号:
380482-2009 - 财政年份:2012
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Accelerator Supplements
Theoretical analysis of emerging machine learning paradigms
新兴机器学习范式的理论分析
- 批准号:
312393-2009 - 财政年份:2012
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
Understanding structural evolution of galaxies with machine learning
- 批准号:n/a
- 批准年份:2022
- 资助金额:10.0 万元
- 项目类别:省市级项目
煤矿安全人机混合群智感知任务的约束动态多目标Q-learning进化分配
- 批准号:
- 批准年份:2022
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于领弹失效考量的智能弹药编队短时在线Q-learning协同控制机理
- 批准号:62003314
- 批准年份:2020
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
集成上下文张量分解的e-learning资源推荐方法研究
- 批准号:61902016
- 批准年份:2019
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
具有时序迁移能力的Spiking-Transfer learning (脉冲-迁移学习)方法研究
- 批准号:61806040
- 批准年份:2018
- 资助金额:20.0 万元
- 项目类别:青年科学基金项目
基于Deep-learning的三江源区冰川监测动态识别技术研究
- 批准号:51769027
- 批准年份:2017
- 资助金额:38.0 万元
- 项目类别:地区科学基金项目
具有时序处理能力的Spiking-Deep Learning(脉冲深度学习)方法研究
- 批准号:61573081
- 批准年份:2015
- 资助金额:64.0 万元
- 项目类别:面上项目
基于有向超图的大型个性化e-learning学习过程模型的自动生成与优化
- 批准号:61572533
- 批准年份:2015
- 资助金额:66.0 万元
- 项目类别:面上项目
E-Learning中学习者情感补偿方法的研究
- 批准号:61402392
- 批准年份:2014
- 资助金额:26.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Beyond Standard Numerical Relativistic Hydrodynamics in Binary Neutron Stars: Cooperation of Machine Learning Toward Era of Gravitational Waves Astronomy and Exascale Supercomputers
双中子星中超越标准数值相对论流体动力学:机器学习在引力波时代的合作天文学和百亿亿次超级计算机
- 批准号:
23K03399 - 财政年份:2023
- 资助金额:
$ 2.99万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Machine Learning-Aided Solutions for Efficient Planning, Design, Operation and Adaptation of Beyond 5G Wireless Networks
用于高效规划、设计、运营和适应超 5G 无线网络的机器学习辅助解决方案
- 批准号:
RGPIN-2022-03798 - 财政年份:2022
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Machine Learning Techniques for Ressource Allocation in 5G Networks and Beyond
5G 网络及其他网络中资源分配的机器学习技术
- 批准号:
516933-2018 - 财政年份:2022
- 资助金额:
$ 2.99万 - 项目类别:
Postdoctoral Fellowships
Beyond First Order Stochastic Optimization in Machine Learning
机器学习中超越一阶随机优化
- 批准号:
547276-2020 - 财政年份:2022
- 资助金额:
$ 2.99万 - 项目类别:
Postgraduate Scholarships - Doctoral
Machine Learning Beyond Prediction - Extracting Insights and Guiding Actions
超越预测的机器学习 - 提取见解和指导行动
- 批准号:
RGPIN-2020-04333 - 财政年份:2021
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Beyond First Order Stochastic Optimization in Machine Learning
机器学习中超越一阶随机优化
- 批准号:
547276-2020 - 财政年份:2021
- 资助金额:
$ 2.99万 - 项目类别:
Postgraduate Scholarships - Doctoral
Adaptive Machine Learning Algorithms for mmWave Communications in Beyond 5G and 6G Systems
5G 和 6G 之外系统中毫米波通信的自适应机器学习算法
- 批准号:
21K14162 - 财政年份:2021
- 资助金额:
$ 2.99万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
NSF Convergence Accelerator - Track D: A Standardized Model Description Format for Accelerating Convergence in Neuroscience, Cognitive Science, Machine Learning and Beyond
NSF 融合加速器 - 轨道 D:用于加速神经科学、认知科学、机器学习等领域融合的标准化模型描述格式
- 批准号:
2040682 - 财政年份:2020
- 资助金额:
$ 2.99万 - 项目类别:
Standard Grant
Machine Learning Beyond Prediction - Extracting Insights and Guiding Actions
超越预测的机器学习 - 提取见解和指导行动
- 批准号:
RGPIN-2020-04333 - 财政年份:2020
- 资助金额:
$ 2.99万 - 项目类别:
Discovery Grants Program - Individual
Beyond First Order Stochastic Optimization in Machine Learning
机器学习中超越一阶随机优化
- 批准号:
547276-2020 - 财政年份:2020
- 资助金额:
$ 2.99万 - 项目类别:
Postgraduate Scholarships - Doctoral