Collaborative Research: Transferable, Hierarchical, Expressive, Optimal, Robust, Interpretable Networks

协作研究:可转移、分层、富有表现力、最优、稳健、可解释的网络

基本信息

  • 批准号:
    2031985
  • 负责人:
  • 金额:
    $ 165万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-09-01 至 2025-08-31
  • 项目状态:
    未结题

项目摘要

Recent advances in deep learning have led to many disruptive technologies: from automatic speech recognition systems, to automated supermarkets, to self-driving cars. However, the complex and large-scale nature of deep networks makes them hard to analyze and, therefore, they are mostly used as black-boxes without formal guarantees on their performance. For example, deep networks provide a self-reported confidence score, but they are frequently inaccurate and uncalibrated, or likely to make large mistakes on rare cases. Moreover, the design of deep networks remains an art and is largely driven by empirical performance on a dataset. As deep learning systems are increasingly employed in our daily lives, it becomes critical to understand if their predictions satisfy certain desired properties. The goal of this NSF-Simons Research Collaboration on the Mathematical and Scientific Foundations of Deep Learning is to develop a mathematical, statistical and computational framework that helps explain the success of current network architectures, understand its pitfalls, and guide the design of novel architectures with guaranteed confidence, robustness, interpretability, optimality, and transferability. This project will train a diverse STEM workforce with data science skills that are essential for the global competitiveness of the US economy by creating new undergraduate and graduate programs in the foundations of data science and organizing a series of collaborative research events, including semester research programs and summer schools on the foundations of deep learning. This project will also impact women and underrepresented minorities by involving undergraduates in the foundations of data science.Deep networks have led to dramatic improvements in the performance of pattern recognition systems. However, the mathematical reasons for this success remain elusive. For instance, it is not clear why deep networks generalize or transfer to new tasks, or why simple optimization strategies can reach a local or global minimum of the associated non-convex optimization problem. Moreover, there is no principled way of designing the architecture of the network so that it satisfies certain desired properties, such as expressivity, transferability, optimality and robustness. This project brings together a multidisciplinary team of mathematicians, statisticians, theoretical computer scientists, and electrical engineers to develop the mathematical and scientific foundations of deep learning. The project is divided in four main thrusts. The analysis thrust will use principles from approximation theory, information theory, statistical inference, and robust control to analyze properties of deep networks such as expressivity, interpretability, confidence, fairness and robustness. The learning thrust will use principles from dynamical systems, non-convex and stochastic optimization, statistical learning theory, adaptive control, and high-dimensional statistics to design and analyze learning algorithms with guaranteed convergence, optimality and generalization properties. The design thrust will use principles from algebra, geometry, topology, graph theory and optimization to design and learn network architectures that capture algebraic, geometric and graph structures in both the data and the task. The transferability thrust will use principles from multiscale analysis and modeling, reinforcement learning, and Markov decision processes to design and study data representations that are suitable for learning from and transferring to multiple tasks.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
深度学习的最新进展带来了许多颠覆性技术:从自动语音识别系统到自动超市,再到自动驾驶汽车。然而,深度网络的复杂性和大规模性质使得它们难以分析,因此,它们大多被用作黑盒,没有对其性能的正式保证。例如,深度网络提供了一个自我报告的信心评分,但它们经常是不准确和未经校准的,或者在极少数情况下可能会犯很大的错误。此外,深度网络的设计仍然是一门艺术,很大程度上是由数据集上的经验性能驱动的。随着深度学习系统越来越多地应用于我们的日常生活,了解它们的预测是否满足某些期望属性变得至关重要。NSF-Simons深度学习的数学和科学基础研究合作的目标是开发一个数学、统计和计算框架,帮助解释当前网络架构的成功,理解其缺陷,并指导具有保证信心、鲁棒性、可解释性、最优性和可转移性的新架构的设计。该项目将通过创建新的数据科学基础本科和研究生课程,并组织一系列合作研究活动,包括学期研究项目和深度学习基础暑期学校,培养具有数据科学技能的多样化STEM劳动力,这对美国经济的全球竞争力至关重要。该项目还将通过让本科生参与数据科学基础,影响女性和代表性不足的少数族裔。深度网络极大地改善了模式识别系统的性能。然而,这种成功的数学原因仍然难以捉摸。例如,为什么深度网络泛化或转移到新的任务中,或者为什么简单的优化策略可以达到相关非凸优化问题的局部或全局最小值,这一点尚不清楚。此外,没有原则性的方法来设计网络架构,使其满足某些期望的属性,如表达性、可转移性、最优性和鲁棒性。该项目汇集了一个由数学家、统计学家、理论计算机科学家和电气工程师组成的多学科团队,以发展深度学习的数学和科学基础。该项目分为四个主要部分。分析推力将使用近似理论、信息论、统计推断和鲁棒控制的原理来分析深度网络的特性,如表达性、可解释性、置信度、公平性和鲁棒性。学习推力将使用动力系统、非凸和随机优化、统计学习理论、自适应控制和高维统计的原理来设计和分析具有保证收敛性、最优性和泛化性质的学习算法。设计主旨将使用代数、几何、拓扑、图论和优化的原理来设计和学习网络架构,以捕获数据和任务中的代数、几何和图结构。可转移性推力将使用多尺度分析和建模、强化学习和马尔可夫决策过程的原理来设计和研究适合从多个任务中学习和转移的数据表示。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Special Issue on the Mathematical Foundations of Deep Learning in Imaging Science
影像科学深度学习的数学基础特刊
  • DOI:
    10.1007/s10851-020-00955-8
  • 发表时间:
    2020
  • 期刊:
  • 影响因子:
    2
  • 作者:
    Bruna, Joan;Haber, Eldad;Kutyniok, Gitta;Pock, Thomas;Vidal, René
  • 通讯作者:
    Vidal, René
Learning a Self-Expressive Network for Subspace Clustering
The vision of self-evolving computing systems
自我进化计算系统的愿景
Interpretable by Design: Learning Predictors by Composing Interpretable Queries
Conformal symplectic and relativistic optimization
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Rene Vidal其他文献

Rene Vidal的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Rene Vidal', 18)}}的其他基金

Collaborative Research: SCH: Multimodal Algorithms for Motor Imitation Assessment in Children with Autism
合作研究:SCH:自闭症儿童运动模仿评估的多模式算法
  • 批准号:
    2124277
  • 财政年份:
    2021
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
HDR TRIPODS: Institute for the Foundations of Graph and Deep Learning
HDR TRIPODS:图形和深度学习基础研究所
  • 批准号:
    1934979
  • 财政年份:
    2019
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
III: Medium: Non-Convex Methods for Discovering High-Dimensional Structures in Big and Corrupted Data
III:媒介:在大数据和损坏数据中发现高维结构的非凸方法
  • 批准号:
    1704458
  • 财政年份:
    2017
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
RI: Small: An Optimization Framework for Understanding Deep Networks
RI:小型:理解深度网络的优化框架
  • 批准号:
    1618485
  • 财政年份:
    2016
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
CIF: Small: Collaborative Research: Sparse and Low Rank Methods for Imbalanced and Heterogeneous Data
CIF:小型:协作研究:针对不平衡和异构数据的稀疏和低秩方法
  • 批准号:
    1618637
  • 财政年份:
    2016
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
RI: Small: Object Detection, Pose Estimation, and Semantic Segmentation Using 3D Wireframe Models
RI:小:使用 3D 线框模型进行物体检测、姿势估计和语义分割
  • 批准号:
    1527340
  • 财政年份:
    2015
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
BIGDATA: F: DKA: Learning a Union of Subspaces from Big and Corrupted Data
BIGDATA:F:DKA:从大数据和损坏数据中学习子空间并集
  • 批准号:
    1447822
  • 财政年份:
    2014
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
Geometry and Statistics on Spaces of Dynamical Systems for Pattern Recognition in High-Dimensional Time Series
用于高维时间序列模式识别的动力系统空间的几何和统计
  • 批准号:
    1335035
  • 财政年份:
    2013
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
RI: Small: Structured Sparse Conditional Random Fields Models for Joint Categorization and Segmentation of Objects.
RI:小型:用于对象联合分类和分割的结构化稀疏条件随机场模型。
  • 批准号:
    1218709
  • 财政年份:
    2012
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
CDI-Type I: Collaborative Research: A Bio-Inspired Approach to Recognition of Human Movements and Movement Styles
CDI-I 型:协作研究:识别人类运动和运动风格的仿生方法
  • 批准号:
    0941463
  • 财政年份:
    2010
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

Collaborative Research: Towards Engaged, Personalized and Transferable Learning of Secure Programming by Leveraging Real-World Security Vulnerabilities
协作研究:利用现实世界的安全漏洞实现安全编程的参与式、个性化和可转移学习
  • 批准号:
    2235976
  • 财政年份:
    2023
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
Collaborative Research: Towards Engaged, Personalized and Transferable Learning of Secure Programming by Leveraging Real-World Security Vulnerabilities
协作研究:利用现实世界的安全漏洞实现安全编程的参与式、个性化和可转移学习
  • 批准号:
    2235224
  • 财政年份:
    2023
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
Collaborative Research: Transferable, Hierarchical, Expressive, Optimal, Robust, Interpretable Networks
协作研究:可转移、分层、富有表现力、最优、稳健、可解释的网络
  • 批准号:
    2032014
  • 财政年份:
    2020
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
Collaborative Research: Transferable, Hierarchical, Expressive, Optimal, Robust, Interpretable Networks
协作研究:可转移、分层、富有表现力、最优、稳健、可解释的网络
  • 批准号:
    2031899
  • 财政年份:
    2020
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
Collaborative Research: Transferable, Hierarchical, Expressive, Optimal, Robust, Interpretable Networks
协作研究:可转移、分层、富有表现力、最优、稳健、可解释的网络
  • 批准号:
    2031849
  • 财政年份:
    2020
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
Collaborative Research: Transferable, Hierarchical, Expressive, Optimal, Robust, Interpretable Networks
协作研究:可转移、分层、富有表现力、最优、稳健、可解释的网络
  • 批准号:
    2031895
  • 财政年份:
    2020
  • 资助金额:
    $ 165万
  • 项目类别:
    Continuing Grant
GOALI: RUI: Collaborative Research: Development of Transferable Force Fields and Monte Carlo Algorithms and Application to Phase and Sorption Equilibria
目标:RUI:协作研究:可转移力场和蒙特卡罗算法的开发以及在相和吸附平衡中的应用
  • 批准号:
    1159837
  • 财政年份:
    2012
  • 资助金额:
    $ 165万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了