权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CAREER: Neural Networks in the Practical Regime

职业：神经网络在实际应用中的应用

基本信息

批准号：
2145630
负责人：
Guido Montufar Cuartas
金额：
$ 40.89万
依托单位：
University of California-Los Angeles
依托单位国家：
美国
项目类别：
Continuing Grant
财政年份：
2022
资助国家：
美国
起止时间：
2022-06-01 至 2027-05-31
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2145630&HistoricalAwards=false
关键词：
CAREER Neural Networks Practical Regime

项目摘要

This award is funded in whole or in part under the American Rescue Plan Act of 2021 (Public Law 117-2). Deep learning is the predominant approach in modern artificial intelligence that uses multi-layer artificial neural networks to infer complex relations from data. Despite the rapid adoption of deep learning in scientific and industrial applications, a theoretical basis to explain its success and methods to guard against its limits has yet to be established. This project addresses two critical gaps in the current theory of deep learning, namely the effects of working with specific types of data and the behavior of artificial neural networks in practical settings. This research is crucial to developing deep learning methods that are more efficient, reliable, safe, and broadly applicable. Some areas that would benefit directly are accelerating production pipelines, improving energy efficiency, and improving data handling in the increasing number of critical technologies that rely on deep learning. This project involves significant educational, community building, and outreach activities. The research will be directly integrated into interdisciplinary curricula and will generate research project topics for graduate students and capstone projects for undergraduate students. The project aims to prepare a diverse STEM workforce through long-term research and career mentoring for undergraduate and graduate students and postdocs, open seminars and discussion sessions with experts and enthusiasts at different career stages, internship opportunities, and dedicated training sessions.Artificial neural networks provide specific parametrizations to specific sets of candidate solutions to learning tasks, and parameter optimization procedures introduce specific preferences in the space of possible solutions. This project seeks to illuminate these complex relations in cases of practical interest that are not sufficiently well covered by existing mathematical theory, namely, where the networks have a moderate level of overparametrization in relation to the amount of training data. Importantly, it develops theories and methods that integrate and exploit the properties of the training data and parameter initialization. The research concerns the following three aims: (1) the function space description of moderately overparametrized networks, (2) the data-dependent description of the objective function and optimization, and (3) the explicit form of the bias of gradient descent in function space. The research program advances the state of the art by integrating the properties of the training data and addressing the optimization bias and model bias in interplay, which are challenges beyond the scope of existing methods. The project builds on preliminary work that blends applied mathematics and deep learning, in particular techniques connecting the geometry of parameter space, function space, and data space, as well as techniques based on information geometry, optimal transport, and algebraic statistics. The project will further develop important connections between geometry, probability, statistics, and machine learning and will offer unique opportunities for interdisciplinary research and education.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

该奖项全部或部分根据2021年美国救援计划法案（公法117-2）资助。深度学习是现代人工智能的主要方法，它使用多层人工神经网络从数据中推断复杂的关系。尽管深度学习在科学和工业应用中迅速普及，但解释其成功的理论基础和防止其局限性的方法尚未建立。该项目解决了当前深度学习理论中的两个关键空白，即处理特定类型数据的影响和人工神经网络在实际环境中的行为。这项研究对于开发更高效、可靠、安全和广泛适用的深度学习方法至关重要。一些直接受益的领域包括加速生产管道，提高能源效率，以及在越来越多的依赖深度学习的关键技术中改进数据处理。该项目涉及重要的教育、社区建设和外联活动。该研究将直接融入跨学科课程，并将为研究生和本科生的顶点项目产生研究项目主题。该项目旨在通过对本科生、研究生和博士后的长期研究和职业指导、与不同职业阶段的专家和爱好者的公开研讨会和讨论会、实习机会以及专门的培训课程，为多样化的STEM劳动力做好准备。人工神经网络为学习任务的特定候选解决方案集提供特定的参数化并且参数优化过程在可能的解的空间中引入特定的偏好。该项目旨在阐明这些复杂的关系，在实际利益的情况下，没有足够好地涵盖现有的数学理论，即，网络有一个适度的过度参数化的训练数据量。重要的是，它开发了整合和利用训练数据和参数初始化属性的理论和方法。本文的主要研究内容有三个：（1）适度过参数化网络的函数空间描述;（2）目标函数和优化的数据依赖描述;（3）梯度下降偏差在函数空间中的显式表示。该研究计划通过整合训练数据的属性并解决相互作用中的优化偏差和模型偏差来推进最新技术水平，这些偏差和偏差超出了现有方法的范围。该项目建立在融合应用数学和深度学习的初步工作的基础上，特别是连接参数空间、函数空间和数据空间几何的技术，以及基于信息几何、最优传输和代数统计的技术。该项目将进一步发展几何、概率、统计和机器学习之间的重要联系，并将为跨学科研究和教育提供独特的机会。该奖项反映了NSF的法定使命，并通过使用基金会的知识价值和更广泛的影响审查标准进行评估，被认为值得支持。

项目成果

期刊论文数量（8）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Characterizing the Spectrum of the NTK via a Power Series Expansion

DOI：
10.48550/arxiv.2211.07844
发表时间：
2022-11
期刊：
ArXiv
影响因子：
0
作者：
Michael Murray;Hui Jin;Benjamin Bowman;Guido Montúfar
通讯作者：
Michael Murray;Hui Jin;Benjamin Bowman;Guido Montúfar

Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime

DOI：
10.48550/arxiv.2206.02927
发表时间：
2022-06
期刊：
ArXiv
影响因子：
0
作者：
Benjamin Bowman;Guido Montúfar
通讯作者：
Benjamin Bowman;Guido Montúfar

Algebraic optimization of sequential decision problems

顺序决策问题的代数优化

DOI：
10.1016/j.jsc.2023.102241
发表时间：
2024
期刊：
Journal of Symbolic Computation
影响因子：
0.7
作者：
Dressler, Mareike;Garrote-López, Marina;Montúfar, Guido;Müller, Johannes;Rose, Kemal
通讯作者：
Rose, Kemal

FoSR: First-order spectral rewiring for addressing oversquashing in GNNs