权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse

合作研究：RI：中：通过深度神经崩溃实现优化、泛化和可迁移性的原理

基本信息

批准号：
2312842
负责人：
Qing Qu
金额：
$ 40万
依托单位：
Regents of the University of Michigan - Ann Arbor
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2023
资助国家：
美国
起止时间：
2023-07-01 至 2026-06-30
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2312842&HistoricalAwards=false
关键词：
Collaborative Research RI Medium Principles

项目摘要

Deep learning has demonstrated unprecedented performance across various domains in engineering and science. However, the theoretical understanding of their success has remained elusive. Very recently, researchers discovered and characterized an elegant mathematical structure within the learned features and classifiers called Neural Collapse. This phenomenon persists across a variety of different network architectures, datasets, and data domains. This project will leverage the symmetry of Neural Collapse to develop a rigorous mathematical theory to explain when and why it happens and how it can be used to quantify generalization performance and provide guidelines to understand and improve transferability. By advancing the mathematical foundations of deep learning, this project is expected to influence not only the machine learning community, but also related areas such as optimization, signal and image processing, and natural language processing. The project also involves an integrated outreach and education plan, including promoting accessibility and awareness of computing and STEM concepts for K-12 students.This project will expand our understanding of the principles behind non-convex optimization of training deep learning models, and provide new mathematical insights on their generalization and transferability properties, leading to practical implications. In particular, the project is focused on the following three overarching research thrusts: (i) provide a unified framework to analyze convergence guarantees for training deep and overparametrized models through general loss functions to states of neural collapse, first for simplified cases and then for more general deep models that exhibit progressive neural collapse, with multi-labels and data imbalance; (ii) harness the structure of neural collapse to provide tighter generalization bounds for deep models, by characterizing the structure of the resulting classifiers and their mild dependence on the training data, as well as by making natural distributional assumptions; (iii) leverage the generalization of progressive neural collapse to new environments to understand transferability of deep models to new domains and tasks, and develop principled approaches for improving transferability and efficient fine-tuning.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

深度学习在工程和科学的各个领域都表现出了前所未有的性能。然而，对他们成功的理论理解仍然难以捉摸。最近，研究人员在学习的特征和分类器中发现并描述了一种优雅的数学结构，称为神经崩溃。这种现象在各种不同的网络架构、数据集和数据域中持续存在。该项目将利用神经崩溃的对称性来开发一个严格的数学理论，以解释它何时以及为什么发生，以及如何使用它来量化泛化性能，并为理解和提高可移植性提供指导。通过推进深度学习的数学基础，该项目预计不仅会影响机器学习社区，还会影响优化、信号和图像处理以及自然语言处理等相关领域。该项目还包括一个综合的外展和教育计划，包括促进K-12学生对计算和STEM概念的可访问性和意识，该项目将扩大我们对训练深度学习模型的非凸优化背后原理的理解，并提供关于其泛化和可移植性的新数学见解，从而产生实际影响。特别是，该项目专注于以下三个总体研究重点：（i）提供一个统一的框架来分析通过一般损失函数训练深度和过度参数化模型到神经崩溃状态的收敛保证，首先是简化的情况，然后是表现出渐进神经崩溃的更一般的深度模型，具有多标签和数据不平衡;（ii）利用神经崩溃的结构，通过表征所得分类器的结构及其对训练数据的轻度依赖，以及通过做出自然分布假设，为深度模型提供更严格的泛化边界;（iii）利用渐进神经崩溃到新环境的泛化来理解深度模型到新领域和任务的可转移性，并制定原则性的方法，以提高可转移性和有效的罚款，该奖项反映了NSF的法定使命，并被认为值得通过使用基金会的智力价值进行评估来支持和更广泛的影响审查标准。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Qing Qu其他文献

Exact and Efficient Multi-Channel Sparse Blind Deconvolution — A Nonconvex Approach

精确高效的多通道稀疏盲反卷积——一种非凸方法

DOI：
发表时间：
2019
期刊：
Asilomar Conference on Signals, Systems and Computers
影响因子：
0
作者：
Qing Qu;Xiao Li;Zhihui Zhu
通讯作者：
Zhihui Zhu

Corrosion Behavior of Titanium in Artificial Saliva by Lactic Acid

DOI：
doi:10.3390/ma7085528
发表时间：
2014
期刊：
Materials
影响因子：
3.4
作者：
Qing Qu;Lei Wang;Yajun Chen;Lei Li;Yue He;Zhongtao Ding
通讯作者：
Zhongtao Ding

Inhibition of microbiologically influenced corrosion and biofouling of X70 carbon steels by near-superhydrophobic span class="small-caps"D/span-cysteine/Ag@ZIF-8 coatings

DOI：
10.1016/j.corsci.2022.110682
发表时间：
2022-11-01
期刊：
CORROSION SCIENCE
影响因子：
8.500
作者：
Kexin Chen;Xiaoqiang Yang;Qing Qu;Tao Wu;Shuai Chen;Lei Li
通讯作者：
Lei Li

Enhanced nitrate reduction emvia/em the Ag–Cu–P catalyst for sustainable ammonia generation under ambient conditions

通过 Ag–Cu–P 催化剂在环境条件下实现可持续氨生成的增强型硝酸盐还原

DOI：
10.1039/d3gc03859a
发表时间：
2024-01-22
期刊：
GREEN CHEMISTRY
影响因子：
9.200
作者：
Xinwei Wen;Yue Zhao;Puyang Fan;Jiajie Wu;Kai Xiong;Chang Liu;Qing Qu;Lei Li
通讯作者：
Lei Li

Forest thinning effects on soil carbon stocks and dynamics: Perspective of soil organic carbon sequestration rates

森林疏伐对土壤碳储量和动态的影响：土壤有机碳固存速率的视角

DOI：
10.1016/j.catena.2025.108759
发表时间：
2025-03-01
期刊：
CATENA
影响因子：
5.700
作者：
Qing Qu;Hongwei Xu;Lin Xu;Chengming You;Bo Tan;Han Li;Li Zhang;Lixia Wang;Sining Liu;Zhenfeng Xu;Sha Xue;Minggang Wang
通讯作者：
Minggang Wang