权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CIF: RI: Medium: Design principles and theory for data augmentation

CIF：RI：中：数据增强的设计原理和理论

基本信息

批准号：
2212182
负责人：
Vidya Muthukumar
金额：
$ 120万
依托单位：
Georgia Tech Research Corporation
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2022
资助国家：
美国
起止时间：
2022-10-01 至 2025-09-30
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2212182&HistoricalAwards=false
关键词：
CIF RI Medium Design principles

项目摘要

Generalization, or the ability to transfer knowledge from one context to the next, is a hallmark of human intelligence. In artificial intelligence (AI), however, models trained in one setting often fail when tested in a new setting, even if the shift is minor or imperceptible. To build more generalizable AI, most modern methods employ some form of data augmentation (DA), which applies transformations to the data to create virtual samples that are then added to the dataset. The resulting synthesis of new examples appears to build helpful properties in AI such as invariance or resistance to change to certain natural transformations, and robustness to new tasks as well as noise in existing tasks. Despite the promise and performance of DA procedures, they are mostly applied in an ad-hoc manner and need to be designed and tested on a dataset by dataset basis. A set of fundamental principles and theory to understand DA and its impact on model training and testing is lacking. To address this outstanding challenge, the investigators will provide a precise understanding of the impact of DA on generalization, and leverage this understanding to design novel augmentations that can be used across multiple applications and domains.In this project, the investigators propose a principled mathematical framework to 1) understand when DA helps and when DA could potentially hurt learning, 2) understand the structure induced by DA and characterize what makes high-quality augmentations, and 3) provide novel, systematic, and scalable design principles for augmenting data in new domains where we lack prior knowledge to guide us. These design principles will significantly broaden the applicability and promise of DA from computer vision to new domains (e.g., neural data, graphs and tabular data) where principled augmentations are still not known. Of special focus in this project will be applications of DA to neural activity, where augmentations have shown promise in building a more generalizable link between the brain and behavior. This research will also yield prescriptions for the role of DA in advancing fairness, accountability and transparency in modern machine learning.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

泛化能力，或者说将知识从一个环境转移到另一个环境的能力，是人类智能的标志。然而，在人工智能（AI）中，在一个环境中训练的模型在新的环境中测试时往往会失败，即使这种变化很小或难以察觉。为了构建更通用的AI，大多数现代方法都采用某种形式的数据增强（DA），它对数据进行转换以创建虚拟样本，然后将其添加到数据集中。由此产生的新示例的合成似乎在AI中建立了有用的属性，例如对某些自然变换的不变性或抵抗性，以及对新任务和现有任务中的噪声的鲁棒性。尽管DA程序的承诺和性能，他们大多是在一个特设的方式应用，需要设计和测试的数据集的数据集的基础上。缺乏一套基本的原则和理论来理解DA及其对模型训练和测试的影响。为了解决这一突出的挑战，研究人员将提供DA对泛化的影响的精确理解，并利用这种理解来设计可用于多个应用和领域的新颖增强。在这个项目中，研究人员提出了一个原则性的数学框架，以1）了解DA何时有助于学习，何时可能损害学习，2）理解DA诱导的结构，并描述高质量增强的特征; 3）为在缺乏先验知识指导的新领域中增强数据提供新颖、系统和可扩展的设计原则。这些设计原则将大大拓宽DA的适用性和前景，从计算机视觉到新的领域（例如，神经数据、图形和表格数据），其中原理性增强仍然是未知的。在这个项目中，特别关注的是DA在神经活动中的应用，其中增强已经显示出在大脑和行为之间建立更普遍联系的希望。这项研究还将为DA在促进现代机器学习的公平性、问责制和透明度方面的作用提供处方。该奖项反映了NSF的法定使命，并通过使用基金会的知识价值和更广泛的影响审查标准进行评估，被认为值得支持。

项目成果

期刊论文数量（5）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Adaptive Oracle-Efficient Online Learning

DOI：
10.48550/arxiv.2210.09385
发表时间：
2022-10
期刊：
ArXiv
影响因子：
0
作者：
Guanghui Wang;Zihao Hu;Vidya Muthukumar;Jacob D. Abernethy
通讯作者：
Guanghui Wang;Zihao Hu;Vidya Muthukumar;Jacob D. Abernethy

Half-Hop: A graph upsampling approach for slowing down message passing

DOI：
10.48550/arxiv.2308.09198
发表时间：
2023-07
期刊：
Proceedings of machine learning research
影响因子：
0
作者：
Mehdi Azabou;Venkataraman Ganesh;S. Thakoor;Chi-Heng Lin;Lakshmi Sathidevi;Ran Liu;M. Vaĺko;
通讯作者：
Mehdi Azabou;Venkataraman Ganesh;S. Thakoor;Chi-Heng Lin;Lakshmi Sathidevi;Ran Liu;M. Vaĺko;

Active metric learning and classification using similarity queries

DOI：
发表时间：
2022-02
期刊：
影响因子：
0
作者：
Namrata Nadagouda;Austin Xu;M. Davenport
通讯作者：
Namrata Nadagouda;Austin Xu;M. Davenport

The Complexity of Infinite-Horizon General-Sum Stochastic Games

无限视野广义和随机博弈的复杂性

DOI：
发表时间：
2023
期刊：
14th Innovations in Theoretical Computer Science Conference (ITCS 2023
影响因子：
0
作者：
Jin, Yujia;Muthukumar, Vidya;Sidford, Aaron
通讯作者：
Sidford, Aaron

How much Data is Augmentation Worth?

增强数据有多少价值？

DOI：
发表时间：
2022
期刊：
Invariance and Stability
影响因子：
0
作者：
Geiping, Jonas;Goldblum, Micah;Somepalli, Gowthami;Shwartz-Ziv;Goldstein, Tom;Gordon-Wilson, Andrew
通讯作者：
Gordon-Wilson, Andrew

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Vidya Muthukumar其他文献

New Equivalences Between Interpolation and SVMs: Kernels and Structured Features

插值和 SVM 之间的新等价关系：内核和结构化特征

DOI：
10.48550/arxiv.2305.02304
发表时间：
2023
期刊：
ArXiv
影响因子：
0
作者：
Chiraag Kaushik;Andrew D. McRae;M. Davenport;Vidya Muthukumar
通讯作者：
Vidya Muthukumar

Estimating Optimal Policy Value in General Linear Contextual Bandits

估计一般线性上下文强盗的最优策略价值

DOI：
发表时间：
2023
期刊：
arXiv.org
影响因子：
0
作者：
Jonathan Lee;Weihao Kong;Aldo Pacchiano;Vidya Muthukumar;E. Brunskill
通讯作者：
E. Brunskill

Commitment in regulatory spectrum games: Examining the first-player advantage

监管频谱博弈中的承诺：检验先发优势

DOI：
10.1109/isit.2017.8006979
发表时间：
2017
期刊：
2017 IEEE International Symposium on Information Theory (ISIT)
影响因子：
0
作者：
Vidya Muthukumar;A. Sahai
通讯作者：
A. Sahai

Color-Theoretic Experiments to Understand Unequal Gender Classification Accuracy From Face Images

DOI：
10.1109/cvprw.2019.00282
发表时间：
2019-06
期刊：
2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
影响因子：
0
作者：
Vidya Muthukumar
通讯作者：
Vidya Muthukumar