Collaborative Research: EnCORE: Institute for Emerging CORE Methods in Data Science

合作研究:EnCORE:数据科学新兴核心方法研究所

基本信息

  • 批准号:
    2217058
  • 负责人:
  • 金额:
    $ 463.89万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2022
  • 资助国家:
    美国
  • 起止时间:
    2022-09-01 至 2027-08-31
  • 项目状态:
    未结题

项目摘要

The proliferation of data-driven decision making, and its increased popularity, has fueled rapid emergence of data science as a new scientific discipline. Data science is seen as a key enabler of future businesses, technologies, and healthcare that can transform all aspects of socioeconomic lives. Its fast adoption, however, often comes with ad hoc implementation of techniques with suboptimal, and sometimes unfair and potentially harmful, results. The time is ripe to develop principled approaches to lay solid foundations of data science. This is particularly challenging as real-world data is highly complex with intricate structures, unprecedented scale, rapidly evolving characteristics, noise, and implicit biases. Addressing these challenges requires a concerted effort across multiple scientific disciplines such as statistics for robust decision making under uncertainty; mathematics and electrical engineering for enabling data-driven optimization beyond worst case; theoretical computer science and machine learning for new algorithmic paradigms to deal with dynamic and sensitive data in an ethical way; and basic sciences to bring the technical developments to the forefront of health sciences and society. The proposed institute for emerging CORE methods in data science (EnCORE) brings together a diverse team of researchers spanning the afore-mentioned disciplines from the University of California San Diego, University of Texas Austin, University of Pennsylvania, and the University of California Los Angeles. It presents an ambitious vision to transform the landscape of the four CORE pillars of data science: C for complexities of data, O for optimization, R for responsible learning, and E for education and engagement. Along with its transformative research vision, the institute fosters a bold plan for outreach and broadening participation by engaging students of diverse backgrounds at all levels from K-12 to postdocs and junior faculty. The project aims to impact a wide demography of students by offering collaborative courses across its partner universities and a flexible co-mentorship plan for truly multidisciplinary research. With regular organization of workshops, summer schools, and seminars, the project aims to engage the entire scientific community to become the new nexus of research and education on foundations of data science. To bring the fruit of theoretical development to practice, EnCORE will continuously work with industry partners, domain scientists, and will forge strong connections with other National Science Foundation Harnessing Data Revolution institutes across the nation.EnCORE as an institute embodies intellectual merit that has the potential to lead ground-breaking research to shape the foundations of data science in the United States. Its research mission is organized around three themes. The first theme on data complexity addresses the complex characteristics of data such as massive size, huge feature space, rapid changes, variety of sources, implicit dependence structures, arbitrary outliers, and noise. A major overhaul of the core concepts of algorithm design is needed with a holistic view of different computational complexity measures. Faced with noise and outliers, uncertainty estimation is both necessary, and at the same time difficult, due to dynamic and changing data. Data heterogeneity poses major challenges even in basic classification tasks. The structural relationships hidden inside such data are crucial in the understanding and processing, and for downstream data analysis tasks such as in visualization and neuroscience. The second theme of EnCORE aims to transform the classical area of optimization where adaptive methods and human intervention can lead to major advances. It plans to revisit the foundations of distributed optimization to include heterogeneity, robustness, safety, and communication; and address statistical uncertainty due to distributional shift in dynamic data in control and reinforcement learning. The third and final theme of EnCORE proposes to build the foundations of responsible learning. Applications of machine learning in human-facing systems are severely hampered when the learned models are hard for users to understand and reproduce, may give biased outcomes, are easily changeable by an adversary, and reveal sensitive information. Thus, interpretability, reproducibility, fairness, privacy, and robustness must be incorporated in any data-driven decision making. The experience and dedication to mentoring and outreach, collaborative curriculum design, socially aware responsible research program, extensive institute activities, and industrial partnerships would pave the way for a substantial broader impact for EnCORE. Summer schools with year-long mentoring will take place in three states involving a large demography. Joint courses with hybrid, and fully online offerings will be developed. Utilizing prior experience of running Thinkabit lab that has impacted over 74,000 K-12 students so far, EnCORE will embark on an ambitious and thoughtful outreach program to improve the representation of under-represented groups and help create a future generation of workforce that is diverse, responsible, and has solid foundations in data science.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
数据驱动决策的激增及其日益普及,推动了数据科学作为一门新的科学学科的迅速崛起。数据科学被视为未来业务、技术和医疗保健的关键推动因素,可以改变社会经济生活的各个方面。然而,它的快速采用往往伴随着临时实施的技术,这些技术会产生次优、有时不公平且可能有害的结果。现在是时候制定原则性的方法,为数据科学奠定坚实的基础。这尤其具有挑战性,因为现实世界的数据非常复杂,具有复杂的结构,前所未有的规模,快速变化的特征,噪音和隐含的偏见。应对这些挑战需要多个科学学科的共同努力,例如统计学,用于在不确定性下做出稳健的决策;数学和电气工程,用于实现最坏情况下的数据驱动优化;理论计算机科学和机器学习,用于新的算法范例,以道德方式处理动态和敏感数据;和基础科学,将技术发展带到健康科学和社会的最前沿。拟议中的数据科学新兴CORE方法研究所(EnCORE)汇集了来自加州圣地亚哥大学、德克萨斯大学奥斯汀分校、宾夕法尼亚大学和加州洛杉矶大学的跨上述学科的多元化研究团队。它提出了一个雄心勃勃的愿景,旨在改变数据科学的四大核心支柱:C代表数据的复杂性,O代表优化,R代表负责任的学习,E代表教育和参与。沿着其变革性的研究愿景,该研究所通过吸引从K-12到博士后和初级教师的各个层次的不同背景的学生,促进了一个大胆的外联和扩大参与的计划。该项目旨在通过在其合作大学提供合作课程和灵活的共同导师计划来影响广泛的学生群体,以实现真正的多学科研究。通过定期组织研讨会,暑期学校和研讨会,该项目旨在使整个科学界成为数据科学基础研究和教育的新纽带。为了将理论发展的成果付诸实践,EnCORE将继续与行业合作伙伴,领域科学家合作,并将与全国各地的其他国家科学基金会利用数据革命研究所建立密切联系。EnCORE作为一个研究所,体现了智力价值,有可能领导开创性的研究,以塑造美国数据科学的基础。其研究使命围绕三个主题展开。关于数据复杂性的第一个主题解决了数据的复杂特性,例如庞大的大小,巨大的特征空间,快速变化,各种来源,隐式依赖结构,任意离群值和噪声。需要对算法设计的核心概念进行重大改革,并对不同的计算复杂性措施进行整体考虑。面对噪声和异常值,由于数据的动态变化,不确定性估计既必要又困难。即使在基本的分类任务中,数据异构性也构成了重大挑战。隐藏在这些数据中的结构关系对于理解和处理以及下游数据分析任务(如可视化和神经科学)至关重要。EnCORE的第二个主题旨在改变经典的优化领域,其中自适应方法和人为干预可以带来重大进展。它计划重新审视分布式优化的基础,包括异构性,鲁棒性,安全性和通信;并解决由于控制和强化学习中动态数据的分布变化而导致的统计不确定性。EnCORE的第三个也是最后一个主题是建立负责任学习的基础。机器学习在面向人类的系统中的应用受到严重阻碍,当学习的模型很难让用户理解和复制时,可能会给出有偏见的结果,很容易被对手改变,并泄露敏感信息。因此,可解释性、可再现性、公平性、隐私性和鲁棒性必须纳入任何数据驱动的决策中。经验和奉献精神的指导和推广,协作课程设计,社会意识的负责任的研究计划,广泛的研究所活动,和工业伙伴关系将铺平道路,为EnCORE的实质性更广泛的影响。为期一年的暑期学校将在三个人口众多的州举办。将开发混合和完全在线的联合课程。利用之前运行Thinkabit实验室的经验,到目前为止,该实验室已经影响了74,000多名K-12学生,EnCORE将着手实施一项雄心勃勃且深思熟虑的外联计划,以提高代表性不足的群体的代表性,并帮助创造未来一代的劳动力,这是多样化的,负责任的,该奖项反映了NSF的法定使命,并被认为值得通过使用基金会的智力价值和更广泛的评估来支持。影响审查标准。

项目成果

期刊论文数量(14)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Communication-Efficient Zeroth-Order Distributed Online Optimization: Algorithm, Theory, and Applications
  • DOI:
    10.1109/access.2023.3284891
  • 发表时间:
    2023-06
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Ege C. Kaya;M. Sahin;Abolfazl Hashemi
  • 通讯作者:
    Ege C. Kaya;M. Sahin;Abolfazl Hashemi
Binary Iterative Hard Thresholding Converges with Optimal Number of Measurements for 1-Bit Compressed Sensing
Rethinking Logic Minimization for Tabular Machine Learning
  • DOI:
    10.1109/tai.2022.3224415
  • 发表时间:
    2023-10
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Litao Qiao;Weijia Wang;S. Dasgupta;Bill Lin
  • 通讯作者:
    Litao Qiao;Weijia Wang;S. Dasgupta;Bill Lin
Weighted Edit Distance Computation: Strings, Trees, and Dyck
加权编辑距离计算:字符串、树和 Dyck
Constants Matter: The Performance Gains of Active Learning
常数很重要:主动学习的性能提升
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Barna Saha其他文献

Language Edit Distance & Maximum Likelihood Parsing of Stochastic Grammars: Faster Algorithms & Connection to Fundamental Graph Problems
语言编辑距离
  • DOI:
  • 发表时间:
    2014
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Barna Saha
  • 通讯作者:
    Barna Saha

Barna Saha的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Barna Saha', 18)}}的其他基金

CAREER: Efficient Fine-grained Algorithms
职业:高效的细粒度算法
  • 批准号:
    2223282
  • 财政年份:
    2021
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Continuing Grant
Inaugural TCS Women Meeting at Symposium of Theory of Computing 2018
2018 年计算理论研讨会上首次 TCS 女性会议
  • 批准号:
    1834336
  • 财政年份:
    2018
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Standard Grant
CAREER: Efficient Fine-grained Algorithms
职业:高效的细粒度算法
  • 批准号:
    1652303
  • 财政年份:
    2017
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Continuing Grant
CRII:AF: Scaling up Dynamic Programming for Certain Optimization Problems
CRII:AF:针对某些优化问题扩展动态规划
  • 批准号:
    1464310
  • 财政年份:
    2015
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Standard Grant

相似国自然基金

复杂电子产品超精密加工及检测关键技术研究与应用
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于合成生物学的动物底盘品种优化及中试应用研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
运用组学整合技术探索萆薢分清散联合化疗治疗晚期胰腺癌的临床研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
九里香等提取物多靶向制剂抗肺癌的作用及机制研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
升血小板方治疗原发免疫性血小板减少症的临床研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
八髎穴微波热疗在女性膀胱过度活动症治疗中的价值研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于 miR-455-5p 介导的氧化应激机制探讨糖尿病视网膜病变中医分型治疗的临床研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
基于 UPLC-Q-TOF-MS/MS 分析的 异功散活性成分评价及提取工艺研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
无创电针对于痉挛型双瘫脑 瘫患儿的有效性与安全性研究:一项随机 单盲前瞻性队列研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
弹压式手法与体外冲击波治疗肱骨外上髁炎的对比研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目

相似海外基金

Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
合作研究:REU 地点:地球与行星科学和天体物理学 REU 与纽约市立大学合作,位于美国自然历史博物馆
  • 批准号:
    2348998
  • 财政年份:
    2025
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Standard Grant
Collaborative Research: REU Site: Earth and Planetary Science and Astrophysics REU at the American Museum of Natural History in Collaboration with the City University of New York
合作研究:REU 地点:地球与行星科学和天体物理学 REU 与纽约市立大学合作,位于美国自然历史博物馆
  • 批准号:
    2348999
  • 财政年份:
    2025
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Standard Grant
"Small performances": investigating the typographic punches of John Baskerville (1707-75) through heritage science and practice-based research
“小型表演”:通过遗产科学和基于实践的研究调查约翰·巴斯克维尔(1707-75)的印刷拳头
  • 批准号:
    AH/X011747/1
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Research Grant
Democratizing HIV science beyond community-based research
将艾滋病毒科学民主化,超越社区研究
  • 批准号:
    502555
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
Translational Design: Product Development for Research Commercialisation
转化设计:研究商业化的产品开发
  • 批准号:
    DE240100161
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Discovery Early Career Researcher Award
北欧諸国における男女間賃金格差是正への取組みと賃金透明化制度の役割に関する研 究
北欧国家纠正性别工资差距的努力和工资透明制度的作用研究
  • 批准号:
    24K15587
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
ニコラウス・ペヴスナーの福祉社会デザインの研究
尼古拉斯·佩夫斯纳的福利社会设计研究
  • 批准号:
    24K15643
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
実用化を目指した室温磁気センサを用いた心磁図による心筋電気活動の可視化の研究
基于室温磁传感器的心磁图心肌电活动可视化研究及其实际应用
  • 批准号:
    24K15719
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
COPDと肺気腫の非侵襲的在宅呼吸計測技術に関する研究
COPD及肺气肿无创家庭呼吸测量技术研究
  • 批准号:
    24K15849
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
ECMOの人工肺において結露対策専用に開発された温風装置の基礎研究と臨床評価
专为防止 ECMO 氧合器肺部冷凝而开发的热风装置的基础研究和临床评估
  • 批准号:
    24K15873
  • 财政年份:
    2024
  • 资助金额:
    $ 463.89万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了