XTRIPODS: Algorithms and Machine Learning in Data Intensive Models

XTRIPODS:数据密集型模型中的算法和机器学习

基本信息

  • 批准号:
    2342527
  • 负责人:
  • 金额:
    $ 20万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2024
  • 资助国家:
    美国
  • 起止时间:
    2024-02-15 至 2026-01-31
  • 项目状态:
    未结题

项目摘要

Large datasets have emerged within numerous scientific disciplines, unveiling valuable insights and helping to develop various useful applications. However, they also pose several challenges due to their ever-growing size and dynamic nature. Often, these data sets are processed as data streams or distributed across multiple machines. Sketching and streaming algorithms have been successful in tackling many problems in these settings, ranging from data analysis, network algorithms, to optimization. One research objective of this project is to further improve these algorithms, in terms of time and memory efficiency, with the aid of machine learning predictions. This project will also apply sketching techniques to develop federated machine learning algorithms where data is distributed across machines or devices, offering privacy advantages due to their decentralized nature. The project also aims to improve the foundation of data science and computer science education at San Diego State University and in the community at large through collaboration with the TRIPODS EnCore Institute at UC San Diego.Unlike traditional worst-case analysis, by incorporating machine learning to unravel the underlying structure of the data, it becomes possible in many cases to design better algorithms. The investigator plans to improve the efficiency of existing sketching and streaming algorithms using machine learning. These improvements are in terms of space and time complexity as well as approximation quality. A wide range of problems in this paradigm including data summarization, graph theory, and combinatorial optimization will be considered. Additionally, the investigator plans to utilize sketching to aid the design of machine learning algorithms in distributed and federated settings. Data sketches offer several advantages for this task. They have a small memory footprint and can be merged to form a sketch of the combined data. Additionally, they reveal minimal information about local data, benefiting privacy. The investigator aims to employ sketching algorithms on various problems such as building boosted decision trees for classification and regression, and learning a Bayesian network to explain the data. The investigator will also develop new computer science courses at San Diego State University to improve data science education and collaborate with the TRIPODS EnCORE Institute at UC San Diego, to expand a summer boot camp for high school students focusing on data science.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
大型数据集已经出现在许多科学学科中,揭示了有价值的见解,并有助于开发各种有用的应用程序。然而,由于其不断增长的规模和动态性质,它们也带来了一些挑战。通常,这些数据集被处理为数据流或分布在多台机器上。草图和流算法已经成功地解决了这些设置中的许多问题,从数据分析,网络算法到优化。该项目的一个研究目标是在机器学习预测的帮助下,在时间和内存效率方面进一步改进这些算法。该项目还将应用草图技术来开发联合机器学习算法,其中数据分布在机器或设备上,由于其分散的性质而提供隐私优势。该项目还旨在通过与加州大学圣地亚哥分校的TRIPODS EnCore研究所合作,改善圣地亚哥州立大学和整个社区的数据科学和计算机科学教育基础。与传统的最坏情况分析不同,通过结合机器学习来解开数据的底层结构,在许多情况下可以设计出更好的算法。研究人员计划使用机器学习来提高现有草图和流算法的效率。这些改进是在空间和时间复杂度以及近似质量方面。在这个范例中,包括数据汇总,图论和组合优化的问题范围广泛,将被考虑。此外,研究人员计划利用草图来帮助分布式和联邦环境中的机器学习算法的设计。数据草图为此任务提供了几个优势。它们的内存占用量很小,可以合并以形成组合数据的草图。此外,它们会泄露有关本地数据的最小信息,从而有利于隐私。研究人员的目标是使用草图算法来解决各种问题,例如构建用于分类和回归的增强决策树,以及学习贝叶斯网络来解释数据。该研究员还将在圣地亚哥州立大学开发新的计算机科学课程,以改善数据科学教育,并与加州大学圣地亚哥分校的TRIPODS EnCORE研究所合作,扩大面向高中生的数据科学夏季靴子训练营。该奖项反映了NSF的法定使命,并通过使用基金会的智力价值和更广泛的影响力审查标准进行评估,被认为值得支持。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Hoa Vu其他文献

The status of dialysis patients in Asian countries under COVID-19 disaster as of December 2019–June 2022: Vietnam, Indonesia, and Mongolia
  • DOI:
    10.1186/s41100-024-00572-w
  • 发表时间:
    2024-10-04
  • 期刊:
  • 影响因子:
    1.000
  • 作者:
    Toru Hyodo;Nobuhito Hirawa;Takahiro Kuragano;Yoshiaki Takemoto;Bui Van Pham;Ha Minh Nguyen;Loc Duc Nguyen;Hoa Vu;Giang Le;Hang Nguyen;Tung Nguyen;An Phan;I. Gde Raka Widiana;Saruultuvshin Adiya;Mandkhai Nergui;Galmunkh Dashmend;Narantungalag Bayankhuu;Tsatsral Dorjsuren;Dolzodmaa Ulziibayar;Nurguli Zulpihar;Bolorchimeg Batbold;Amartuvshin Bat-Ochir;Chuluuntsetseg Dorj
  • 通讯作者:
    Chuluuntsetseg Dorj
Sparse Manifold Alignment
稀疏流形对齐
  • DOI:
  • 发表时间:
    2012
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Chang Wang;Bo Liu;Hoa Vu;S. Mahadevan
  • 通讯作者:
    S. Mahadevan
Womb to wisdom: Early-life exposure to midwifery laws and later-life disability
从子宫到智慧:早年接触助产法与晚年残疾情况
  • DOI:
    10.1016/j.socscimed.2025.117973
  • 发表时间:
    2025-05-01
  • 期刊:
  • 影响因子:
    5.000
  • 作者:
    Hamid Noghanibehambari;Hesamaldin Bagheri;Mostafa Toranji;Hoa Vu;Nasrin Tavassoli
  • 通讯作者:
    Nasrin Tavassoli
LCT-MALTA’s Submission to RepEval 2017 Shared Task
Born on the wrong side of the tracks: Exploring the causal effects of segregation on infant health
出身于错误的轨道:探索隔离对婴儿健康的因果影响
  • DOI:
    10.1016/j.jhealeco.2024.102876
  • 发表时间:
    2024-05-01
  • 期刊:
  • 影响因子:
    3.600
  • 作者:
    Hoa Vu;Tiffany L. Green;Laura E.T. Swan
  • 通讯作者:
    Laura E.T. Swan

Hoa Vu的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

相似海外基金

CAREER: Blessing of Nonconvexity in Machine Learning - Landscape Analysis and Efficient Algorithms
职业:机器学习中非凸性的祝福 - 景观分析和高效算法
  • 批准号:
    2337776
  • 财政年份:
    2024
  • 资助金额:
    $ 20万
  • 项目类别:
    Continuing Grant
DMS-EPSRC: Asymptotic Analysis of Online Training Algorithms in Machine Learning: Recurrent, Graphical, and Deep Neural Networks
DMS-EPSRC:机器学习中在线训练算法的渐近分析:循环、图形和深度神经网络
  • 批准号:
    EP/Y029089/1
  • 财政年份:
    2024
  • 资助金额:
    $ 20万
  • 项目类别:
    Research Grant
CAREER: Gaussian Processes for Scientific Machine Learning: Theoretical Analysis and Computational Algorithms
职业:科学机器学习的高斯过程:理论分析和计算算法
  • 批准号:
    2337678
  • 财政年份:
    2024
  • 资助金额:
    $ 20万
  • 项目类别:
    Continuing Grant
Collaborative Research: AF: Medium: Algorithms Meet Machine Learning: Mitigating Uncertainty in Optimization
协作研究:AF:媒介:算法遇见机器学习:减轻优化中的不确定性
  • 批准号:
    2422926
  • 财政年份:
    2024
  • 资助金额:
    $ 20万
  • 项目类别:
    Continuing Grant
AF:RI:Small: Fairness in allocation and machine learning problems: algorithms and solution concepts
AF:RI:Small:分配公平性和机器学习问题:算法和解决方案概念
  • 批准号:
    2334461
  • 财政年份:
    2024
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant
CAREER: Interpretable and Robust Machine Learning Models: Analysis and Algorithms
职业:可解释且稳健的机器学习模型:分析和算法
  • 批准号:
    2239787
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Continuing Grant
STINMALE: Strategic Interactions with Machine-Learning Algorithms: The Role of Simple Beliefs
STINMALE:与机器学习算法的战略交互:简单信念的作用
  • 批准号:
    EP/Y033361/1
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Research Grant
Next-Generation Algorithms in Statistical Genetics Based on Modern Machine Learning
基于现代机器学习的下一代统计遗传学算法
  • 批准号:
    10714930
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
DMS-EPSRC: Asymptotic Analysis of Online Training Algorithms in Machine Learning: Recurrent, Graphical, and Deep Neural Networks
DMS-EPSRC:机器学习中在线训练算法的渐近分析:循环、图形和深度神经网络
  • 批准号:
    2311500
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Standard Grant
The mathematics of Stackelberg games in machine learning: constructing categories towards powerful algorithms
机器学习中 Stackelberg 博弈的数学:构建强大算法的类别
  • 批准号:
    EP/X040909/1
  • 财政年份:
    2023
  • 资助金额:
    $ 20万
  • 项目类别:
    Research Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了