权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

XTRIPODS: Algorithms and Machine Learning in Data Intensive Models

XTRIPODS：数据密集型模型中的算法和机器学习

基本信息

批准号：
2342527
负责人：
Hoa Vu
金额：
$ 20万
依托单位：
San Diego State University Foundation
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2024
资助国家：
美国
起止时间：
2024-02-15 至 2026-01-31
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2342527&HistoricalAwards=false
关键词：
XTRIPODS Algorithms Machine Learning Data

项目摘要

Large datasets have emerged within numerous scientific disciplines, unveiling valuable insights and helping to develop various useful applications. However, they also pose several challenges due to their ever-growing size and dynamic nature. Often, these data sets are processed as data streams or distributed across multiple machines. Sketching and streaming algorithms have been successful in tackling many problems in these settings, ranging from data analysis, network algorithms, to optimization. One research objective of this project is to further improve these algorithms, in terms of time and memory efficiency, with the aid of machine learning predictions. This project will also apply sketching techniques to develop federated machine learning algorithms where data is distributed across machines or devices, offering privacy advantages due to their decentralized nature. The project also aims to improve the foundation of data science and computer science education at San Diego State University and in the community at large through collaboration with the TRIPODS EnCore Institute at UC San Diego.Unlike traditional worst-case analysis, by incorporating machine learning to unravel the underlying structure of the data, it becomes possible in many cases to design better algorithms. The investigator plans to improve the efficiency of existing sketching and streaming algorithms using machine learning. These improvements are in terms of space and time complexity as well as approximation quality. A wide range of problems in this paradigm including data summarization, graph theory, and combinatorial optimization will be considered. Additionally, the investigator plans to utilize sketching to aid the design of machine learning algorithms in distributed and federated settings. Data sketches offer several advantages for this task. They have a small memory footprint and can be merged to form a sketch of the combined data. Additionally, they reveal minimal information about local data, benefiting privacy. The investigator aims to employ sketching algorithms on various problems such as building boosted decision trees for classification and regression, and learning a Bayesian network to explain the data. The investigator will also develop new computer science courses at San Diego State University to improve data science education and collaborate with the TRIPODS EnCORE Institute at UC San Diego, to expand a summer boot camp for high school students focusing on data science.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

大型数据集已经出现在许多科学学科中，揭示了有价值的见解，并有助于开发各种有用的应用程序。然而，由于其不断增长的规模和动态性质，它们也带来了一些挑战。通常，这些数据集被处理为数据流或分布在多台机器上。草图和流算法已经成功地解决了这些设置中的许多问题，从数据分析，网络算法到优化。该项目的一个研究目标是在机器学习预测的帮助下，在时间和内存效率方面进一步改进这些算法。该项目还将应用草图技术来开发联合机器学习算法，其中数据分布在机器或设备上，由于其分散的性质而提供隐私优势。该项目还旨在通过与加州大学圣地亚哥分校的TRIPODS EnCore研究所合作，改善圣地亚哥州立大学和整个社区的数据科学和计算机科学教育基础。与传统的最坏情况分析不同，通过结合机器学习来解开数据的底层结构，在许多情况下可以设计出更好的算法。研究人员计划使用机器学习来提高现有草图和流算法的效率。这些改进是在空间和时间复杂度以及近似质量方面。在这个范例中，包括数据汇总，图论和组合优化的问题范围广泛，将被考虑。此外，研究人员计划利用草图来帮助分布式和联邦环境中的机器学习算法的设计。数据草图为此任务提供了几个优势。它们的内存占用量很小，可以合并以形成组合数据的草图。此外，它们会泄露有关本地数据的最小信息，从而有利于隐私。研究人员的目标是在各种问题上使用草图算法，例如构建用于分类和回归的提升决策树，以及学习贝叶斯网络来解释数据。该研究员还将在圣地亚哥州立大学开发新的计算机科学课程，以改善数据科学教育，并与加州大学圣地亚哥分校的TRIPODS EnCORE研究所合作，扩大面向高中生的数据科学夏季靴子训练营。该奖项反映了NSF的法定使命，并通过使用基金会的智力价值和更广泛的影响力审查标准进行评估，被认为值得支持。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Hoa Vu其他文献

The status of dialysis patients in Asian countries under COVID-19 disaster as of December 2019–June 2022: Vietnam, Indonesia, and Mongolia

DOI：
10.1186/s41100-024-00572-w
发表时间：
2024-10-04
期刊：
Renal Replacement Therapy
影响因子：
1.000
作者：
Toru Hyodo;Nobuhito Hirawa;Takahiro Kuragano;Yoshiaki Takemoto;Bui Van Pham;Ha Minh Nguyen;Loc Duc Nguyen;Hoa Vu;Giang Le;Hang Nguyen;Tung Nguyen;An Phan;I. Gde Raka Widiana;Saruultuvshin Adiya;Mandkhai Nergui;Galmunkh Dashmend;Narantungalag Bayankhuu;Tsatsral Dorjsuren;Dolzodmaa Ulziibayar;Nurguli Zulpihar;Bolorchimeg Batbold;Amartuvshin Bat-Ochir;Chuluuntsetseg Dorj
通讯作者：
Chuluuntsetseg Dorj

Sparse Manifold Alignment

稀疏流形对齐

DOI：
发表时间：
2012
期刊：
影响因子：
0
作者：
Chang Wang;Bo Liu;Hoa Vu;S. Mahadevan
通讯作者：
S. Mahadevan

Womb to wisdom: Early-life exposure to midwifery laws and later-life disability

从子宫到智慧：早年接触助产法与晚年残疾情况

DOI：
10.1016/j.socscimed.2025.117973
发表时间：
2025-05-01
期刊：
SOCIAL SCIENCE & MEDICINE
影响因子：
5.000
作者：
Hamid Noghanibehambari;Hesamaldin Bagheri;Mostafa Toranji;Hoa Vu;Nasrin Tavassoli
通讯作者：
Nasrin Tavassoli