权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CAREER: Learning of graph diffusion and transport from high dimensional data with low-dimensional structures

职业：从具有低维结构的高维数据中学习图扩散和传输

基本信息

批准号：
2237842
负责人：
Xiuyuan Cheng
金额：
$ 42.38万
依托单位：
Duke University
依托单位国家：
美国
项目类别：
Continuing Grant
财政年份：
2023
资助国家：
美国
起止时间：
2023-09-01 至 2028-08-31
项目状态：
未结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=2237842&HistoricalAwards=false
关键词：
CAREER Learning graph diffusion transport

项目摘要

Graph-based methods are pivotal tools in big data analysis due to their powerful ability to model data in various fields of science and industry. For high-dimensional data, an affinity graph can be constructed from the data cloud and the graph geometry will recover the implicit low-dimensional structure of the data. Therefore, a graph-based approach has the potential to overcome the curse of dimensionality and provide distribution-free methods for predictive and generative learning tasks. The overarching goal of this project is to develop a theoretical and computational framework for graph-based data analysis that overcomes the curse of dimensionality of high dimensional data by leveraging the underlying low-dimensional geometric structure in the data. The mathematical results can be applied to data visualization and dimension reduction, generative models, general unsupervised learning, and a wide range of real applications, ranging from single-cell sequencing to sensor networks. The project will provide research opportunities and projects that are suitable for graduate and undergraduate students, and results of the project will produce pedagogical materials to be incorporated into data science courses at the undergraduate and graduate levels. The project aims to develop theoretical and computational tools for efficient and accurate graph-based analysis of high-dimensional data that captures the intrinsically low-dimensional, non-linear structures in the data. The research work consists of four integrated topics: (1) learning of graph diffusion with a theoretical guarantee, (2) robust graph affinity for graph-based data analysis, (3) graph-based learning of intrinsic optimal transport in high dimension, and (4) generative model of graph data by gradient flow. Using tools from applied harmonic analysis and high dimensional probability, the project will address several open questions in the field. On the theoretical side, the project will model the implicit low-dimensional structure as data lying on or near hidden manifolds embedded in the high-dimensional space and analyze the convergence of the graph operators in the limit of large samples. On the practical side, the project will develop algorithms with sampling and computational complexities only depending on the intrinsic data dimensionality. The mathematical findings will provide computational tools to analyze data in real world applications, including biomedical and network data.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

基于图的方法是大数据分析中的关键工具，因为它们具有强大的建模能力，可以在科学和工业的各个领域进行数据建模。对于高维数据，可以从数据云构建亲和图，并且图几何将恢复数据的隐式低维结构。因此，基于图的方法有可能克服维数灾难，并为预测和生成学习任务提供无分布方法。该项目的总体目标是开发一个基于图的数据分析的理论和计算框架，通过利用数据中的底层低维几何结构来克服高维数据的维数灾难。数学结果可以应用于数据可视化和降维，生成模型，一般的无监督学习，以及广泛的真实的应用，从单细胞测序到传感器网络。该项目将提供适合研究生和本科生的研究机会和项目，该项目的结果将产生教学材料，以纳入本科和研究生阶段的数据科学课程。该项目旨在开发理论和计算工具，用于对高维数据进行有效和准确的基于图形的分析，以捕获数据中固有的低维非线性结构。研究工作由四个综合主题组成：（1）具有理论保证的图扩散学习，（2）基于图的数据分析的鲁棒图亲和力，（3）基于图的高维固有最优传输学习，以及（4）梯度流生成图数据模型。使用应用调和分析和高维概率的工具，该项目将解决该领域的几个开放性问题。在理论方面，该项目将把隐式低维结构建模为位于或接近嵌入高维空间的隐藏流形的数据，并分析图算子在大样本限制下的收敛性。在实践方面，该项目将开发采样和计算复杂性仅取决于内在数据维度的算法。数学发现将提供计算工具，以分析数据在真实的世界中的应用，包括生物医学和网络数据。这个奖项反映了NSF的法定使命，并已被认为是值得支持，通过评估使用基金会的智力价值和更广泛的影响审查标准。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Xiuyuan Cheng其他文献

Cluster-independent multiscale marker identification in single-cell RNA-seq data using localized marker detector (LMD)

使用局部标记检测器（LMD）在单细胞 RNA-seq 数据中进行独立于聚类的多尺度标记识别

DOI：
10.1038/s42003-025-08485-y
发表时间：
2025-07-16
期刊：
Communications Biology
影响因子：
5.100
作者：
Ruiqi Li;Rihao Qu;Fabio Parisi;Francesco Strino;Hainan Lam;Jay S. Stanley;Xiuyuan Cheng;Peggy Myung;Yuval Kluger
通讯作者：
Yuval Kluger