Collaborative Research: OAC Core: Robust, Scalable, and Practical Low Rank Approximation
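Basic information
- Award number: 2106738
- Principal investigator:
- Amount: $275,000
- Host institution:
- Host institution country: United States
- Award type: Standard Grant
- Fiscal year: 2021
- Funding country: United States
- Period: 2021-07-15 to 2024-06-30
- Status: Completed
- Source:
- Keywords:
Project abstract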
Nearly all aspects of society are affected by data being produced at a faster rate in recent years. The data from experiments, observations, and simulations arise not only in classical science and engineering domains but also in numerous other areas, such as businesses tracking more and more facets of consumer behavior, and social networking capturing vast amounts of information on the relationships between people and their actions and interactions. There is a strong need to distill a set of data into a smaller representation that separates useful information from noise and captures the most important trends, patterns, and underlying relationships. Such a representation can be used for direct interpretation of hidden patterns or as a means of simplifying other data analytic tasks. This project addresses these challenges by studying a concept from linear algebra called low rank approximation. The project develops techniques that faithfully distill the meaningful information within a data set. The algorithms are also designed to exploit high-performance computers so that analysts can get results more quickly and tackle larger problems. The overall effort in the project is expected to close the gap between algorithms that can effectively handle very large-scale problems and the data analyst’s ability to convert raw input into meaningful representations and actionable insight.
The matrix and tensor low rank approximations being studied in this project serve as foundational tools in numerous science and engineering applications. Imposing constraints on the low rank approximations enables the modeling of many key problems, and designing scalable algorithms enables new applications that reach far beyond classical science and engineering disciplines. In particular, mathematical models with nonnegative data values abound, and imposing nonnegativity constraints allows for more accurate and interpretable models. Variants of these constraints can be designed to reflect additional characteristics of real-life data analytics problems. The primary goals of this project are (1) to develop robust techniques for evaluating computed low rank approximations for rank and model determination, (2) to develop scalable parallel algorithms for large and robust low rank approximations on today’s extreme-scale machines, and (3) to provide end users the practical tools required to compute and analyze solutions at scale. Typical data and application scientists use Python or MATLAB to iteratively compute, visualize, and evaluate solutions, and they are limited to small data sets with feasible memory and computational requirements. While high-performance algorithms and implementations exist, end users will not leverage these tools if they cannot rely on the robustness and generalizability of the results. This project aims to close this gap, developing an end-to-end system with scalable solutions for all steps of the data analytics workflow.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
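To make the abstract's central idea concrete, the following is a minimal Python sketch of low rank approximation with and without a nonnegativity constraint, followed by a simple rank sweep of the kind an analyst might use for rank determination. Everything in it is an illustrative assumption rather than the project's software: the function names (`truncated_svd`, `nmf`, `rel_error`, `make_data`, `rank_sweep`), the synthetic data, and the choice of the classic multiplicative-update scheme for the nonnegative factorization, which is used here only because it is short.

```python
import numpy as np


def truncated_svd(A, k):
    """Best rank-k approximation of A in the Frobenius norm (Eckart-Young)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :]


def nmf(A, k, iters=500, seed=0, eps=1e-12):
    """Nonnegative factorization A ~ W @ H with W, H >= 0.

    Uses multiplicative updates (an illustrative choice, not the
    project's algorithm). A must be entrywise nonnegative.
    """
    rng = np.random.default_rng(seed)
    m, n = A.shape
    W = rng.random((m, k)) + 0.1  # strictly positive starting point
    H = rng.random((k, n)) + 0.1
    for _ in range(iters):
        # Each update rescales entries by a nonnegative ratio,
        # so W and H stay nonnegative throughout.
        H *= (W.T @ A) / (W.T @ W @ H + eps)
        W *= (A @ H.T) / (W @ H @ H.T + eps)
    return W, H


def rel_error(A, B):
    """Relative Frobenius-norm error of B as an approximation of A."""
    return np.linalg.norm(A - B) / np.linalg.norm(A)


def make_data(m=60, n=40, true_rank=3, noise=0.01, seed=1):
    """Synthetic nonnegative matrix: low rank signal plus small noise."""
    rng = np.random.default_rng(seed)
    A = rng.random((m, true_rank)) @ rng.random((true_rank, n))
    A += noise * rng.random((m, n))  # nonnegative noise keeps A >= 0
    return A


def rank_sweep(A, ranks):
    """Return (k, SVD error, NMF error) for each candidate rank k."""
    rows = []
    for k in ranks:
        W, H = nmf(A, k)
        rows.append((k, rel_error(A, truncated_svd(A, k)), rel_error(A, W @ H)))
    return rows


if __name__ == "__main__":
    A = make_data()
    for k, e_svd, e_nmf in rank_sweep(A, range(1, 7)):
        print(f"rank {k}: SVD error {e_svd:.4f}, NMF error {e_nmf:.4f}")
```

On data like this, the error typically drops sharply up to the underlying rank and flattens afterwards, which is one informal cue for choosing a rank. The unconstrained SVD error is a lower bound on the constrained error at the same rank; what the nonnegativity constraint buys is factors that can be read as additive parts.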
Project outputs
Journal articles: 0
Monographs: 0
Research awards: 0
Conference papers: 0
Patents: 0
Other publications by Haesun Park
A Dynamic Data Driven Application System for Vehicle Tracking
- DOI: 10.1016/j.procs.2014.05.108
- Published: 2014
- Journal:
- Impact factor: 0
- Authors: R. Fujimoto; Angshuman Guin; M. Hunter; Haesun Park; G. Kanitkar; R. Kannan; Michael Milholen; Sabra A. Neal; P. Pecher
- Corresponding author: P. Pecher
Unfolding Latent Tree Structures using 4th Order Tensors
- DOI:
- Published: 2012
- Journal:
- Impact factor: 0
- Authors: Mariya Ishteva; Haesun Park; Le Song
- Corresponding author: Le Song
GPS-Based Shortest-Path Routing Scheme in Mobile Ad Hoc Network
- DOI:
- Published: 2019
- Journal:
- Impact factor: 0
- Authors: Haesun Park; Soo; So; Joo
- Corresponding author: Joo
Biocompatibility Issues of Implantable Drug Delivery Systems
- DOI: 10.1023/a:1016012520276
- Published: 1996-01-01
- Journal:
- Impact factor: 4.300
- Authors: Haesun Park; Kinam Park
- Corresponding author: Kinam Park
Efficient Implementation of Jacobi Algorithms and Jacobi Sets on Distributed Memory Architectures
- DOI:
- Published: 1990
- Journal:
- Impact factor: 0
- Authors: P. Eberlein; Haesun Park
- Corresponding author: Haesun Park
Other grants by Haesun Park
SI2-SSE: Collaborative Research: High Performance Low Rank Approximation for Scalable Data Analytics
- Award number: 1642410
- Fiscal year: 2016
- Amount: $275,000
- Award type: Standard Grant
CAREER: New Representations of Probability Distributions to Improve Machine Learning --- A Unified Kernel Embedding Framework for Distributions
- Award number: 1350983
- Fiscal year: 2014
- Amount: $275,000
- Award type: Continuing Grant
EAGER: Hierarchical Topic Modeling by Nonnegative Matrix Factorization for Interactive Multi-scale Analysis of Text Data
- Award number: 1348152
- Fiscal year: 2013
- Amount: $275,000
- Award type: Standard Grant
EAGER: Fast and Accurate Nonnegative Tensor Decompositions: Algorithms and Software
- Award number: 0956517
- Fiscal year: 2009
- Amount: $275,000
- Award type: Standard Grant
FODAVA-Lead: Dimension Reduction and Data Reduction: Foundations for Visualization
- Award number: 0808863
- Fiscal year: 2008
- Amount: $275,000
- Award type: Continuing Grant
MSPA-MCS: Collaborative Research: Fast Nonnegative Matrix Factorizations: Theory, Algorithms, and Applications
- Award number: 0732318
- Fiscal year: 2007
- Amount: $275,000
- Award type: Standard Grant
SGER: Effective Network Anomaly Detection Based on Adaptive Machine Learning
- Award number: 0715342
- Fiscal year: 2007
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: Greedy Approximations with Nonsubmodular Potential Functions
- Award number: 0728812
- Fiscal year: 2007
- Amount: $275,000
- Award type: Standard Grant
CompBio: Collaborative Research: Development of Effective Gene Selection Algorithms for Microarray Data Analysis
- Award number: 0621889
- Fiscal year: 2006
- Amount: $275,000
- Award type: Continuing Grant
Special Meeting: Workshop on Future Direction in Numerical Algorithms and Optimization
- Award number: 0633793
- Fiscal year: 2006
- Amount: $275,000
- Award type: Standard Grant
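Similar grants from the National Natural Science Foundation of China
Research on Quantum Field Theory without a Lagrangian Description
- Award number: 24ZR1403900
- Approval year: 2024
- Amount: CNY 0
- Award type: Provincial/municipal project
Cell Research
- Award number: 31224802
- Approval year: 2012
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 31024804
- Approval year: 2010
- Amount: CNY 240,000
- Award type: Special fund project
Cell Research
- Award number: 30824808
- Approval year: 2008
- Amount: CNY 240,000
- Award type: Special fund project
Research on the Rapid Growth Mechanism of KDP Crystal
- Award number: 10774081
- Approval year: 2007
- Amount: CNY 450,000
- Award type: General Program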
Similar overseas grants
Collaborative Research: OAC Core: Distributed Graph Learning Cyberinfrastructure for Large-scale Spatiotemporal Prediction
- Award number: 2403312
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC CORE: Federated-Learning-Driven Traffic Event Management for Intelligent Transportation Systems
- Award number: 2414474
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: Large-Scale Spatial Machine Learning for 3D Surface Topology in Hydrological Applications
- Award number: 2414185
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: Learning AI Surrogate of Large-Scale Spatiotemporal Simulations for Coastal Circulation
- Award number: 2402947
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: Distributed Graph Learning Cyberinfrastructure for Large-scale Spatiotemporal Prediction
- Award number: 2403313
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: Learning AI Surrogate of Large-Scale Spatiotemporal Simulations for Coastal Circulation
- Award number: 2402946
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
- Award number: 2403088
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
- Award number: 2403090
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC: Core: Harvesting Idle Resources Safely and Timely for Large-scale AI Applications in High-Performance Computing Systems
- Award number: 2403399
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant
Collaborative Research: OAC Core: CropDL - Scheduling and Checkpoint/Restart Support for Deep Learning Applications on HPC Clusters
- Award number: 2403089
- Fiscal year: 2024
- Amount: $275,000
- Award type: Standard Grant