Developing statistical and topological learning methodologies for high-dimensional complex data

开发高维复杂数据的统计和拓扑学习方法

基本信息

  • 批准号:
    RGPIN-2016-05167
  • 负责人:
  • 金额:
    $ 1.31万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2020
  • 资助国家:
    加拿大
  • 起止时间:
    2020-01-01 至 2021-12-31
  • 项目状态:
    已结题

项目摘要

Natural processes can yield data patterns so complex and high-dimensional that they cannot be visualized by the human mind. Examples of high-dimensional data include, but are not limited to, social/sensor networks, semantics, DNA sequences, genomic studies, and medical images. Persistent homology is a particular branch of computational topology which studies the evolution of topological features of a filtration, a one-parameter family of nested spaces. It can be combined with traditional statistical methods as well as machine learning techniques and has been shown to be effective in discerning the differences between signal and noise. The fundamental idea of persistent homology is analogous to significant zero crossings of derivatives in statistics and scale-space theory in computer vision. In all three disciplines, however, only one parameter has been considered: the height of the function in persistent homology, bandwidth in statistics, and the scale of resolution in computer vision. In many situations, it is necessary to let several parameters vary simultaneously. For instance, in kernel density estimations, there are two parameters: the height of the function and the bandwidth. Persistent homology of multi-parameter filtrations remains unsolved: one of our main research goals is to study the persistent homology of bifiltrations. We propose the following four research goals: (1) computation, visualization, and interpretation of high-dimensional topological features; (2) development of two-dimensional persistence to be applied to random fields, functional data analysis, and multivariate regression analysis; (3) incorporation of two-dimensional persistent homology into cluster analysis and techniques in machine learning, such as support vector machine; and (4) development of statistical and topological learning tools that will incorporate our newly-developed techniques. The proposed methods will be applied in several ways: visualizing and interpreting high-dimensional topological features in social networks, semantics, molecules, DNA sequences, and brain images; comparing the craniofacial shapes and upper airways of pediatric obstructive sleep apnea (OSA) patients and normative subjects; clustering and/or classifying pediatric patients in terms of their OSA severity based on over one hundred variables; applications of sequential analysis to all of the combined methods in (1)-(4), the motivation for which stemming from clinical trials performed on pediatric OSA patients. Sequential analysis, in combination with topological and machine learning methodologies, could be conducive to early termination of the clinical trials. Altogether, our proposal encompasses three major scientific disciplines--statistics, computational topology, and machine learning--and serves as a step towards combining powerful techniques from each of these research areas.
自然过程可以产生如此复杂和高维的数据模式,以至于人类思维无法将其可视化。高维数据的示例包括但不限于社交/传感器网络、语义、DNA序列、基因组研究和医学图像。 持久同调是计算拓扑学的一个特殊的分支,它研究的是一个单参数嵌套空间族--滤子的拓扑特征的演化。它可以与传统的统计方法以及机器学习技术相结合,并已被证明可以有效地识别信号和噪声之间的差异。持久同调的基本思想类似于统计学中导数的重要零交叉和计算机视觉中的尺度空间理论。然而,在所有这三个学科中,只有一个参数被考虑:持续同源性中的函数高度,统计学中的带宽,以及计算机视觉中的分辨率范围。在许多情况下,有必要让几个参数同时变化。例如,在核密度估计中,有两个参数:函数的高度和带宽。多参数过滤的持续同调问题一直是一个未解决的问题:我们的主要研究目标之一就是研究双过滤的持续同调问题。 我们提出了以下四个研究目标:(1)计算、可视化和解释高维拓扑特征;(2)发展二维持久性,将其应用于随机场、函数数据分析和多元回归分析;(3)将二维持久性同源性纳入聚类分析和机器学习技术,如支持向量机;(4)将二维持久性同源性应用于聚类分析和机器学习技术,如支持向量机。以及(4)开发统计和拓扑学习工具,这些工具将结合我们新开发的技术。 所提出的方法将以几种方式应用:可视化和解释社交网络、语义、分子、DNA序列和脑图像中的高维拓扑特征;比较小儿阻塞性睡眠呼吸暂停(OSA)患者和正常受试者的颅面形状和上呼吸道;基于超过100个变量根据他们的OSA严重程度对小儿患者进行聚类和/或分类;将顺序分析应用于(1)-(4)中的所有组合方法,其动机源于对小儿OSA患者进行的临床试验。序贯分析结合拓扑和机器学习方法,可能有助于临床试验的提前终止。 总而言之,我们的建议包括三个主要的科学学科-统计学,计算拓扑学和机器学习-并作为一个步骤,从这些研究领域的每一个强大的技术相结合。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Heo, Giseon其他文献

Orthodontic interventions as a management option for children with residual obstructive sleep apnea: a cohort study protocol.
  • DOI:
    10.1136/bmjopen-2022-061651
  • 发表时间:
    2022-06-15
  • 期刊:
  • 影响因子:
    2.9
  • 作者:
    Fagundes, Nathalia Carolina Fernandes;Perez-Garcia, Arnaldo;Graf, Daniel;Flores-Mir, Carlos;Heo, Giseon
  • 通讯作者:
    Heo, Giseon
Transverse dental changes after toothborne and bone-borne maxillary expansion
  • DOI:
    10.1016/j.ortho.2012.12.003
  • 发表时间:
    2013-03-01
  • 期刊:
  • 影响因子:
    1.5
  • 作者:
    Lagravere, Manuel O.;Gamble, Jennifer;Heo, Giseon
  • 通讯作者:
    Heo, Giseon
Exploring uses of persistent homology for statistical analysis of landmark-based shape data
  • DOI:
    10.1016/j.jmva.2010.04.016
  • 发表时间:
    2010-10-01
  • 期刊:
  • 影响因子:
    1.6
  • 作者:
    Gamble, Jennifer;Heo, Giseon
  • 通讯作者:
    Heo, Giseon
Initial forces experienced by the anterior and posterior teeth during dental anchored or skeletal-anchored en masse retraction in vitro
  • DOI:
    10.2319/080916-616.1
  • 发表时间:
    2017-07-01
  • 期刊:
  • 影响因子:
    3.4
  • 作者:
    Lee, David;Heo, Giseon;Romany, Dan L.
  • 通讯作者:
    Romany, Dan L.
Bump hunting by topological data analysis
  • DOI:
    10.1002/sta4.167
  • 发表时间:
    2017-01-01
  • 期刊:
  • 影响因子:
    1.7
  • 作者:
    Sommerfeld, Max;Heo, Giseon;Marron, J. S.
  • 通讯作者:
    Marron, J. S.

Heo, Giseon的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Heo, Giseon', 18)}}的其他基金

Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2019
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2018
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2017
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2016
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical methodology for multi-dimensional data
多维数据的统计方法
  • 批准号:
    293180-2011
  • 财政年份:
    2015
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical methodology for multi-dimensional data
多维数据的统计方法
  • 批准号:
    293180-2011
  • 财政年份:
    2014
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical methodology for multi-dimensional data
多维数据的统计方法
  • 批准号:
    293180-2011
  • 财政年份:
    2013
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical methodology for multi-dimensional data
多维数据的统计方法
  • 批准号:
    293180-2011
  • 财政年份:
    2012
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical methodology for multi-dimensional data
多维数据的统计方法
  • 批准号:
    293180-2011
  • 财政年份:
    2011
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual

相似国自然基金

基于随机网络演算的无线机会调度算法研究
  • 批准号:
    60702009
  • 批准年份:
    2007
  • 资助金额:
    24.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

A statistical framework for the analysis of the evolution in shape and topological structure of random objects
用于分析随机物体形状和拓扑结构演化的统计框架
  • 批准号:
    2311338
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
Developing statistical, topological and geometrical techniques for ab-initio protein structure prediction
开发用于从头算蛋白质结构预测的统计、拓扑和几何技术
  • 批准号:
    2671188
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
Next Generation Fusion Reactor Design: Existence and Symmetry of Magnetofluidostatic Equilibria in Bounded Domains
下一代聚变反应堆设计:有界域中磁流体静力平衡的存在性和对称性
  • 批准号:
    21K13851
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Unraveling the topological architecture and phenotypic contexture of structural variation
揭示结构变异的拓扑结构和表型背景
  • 批准号:
    10356208
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Stochastic Topology and Topological Statistical Mechanics
随机拓扑和拓扑统计力学
  • 批准号:
    2005630
  • 财政年份:
    2020
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
FRG: Collaborative Research: Statistical Approaches to Topological Data Analysis that Address Questions in Complex Data
FRG:协作研究:解决复杂数据问题的拓扑数据分析统计方法
  • 批准号:
    2038556
  • 财政年份:
    2020
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
FRG: Collaborative Research: Statistical Approaches to Topological Data Analysis that Address Questions in Complex Data
FRG:协作研究:解决复杂数据问题的拓扑数据分析统计方法
  • 批准号:
    1854220
  • 财政年份:
    2019
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
Statistical and Topological Quantification of Shape Features in Biomedical Imaging
生物医学成像中形状特征的统计和拓扑量化
  • 批准号:
    2283918
  • 财政年份:
    2019
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
Developing statistical and topological learning methodologies for high-dimensional complex data
开发高维复杂数据的统计和拓扑学习方法
  • 批准号:
    RGPIN-2016-05167
  • 财政年份:
    2019
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了