Statistical Machine Learning for Large High Dimensional Complex Data

大型高维复杂数据的统计机器学习

基本信息

  • 批准号:
    RGPIN-2018-05981
  • 负责人:
  • 金额:
    $ 1.31万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2018
  • 资助国家:
    加拿大
  • 起止时间:
    2018-01-01 至 2019-12-31
  • 项目状态:
    已结题

项目摘要

1. Machine Learning Theory****** As the core of the modern artificial intelligence, deep learning methods are reshaping our world on a daily basis. However, a general and unified framework still has not been established. Consequently, the powerful deep neural network method still remains as a black box. This lack of theoretical understanding also created difficulties to further improve an effective deep learning method by significantly reducing the computational complexity and extend it to handle more complex situations such as high dimensional non-stationary time series data. ******I would like to establish a theoretical framework and then investigate the theoretical properties of deep learning by building a mathematical and statistical model to describe the behaviors of the deep learning method. In the past, we have been successful in establishing a unified theoretical framework and derive theoretical properties such as convergence and stability for a class of dynamic clustering methods including the famous mean-shift algorithm that has been widely used in image analysis. We will extend our framework to investigate some fundamental issues of machine learning. Our approach is based on statistical mechanics and information theory. Method from physics such as quantum field theory will also be employed. ******2. High Dimensional Statistical Inference ******One of the common challenges for analyzing high dimensional complex data is to reduce the dimensionality efficiently to a manageable level. Common methods include regularization type of method such as Lasso and Bayesian inference which usually reduce the problem into a subspace. Monte Carlo Markov Chain is also an essential tool for Bayesian inference for high dimensional data when the posterior is not analytically trackable. ***We have developed a very effective high dimensional optimization method for searching global optimum for non-convex objective functions. Our method is based on the idea of geodesics or exponential map which is an essential concept in general relativity. We will extend our method to improve the numerical performance of various dimensionality reduction methods and also apply it to high dimensional Bayesian including MCMC. ******3.Statistical Analysis of Brain Data****** We are developing some novel and effective mathematical and statistical models for highly oscillatory non-stationary time series. The new model will take a drastically different approach than most of the traditional spatial-temporal methods which often integrates deterministic and stochastic component in either an additive or certain very restrictive fashion. Our approach will consider the underlying biological mechanism first and then consider other factors that can be best modelled by statistical models. We will also consider a hidden dimension approach which take into account the fact that the information we obtained could be incomplete.
1. 机器学习理论******作为现代人工智能的核心,深度学习方法每天都在重塑我们的世界。然而,一个通用的、统一的框架还没有建立起来。因此,强大的深度神经网络方法仍然是一个黑盒子。这种理论理解的缺乏也给进一步改进有效的深度学习方法带来了困难,无法显著降低计算复杂度,并将其扩展到处理更复杂的情况,如高维非平稳时间序列数据。******我想建立一个理论框架,然后通过建立一个数学和统计模型来描述深度学习方法的行为来研究深度学习的理论性质。在过去,我们已经成功地建立了一个统一的理论框架,并推导了一类动态聚类方法的理论性质,如收敛性和稳定性,包括在图像分析中广泛使用的著名的mean-shift算法。我们将扩展我们的框架来研究机器学习的一些基本问题。我们的方法是基于统计力学和信息论。物理学的方法,如量子场论也将被采用。* * * * * * 2。高维统计推断******分析高维复杂数据的常见挑战之一是有效地将维数降至可管理的水平。常用的方法包括正则化类型的方法,如Lasso和贝叶斯推理,它们通常将问题缩减到一个子空间。当后验不可分析跟踪时,蒙特卡洛马尔可夫链也是对高维数据进行贝叶斯推理的重要工具。***我们开发了一种非常有效的高维优化方法来搜索非凸目标函数的全局最优。我们的方法是基于广义相对论中的基本概念测地线或指数映射的思想。我们将扩展我们的方法来提高各种降维方法的数值性能,并将其应用于包括MCMC在内的高维贝叶斯。* * * * * * 3。脑数据的统计分析******我们正在为高度振荡的非平稳时间序列开发一些新颖有效的数学和统计模型。新模型将采取一种与大多数传统时空方法截然不同的方法,这些方法通常以一种附加的或某种非常限制的方式集成确定性和随机成分。我们的方法将首先考虑潜在的生物学机制,然后考虑其他因素,这些因素可以最好地通过统计模型建模。我们还将考虑一种隐藏维度方法,该方法考虑到我们获得的信息可能是不完整的。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Wang, Steven其他文献

Liquid metal droplets bouncing higher on thicker water layer.
  • DOI:
    10.1038/s41467-023-39348-x
  • 发表时间:
    2023-06-14
  • 期刊:
  • 影响因子:
    16.6
  • 作者:
    Dai, Yuhang;Li, Minfei;Ji, Bingqiang;Wang, Xiong;Yang, Siyan;Yu, Peng;Wang, Steven;Hao, Chonglei;Wang, Zuankai
  • 通讯作者:
    Wang, Zuankai
Reasons for Phosphate Binder Discontinuation Vary by Binder Type
  • DOI:
    10.1053/j.jrn.2013.11.004
  • 发表时间:
    2014-03-01
  • 期刊:
  • 影响因子:
    3.2
  • 作者:
    Wang, Steven;Anum, Emmanuel A.;Newsome, Britt
  • 通讯作者:
    Newsome, Britt
Identification of tyrosine sulfation in the variable region of a bispecific antibody and its effect on stability and biological activity.
  • DOI:
    10.1080/19420862.2023.2259289
  • 发表时间:
    2023-01
  • 期刊:
  • 影响因子:
    5.3
  • 作者:
    Lietz, Christopher B.;Deyanova, Ekaterina;Cho, Younhee;Cordia, Jon;Franc, Sarah;Kabro, Sally;Wang, Steven;Mikolon, David;Banks, Douglas D.
  • 通讯作者:
    Banks, Douglas D.
The Effect of Longer Dosing Intervals for Long-Acting Injectable Antipsychotics on Outcomes in Schizophrenia.
  • DOI:
    10.2147/ndt.s395383
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    3.2
  • 作者:
    Milz, Ruth;Benson, Carmela;Knight, Karl;Antunes, Jose;Najarian, Dean;Rengel, Paola -Maria Lopez;Wang, Steven;Richarz, Ute;Gopal, Srihari;Kane, John M.
  • 通讯作者:
    Kane, John M.
One-dimensionally oriented self-assembly of ordered mesoporous nanofibers featuring tailorable mesophases via kinetic control.
  • DOI:
    10.1038/s41467-023-43963-z
  • 发表时间:
    2023-12-09
  • 期刊:
  • 影响因子:
    16.6
  • 作者:
    Peng, Liang;Peng, Huarong;Wang, Steven;Li, Xingjin;Mo, Jiaying;Wang, Xiong;Tang, Yun;Che, Renchao;Wang, Zuankai;Li, Wei;Zhao, Dongyuan
  • 通讯作者:
    Zhao, Dongyuan

Wang, Steven的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Wang, Steven', 18)}}的其他基金

Statistical Machine Learning for Large High Dimensional Complex Data
大型高维复杂数据的统计机器学习
  • 批准号:
    RGPIN-2018-05981
  • 财政年份:
    2022
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical Machine Learning for Large High Dimensional Complex Data
大型高维复杂数据的统计机器学习
  • 批准号:
    RGPIN-2018-05981
  • 财政年份:
    2021
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical Machine Learning for Large High Dimensional Complex Data
大型高维复杂数据的统计机器学习
  • 批准号:
    RGPIN-2018-05981
  • 财政年份:
    2020
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical Machine Learning for Large High Dimensional Complex Data
大型高维复杂数据的统计机器学习
  • 批准号:
    RGPIN-2018-05981
  • 财政年份:
    2019
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical analysis of complex data sets
复杂数据集的统计分析
  • 批准号:
    261470-2008
  • 财政年份:
    2011
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical analysis of complex data sets
复杂数据集的统计分析
  • 批准号:
    261470-2008
  • 财政年份:
    2010
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical analysis of complex data sets
复杂数据集的统计分析
  • 批准号:
    261470-2008
  • 财政年份:
    2009
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical analysis of complex data sets
复杂数据集的统计分析
  • 批准号:
    261470-2008
  • 财政年份:
    2008
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical data mining and information integration
统计数据挖掘与信息整合
  • 批准号:
    261470-2003
  • 财政年份:
    2007
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual
Statistical data mining and information integration
统计数据挖掘与信息整合
  • 批准号:
    261470-2003
  • 财政年份:
    2006
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Grants Program - Individual

相似国自然基金

Understanding structural evolution of galaxies with machine learning
  • 批准号:
    n/a
  • 批准年份:
    2022
  • 资助金额:
    10.0 万元
  • 项目类别:
    省市级项目

相似海外基金

Comparison of Machine Learning and Conventional Statistical Modeling for Predicting Readmission Following Acute Heart Failure Hospitalization
机器学习与传统统计模型预测急性心力衰竭住院后再入院的比较
  • 批准号:
    495410
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
Modern Statistics and Statistical Machine Learning
现代统计学和统计机器学习
  • 批准号:
    2886365
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
Modern Statistics and Statistical Machine Learning
现代统计学和统计机器学习
  • 批准号:
    2886852
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
EAGER: SSMCDAT2023: Revealing Local Symmetry Breaking in Intermetallics: Combining Statistical Mechanics and Machine Learning in PDF Analysis
EAGER:SSMCDAT2023:揭示金属间化合物中的局部对称性破缺:在 PDF 分析中结合统计力学和机器学习
  • 批准号:
    2334261
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
REU Site: University of North Carolina at Greensboro - Complex Data Analysis using Statistical and Machine Learning Tools
REU 站点:北卡罗来纳大学格林斯伯勒分校 - 使用统计和机器学习工具进行复杂数据分析
  • 批准号:
    2244160
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
Next-Generation Algorithms in Statistical Genetics Based on Modern Machine Learning
基于现代机器学习的下一代统计遗传学算法
  • 批准号:
    10714930
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
Unravel machine learning blackboxes -- A general, effective and performance-guaranteed statistical framework for complex and irregular inference problems in data science
揭开机器学习黑匣子——针对数据科学中复杂和不规则推理问题的通用、有效和性能有保证的统计框架
  • 批准号:
    2311064
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Standard Grant
A Novel Approach to Semi-Supervised Statistical Machine Learning
半监督统计机器学习的新方法
  • 批准号:
    DP230101671
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Discovery Projects
Modern Statistics and Statistical Machine Learning
现代统计学和统计机器学习
  • 批准号:
    2886723
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
Modern Statistics and Statistical Machine Learning
现代统计学和统计机器学习
  • 批准号:
    2886777
  • 财政年份:
    2023
  • 资助金额:
    $ 1.31万
  • 项目类别:
    Studentship
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了