Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
基本信息
- 批准号:RGPIN-2015-03805
- 负责人:
- 金额:$ 1.02万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2016
- 资助国家:加拿大
- 起止时间:2016-01-01 至 2017-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
INTRODUCTION: In recent years, we have witnessed the rise of large scale data, colloquially referred to as big data, in different fields of scientific research ranging from biology and medicine to engineering, the social sciences and econometrics. A common statistical problem of interest in the analysis of such data is to model a response variable of interest as a function of a small subset of a large number of features. This is referred to as a feature selection problem. In addition to noise accumulation and spurious correlation, unobserved heterogeneity in high-dimensional data makes the feature selection problem even harder. Finite mixture of regressions (FMR) and mixture-of-experts (MOE) are powerful statistical models for capturing heterogeneity in data. The first part of this proposal focuses on feature selection, estimation and post-selection inference problems in FMR/MOE. The second part concerns varying coefficient finite mixture of regression (VC-FMR) models in which regression coefficients change as smooth functions of an index variable such as time. For example, in market segmentation research, consumer preferences for products often change over time and across different market segments. VC-FMR models provide a natural tool for modeling such phenomena which involves heterogeneous functional data. However, methodological and computational tools for these relatively new models are largely unexplored.
OBJECTIVES: An emphasis of my research program is on developing sound statistical methodology and computationally efficient algorithms for estimation, feature selection, and also post-selection inference such as hypothesis testing and confidence intervals in FMR/MOE in high dimensions. Another focal point of my research concerns estimation and feature selection in VC-FMR. My longer term objectives focus on complex time series data and high-dimensional heterogeneous and dependent data.
METHODS: I will study the regularization techniques LASSO/SCAD for simultaneous parameter estimation and feature selection in FMR/MOE models in high dimensions. Coordinate descent-type expectation-maximization (EM) algorithms will be investigated for numerical computations. Post-selection inference such as hypothesis testing and confidence intervals for parameters in sparse FMR/MOE will be explored based on sample splitting techniques. Regularized local kernel likelihood-based methods will be used for functional parameter estimation and feature selection in VC-FMR models.
IMPACT: My proposed research program will address unresolved statistical issues in FMR/MOE models in high dimensions as well as in VC-FMR models, and offer solutions to practical problems of interest to a broader statistical audience. The proposed methods could then immediately be used to solve scientific problems in areas such as biology, engineering, the health sciences, and marketing research.
简介:近年来,我们见证了大规模数据的兴起,俗称大数据,在不同的科学研究领域,从生物学和医学到工程学,社会科学和计量经济学。在分析这类数据时,一个常见的统计问题是将一个响应变量建模为大量特征的一个小子集的函数,这被称为特征选择问题。除了噪声积累和虚假相关之外,高维数据中未观察到的异质性使特征选择问题更加困难。有限混合回归(FMR)和混合专家(莫伊)是捕捉数据异质性的强大统计模型。本建议的第一部分集中在FMR/莫伊的特征选择,估计和后选择推理问题。第二部分研究变系数有限混合回归(VC-FMR)模型,其中回归系数随时间等指标变量的光滑函数而变化。例如,在市场细分研究中,消费者对产品的偏好往往会随着时间的推移和不同的细分市场而变化。VC-FMR模型提供了一个自然的工具,用于建模这种现象,涉及异构的功能数据。然而,这些相对较新的模型的方法和计算工具在很大程度上是未经探索的。
目的:我的研究计划的重点是开发可靠的统计方法和计算效率高的算法,用于估计、特征选择以及选择后推理,例如高维FMR/莫伊中的假设检验和置信区间。我的研究的另一个重点是VC-FMR中的估计和特征选择。我的长期目标集中在复杂的时间序列数据和高维异构和相关数据。
方法:我将研究正则化技术LASSO/SCAD的同时参数估计和功能选择的FMR/莫伊模型在高维。坐标下降型期望最大化(EM)算法将研究数值计算。基于样本分裂技术,我们将探索稀疏FMR/莫伊模型中参数的假设检验和置信区间等后选择推理,并将正则化局部核似然方法用于VC-FMR模型中的功能参数估计和特征选择。
影响:我建议的研究计划将解决FMR/莫伊模型在高维以及VC-FMR模型中未解决的统计问题,并为更广泛的统计受众感兴趣的实际问题提供解决方案。然后,所提出的方法可以立即用于解决生物学、工程学、健康科学和市场研究等领域的科学问题。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Khalili, Abbas其他文献
Feature selection in finite mixture of sparse normal linear models in high-dimensional feature space
- DOI:
10.1093/biostatistics/kxq048 - 发表时间:
2011-01-01 - 期刊:
- 影响因子:2.1
- 作者:
Khalili, Abbas;Chen, Jiahua;Lin, Shili - 通讯作者:
Lin, Shili
Disseminated Intravascular Coagulation Associated with Large Deletion of Immunoglobulin Heavy Chain
- DOI:
10.18502/ijaai.v20i6.8030 - 发表时间:
2021-12-01 - 期刊:
- 影响因子:1.5
- 作者:
Khalili, Abbas;Yadegari, Amir Hosein;Abolhassani, Hassan - 通讯作者:
Abolhassani, Hassan
Autosomal Recessive Agammaglobulinemia: A Novel Non-sense Mutation in CD79a
- DOI:
10.1007/s10875-014-9989-3 - 发表时间:
2014-02-01 - 期刊:
- 影响因子:9.1
- 作者:
Khalili, Abbas;Plebani, Alessandro;Aghamohammadi, Asghar - 通讯作者:
Aghamohammadi, Asghar
Order Selection in Finite Mixture Models With a Nonsmooth Penalty
- DOI:
10.1198/016214508000001075 - 发表时间:
2008-12-01 - 期刊:
- 影响因子:3.7
- 作者:
Chen, Jiahua;Khalili, Abbas - 通讯作者:
Khalili, Abbas
Order Selection in Finite Mixture Models With a Nonsmooth Penalty
- DOI:
10.1198/jasa.2009.0103 - 发表时间:
2009-03-01 - 期刊:
- 影响因子:3.7
- 作者:
Chen, Jiahua;Khalili, Abbas - 通讯作者:
Khalili, Abbas
Khalili, Abbas的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Khalili, Abbas', 18)}}的其他基金
High-dimensional Data Analysis: Modeling Unobserved Heterogeneity in Data, and Studying Imbalanced Classification Problems
高维数据分析:对数据中未观察到的异质性进行建模,并研究不平衡分类问题
- 批准号:
RGPIN-2020-05011 - 财政年份:2022
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
High-dimensional Data Analysis: Modeling Unobserved Heterogeneity in Data, and Studying Imbalanced Classification Problems
高维数据分析:对数据中未观察到的异质性进行建模,并研究不平衡分类问题
- 批准号:
RGPIN-2020-05011 - 财政年份:2021
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
High-dimensional Data Analysis: Modeling Unobserved Heterogeneity in Data, and Studying Imbalanced Classification Problems
高维数据分析:对数据中未观察到的异质性进行建模,并研究不平衡分类问题
- 批准号:
RGPIN-2020-05011 - 财政年份:2020
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2019
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2018
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2017
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2015
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Model selection and statistical inference in mixture distributions and hidden markov (regression) models
混合分布和隐马尔可夫(回归)模型中的模型选择和统计推断
- 批准号:
386578-2010 - 财政年份:2014
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Model selection and statistical inference in mixture distributions and hidden markov (regression) models
混合分布和隐马尔可夫(回归)模型中的模型选择和统计推断
- 批准号:
386578-2010 - 财政年份:2013
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Model selection and statistical inference in mixture distributions and hidden markov (regression) models
混合分布和隐马尔可夫(回归)模型中的模型选择和统计推断
- 批准号:
386578-2010 - 财政年份:2012
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
Developing Statistical Tools for Data integration and Data Fusion for Finite Population Inference
开发用于有限总体推理的数据集成和数据融合的统计工具
- 批准号:
2242820 - 财政年份:2023
- 资助金额:
$ 1.02万 - 项目类别:
Standard Grant
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2019
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Multivariate Histograms and Inference with Finite Sample Guarantees
具有有限样本保证的多元直方图和推理
- 批准号:
1916074 - 财政年份:2019
- 资助金额:
$ 1.02万 - 项目类别:
Standard Grant
Statistical theory on finite alphabet structures: inference, algorithms, and applications
有限字母表结构的统计理论:推理、算法和应用
- 批准号:
411042450 - 财政年份:2018
- 资助金额:
$ 1.02万 - 项目类别:
Research Fellowships
Bayesian inference of earthquake source parameters: kinematic and dynamic finite fault models
震源参数的贝叶斯推断:运动学和动力有限断层模型
- 批准号:
391058966 - 财政年份:2018
- 资助金额:
$ 1.02万 - 项目类别:
Research Grants
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2018
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2017
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Statistical inference in finite mixture of regressions and mixture-of-experts models in high-dimensional spaces, and varying coefficient finite mixture of regression models
高维空间中回归和专家混合模型的有限混合的统计推断,以及回归模型的变系数有限混合
- 批准号:
RGPIN-2015-03805 - 财政年份:2015
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual
Development of Inference Methods for Finite Normal Mixture Models
有限正态混合模型推理方法的发展
- 批准号:
26380267 - 财政年份:2014
- 资助金额:
$ 1.02万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Decision theoretic inference in problems involving balanced loss functions, constraint parameter spaces and finite mixture models
涉及平衡损失函数、约束参数空间和有限混合模型问题的决策理论推理
- 批准号:
386575-2010 - 财政年份:2014
- 资助金额:
$ 1.02万 - 项目类别:
Discovery Grants Program - Individual