Adaptive Statistical Methods for Genetic Association Studies


Project Abstract

DESCRIPTION (provided by applicant): The major focus of this project is the development of methodologies for high-dimensional data that arise from emerging high-throughput genomic technologies. The types of data that we focus on are single nucleotide polymorphism (SNP) data from genome-wide association studies (GWAS) and whole-exome sequencing data, though many of the methods developed here can be readily applied to other types of high-dimensional data. One feature of these data is that the number of predictors (genes or SNPs) p is typically much larger than the number of observations n. The key to handling these high-dimensional data is to reduce the dimensionality effectively.

There are several challenges in reducing the dimensionality. First, many variants contribute to complex diseases. GWAS target common variants that typically have only modest effects, whereas the variants with larger effects found in sequencing studies are rarer. As a consequence, the variants that are associated with the trait do not stand out, because of stochastic variation as well as the number of variants under study. Second, many of these variants act in combination with environmental factors and other variants. This poses even more challenges, as the number of potential gene-environment and gene-gene interactions is much greater than the number of marginal analyses. Third, to elucidate complex disease risk, a comprehensive approach that considers many genetic variants, environmental factors, and their interactions is needed. Developing methods that deal with large numbers of variants and environmental factors is the focus of this project.

Using adaptive function estimation techniques, which have been developed for many large nonparametric regression problems, we will develop a suite of statistical and computational techniques for identifying environmental factors that modify genetic effects, for predicting disease risk from many thousands of SNPs, and for identifying significant predictors in exome sequencing studies. In adaptive function estimation, an unknown function is modeled as a combination of many basis functions. Model selection techniques, such as the lasso and boosting, have been developed for selecting which combination of basis functions is best at predicting a (disease) outcome. These approaches are well suited to the problems studied in this project. The investigators on this project are directly involved in a number of genetic association studies as (principal) investigators. The specific aims that we propose respond to actual analysis problems facing these studies. This direct relation to ongoing studies ensures the relevance of the methods we intend to develop.

PUBLIC HEALTH RELEVANCE: The major focus of this proposal is the development of analytical approaches for high-dimensional data that arise from genome-wide association studies and whole-exome sequencing studies. In particular, we propose to develop adaptive methods to construct predictive models and to identify gene-environment interactions in GWAS, and to improve power for association studies in whole-exome sequencing studies.
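To make the idea of selecting basis functions with the lasso concrete, here is a minimal sketch in standard-library Python. It is an illustration under stated assumptions, not the investigators' method: the basis functions are taken to be centred SNP main effects plus SNP-by-environment products, the data are simulated, the solver is plain cyclic coordinate descent, and the names `simulate`, `lasso_coordinate_descent`, and the penalty value `lam=0.3` are hypothetical choices made for this example.

```python
import random


def soft_threshold(z, gamma):
    """Soft-thresholding operator used in each lasso coordinate update."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/2n)*||y - X b||^2 + lam*||b||_1 by cyclic coordinate descent.

    X is a list of n rows, each holding the p basis-function values.
    No intercept is fitted (assumption: y and the columns are roughly centred).
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X beta, starting from beta = 0
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of column j with the residual that excludes column j.
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_b = soft_threshold(rho, lam) / col_sq[j]
            delta = new_b - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_b
    return beta


def simulate(n=60, n_snps=40, seed=1):
    """Hypothetical toy data with more basis functions (80) than observations (60)."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        # Additive genotype codes 0/1/2, centred at their expected mean of 1.
        g = [(rng.random() < 0.5) + (rng.random() < 0.5) - 1.0 for _ in range(n_snps)]
        e = rng.gauss(0.0, 1.0)  # one environmental factor
        X.append(g + [gj * e for gj in g])  # main effects, then SNP x E products
        # True signal: a main effect of SNP3 and an interaction of SNP7 with E.
        y.append(2.0 * g[3] + 2.0 * g[7] * e + rng.gauss(0.0, 0.3))
    names = [f"SNP{j}" for j in range(n_snps)] + [f"SNP{j}xE" for j in range(n_snps)]
    return X, y, names


if __name__ == "__main__":
    X, y, names = simulate()
    beta = lasso_coordinate_descent(X, y, lam=0.3)
    for name, b in zip(names, beta):
        if b != 0.0:
            print(f"{name}: {b:.3f}")
```

The L1 penalty sets most coefficients exactly to zero, which is what makes the lasso a selection procedure when p exceeds n. In practice the penalty would be chosen by cross-validation rather than fixed by hand, and a real analysis would also adjust for an intercept and the main effect of the environmental factor.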

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)



Other grants by Charles L Kooperberg

Physical Activity to Improve CV Health in Older Women: A Pragmatic Trial
  • Grant number: 10274794
  • Fiscal year: 2020
  • Funding amount: $352,000
Physical Activity to Improve CV Health in Older Women: A Pragmatic Trial
  • Grant number: 10688242
  • Fiscal year: 2020
  • Funding amount: $352,000
Physical Activity to Improve CV Health in Older Women: A Pragmatic Trial
  • Grant number: 10652593
  • Fiscal year: 2020
  • Funding amount: $352,000
Trans-omics elucidation of genetic architecture underlying cardiovascular and HLBS diseases
  • Grant number: 9895848
  • Fiscal year: 2019
  • Funding amount: $352,000
Whole Genome Sequence Analysis of Ischemic Stroke in the Women's Health Initiative
  • Grant number: 9290440
  • Fiscal year: 2017
  • Funding amount: $352,000
Research Program: Biostatistics and Computational Biology
  • Grant number: 8804802
  • Fiscal year: 2015
  • Funding amount: $352,000
Physical Activity to Improve CV Health in Older Women: A Pragmatic Trial -- DCC
  • Grant number: 9010974
  • Fiscal year: 2015
  • Funding amount: $352,000
Physical Activity to Improve CV Health in Older Women: A Pragmatic Trial -- DCC
  • Grant number: 9212845
  • Fiscal year: 2015
  • Funding amount: $352,000
Exonic variants and their relation to complex traits in minorities of the WHI
  • Grant number: 9527426
  • Fiscal year: 2013
  • Funding amount: $352,000
Exonic variants and their relation to complex traits in minorities of the WHI
  • Grant number: 8571986
  • Fiscal year: 2013
  • Funding amount: $352,000

Similar overseas grants

Robust Derivative-Free Algorithms for Complex Optimisation Problems
  • Grant number: DE240100006
  • Fiscal year: 2024
  • Funding amount: $352,000
  • Project category: Discovery Early Career Researcher Award
ATD: Algorithms and Geometric Methods for Community and Anomaly Detection and Robust Learning in Complex Networks
  • Grant number: 2220271
  • Fiscal year: 2023
  • Funding amount: $352,000
  • Project category: Standard Grant
CAREER: Toward Real-Time, Constraint-Aware Control of Complex Dynamical Systems: from Theory and Algorithms to Software Tools
  • Grant number: 2238424
  • Fiscal year: 2023
  • Funding amount: $352,000
  • Project category: Standard Grant
Hybrid Symbolic-Numeric Algorithms for Complex Nonlinear Systems
  • Grant number: RGPIN-2020-06438
  • Fiscal year: 2022
  • Funding amount: $352,000
  • Project category: Discovery Grants Program - Individual
Optimization Models and Algorithms for Complex Production Planning Problems
  • Grant number: RGPIN-2019-05759
  • Fiscal year: 2022
  • Funding amount: $352,000
  • Project category: Discovery Grants Program - Individual
Algebraic analysis of deformations of non-isolated singularities, computational complex analysis and algorithms
  • Grant number: 22K03334
  • Fiscal year: 2022
  • Funding amount: $352,000
  • Project category: Grant-in-Aid for Scientific Research (C)
Multiagent Innovative Consensus Algorithms in Complex Negotiation Environments
  • Grant number: 22H00533
  • Fiscal year: 2022
  • Funding amount: $352,000
  • Project category: Grant-in-Aid for Scientific Research (A)
CAREER: Advancing Mathematical Models and Algorithms for Decentralized Optimization in Complex Multi-agent Networks
  • Grant number: 2323159
  • Fiscal year: 2022
  • Funding amount: $352,000
  • Project category: Standard Grant
Optimization Models and Algorithms for Complex Production Planning Problems
  • Grant number: RGPIN-2019-05759
  • Fiscal year: 2021
  • Funding amount: $352,000
  • Project category: Discovery Grants Program - Individual
Models and Algorithms for Optimal Vision-Based Surveillance and Exploration of Complex Environments
  • Grant number: 2110895
  • Fiscal year: 2021
  • Funding amount: $352,000
  • Project category: Standard Grant