Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures

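Basic Information

  • Grant number:
    RGPIN-2020-05133
  • Principal investigator:
  • Amount:
    $13,100
  • Host institution:
  • Host institution country:
    Canada
  • Project category:
    Discovery Grants Program - Individual
  • Fiscal year:
    2022
  • Funding country:
    Canada
  • Period:
    2022-01-01 to 2023-12-31
  • Project status:
    Completed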

Project Abstract

The challenge of precision medicine is to fit treatments or recommendations appropriately to each individual. Large amounts of resources are being used to generate genetic data in the hope that it will enable tailored decision making. Since genotyping costs have now dropped below those of several routine clinical tests, at least seven large health care systems have invested in genome-wide genotyping of a large proportion of their populations, for whom electronic health record data are available. These data are being used to develop polygenic risk scores (PRS), which can predict complex diseases on the basis of genetic data and thus have the potential to improve clinical care via precision medicine, which would be of great interest to the Canadian population. Analytic tools that increase prediction accuracy are needed to maximize the productivity of these investments. In the context of clinical decision making, there is also a need to understand which variables are driving these predictions. Indeed, substantive experts are reluctant to use so-called black-box algorithms from the machine learning literature because they lack interpretability and transparency. It is difficult to know how such an algorithm makes its decisions, which can have serious ethical consequences. On the other hand, while many of the models developed in the statistical literature are interpretable and provide measures of uncertainty around their parameter estimates, they do not scale to the massive amounts of data being generated today. It is becoming increasingly important for statisticians not only to develop theoretically justified methods, but also to consider practical issues such as computational algorithms, data format and software. Considering these components in tandem is a step towards more appropriate methods being used in practice.
To this end, this proposal is organized around three themes: 1) to develop the theory and computational algorithms for new high-dimensional linear mixed models for variable selection and prediction in correlated or grouped data; 2) to propose interaction models between a key exposure and a high-dimensional dataset (e.g. gene-environment interactions); and 3) to develop prediction tools from high-dimensional data for survival time endpoints. Our methods will be implemented in user-friendly software, with careful consideration of data format and storage, in order to promote wider uptake of more complex models by data analysts. The results from this project will help to establish me as a new researcher with expertise in variable selection and prediction models for high-dimensional data with complex structures and make me competitive nationally and internationally.

Project Outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Bhatnagar, Sahir

Results of a RCT on a Transition Support Program for Adults with ASD: Effects on Self-Determination and Quality of Life
  • DOI:
    10.1002/aur.2027
  • Publication date:
    2018-12-01
  • Journal:
  • Impact factor:
    4.7
  • Authors:
    Nadig, Aparna; Flanagan, Tara; Bhatnagar, Sahir
  • Corresponding author:
    Bhatnagar, Sahir

Other grants held by Bhatnagar, Sahir

Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    RGPIN-2020-05133
  • Fiscal year:
    2021
  • Funding amount:
    $13,100
  • Project category:
    Discovery Grants Program - Individual
Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    RGPIN-2020-05133
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Discovery Grants Program - Individual
Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    DGECR-2020-00344
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Discovery Launch Supplement

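Similar NSFC Grants

Intelligent Patent Analysis for Optimized Technology Stack Selection: Blockchain Business Registry Case Demonstration
  • Grant number:
  • Approval year:
    2024
  • Funding amount:
    CNY (amount not given)
  • Project category:
    Research Fund for International Scientists
Application of Linkage Group Selection to the Study of Phenotype-Related Genes in Eimeria tenella
  • Grant number:
    30700601
  • Approval year:
    2007
  • Funding amount:
    170,000 CNY
  • Project category:
    Young Scientists Fund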

Similar Overseas Grants

Application of machine learning to genomic selection of dairy cattle through improved feed efficiency complex prediction
  • Grant number:
    2887069
  • Fiscal year:
    2023
  • Funding amount:
    $13,100
  • Project category:
    Studentship
Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    RGPIN-2020-05133
  • Fiscal year:
    2021
  • Funding amount:
    $13,100
  • Project category:
    Discovery Grants Program - Individual
Advancing Personalised Care: Treatment Selection and Outcome Prediction Research
  • Grant number:
    2605893
  • Fiscal year:
    2021
  • Funding amount:
    $13,100
  • Project category:
    Studentship
A Prediction Model for Algorithm Selection in Solving Combinatorial Optimisation Problems.
  • Grant number:
    2608381
  • Fiscal year:
    2021
  • Funding amount:
    $13,100
  • Project category:
    Studentship
Elucidation of energy balance and environmental selection during migration of juvenile salmon and prediction of adaptation to global warming
  • Grant number:
    20H00428
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Grant-in-Aid for Scientific Research (A)
Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    RGPIN-2020-05133
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Discovery Grants Program - Individual
Prediction of binding sites between proteins by deep learning, and development of the system to help facilitate target site selection
  • Grant number:
    20K12048
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Variable Selection and Prediction for High-Dimensional Genetic Data with Complex Structures
  • Grant number:
    DGECR-2020-00344
  • Fiscal year:
    2020
  • Funding amount:
    $13,100
  • Project category:
    Discovery Launch Supplement
Development of KYT sheet and KYT app for improvement of risk prediction awareness and selection of evacuation method in tsunami evacuation
  • Grant number:
    19H01723
  • Fiscal year:
    2019
  • Funding amount:
    $13,100
  • Project category:
    Grant-in-Aid for Scientific Research (B)
Effect of sensory prediction error on reward-based action selection
  • Grant number:
    18K17893
  • Fiscal year:
    2018
  • Funding amount:
    $13,100
  • Project category:
    Grant-in-Aid for Early-Career Scientists