Probabilistic methods towards understanding complex human phenotypes using genomic and healthcare data
使用基因组和医疗数据理解复杂人类表型的概率方法
基本信息
- 批准号:RGPIN-2019-06216
- 负责人:
- 金额:$ 2.84万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2022
- 资助国家:加拿大
- 起止时间:2022-01-01 至 2023-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The advent of massive biological datasets challenge existing analytic frameworks. Large genomic profiling data confer the molecular basis to link mutations to gene expression changes in specific tissues. The broad adoption of EHR systems creates rich phenotypic data including diagnostic code, lab tests, and questionnaires. These data provide promising venues for developing novel machine learning methods to elucidate the biological mechanisms that give rise to the phenotypic diversities and interdependence. However, due to the lack of scalable inference methods, existing research is often limited to analyzing only a small snapshot of the entire datasets and unable to account for the sparse, multimodal, longitudinal, irregularly sampled, and non-missing-at-random nature of the data. Our long-term vision is to develop novel machine learning methods to decipher, in a human-understandable manner, the etiology of diverse phenotypes based on genetic variants, cell-type specificities, genomic regulatory elements, gene and pathway functions, and their interactions with environments. In pursuing this vision, we propose four short-term objectives. We will develop: 1. Bayesian model to account for the multi-modality of the heterogeneous data distributions and predict composite biomarkers by associating genes, tissues, lab results, diagnosis codes via latent phenotypic topics, 2. generative model to impute correlated non-randomly missing lab results and answers to self-reported questionnaires in patients' EHR and gene expression in inaccessible tissue samples of new patients, 3. unsupervised model to infer latent trajectory of diverse patients' health states based on their longitudinal and irregularly sampled outpatient and inpatient medical records, 4. hierarchical Bayesian network that leverages the functional impacts of sequence mutations inferred from genomic data and jointly infer the directed paths from driver genetic variants, causal genes and pathways, and to phenotypes. The key innovation of our proposed methods is that, in contrast to the existing ad hoc methods, we learn all components of our proposed models simultaneously (despite their complexity) and therefore harmonize diverse datasets with complementary information. We achieve this by scalable variational inference algorithms that leverage probability theory and deep learning techniques. The proposed research will advance Bayesian learning for mining massive heterogenous data with impactful applications in medicine including composite biomarker discovery, imputation-based clinical recommendations, forecasting health trajectories, personalized risk predictions, deep interpretable models for inferring causal mutations and disease risks. Together, we present a step towards bridging the gap between the genome and the phenome by efficient Bayesian integrations of massive data, thereby improving our understanding of the cascading events from genetic mutations to a broad phenotypic spectrum.
大规模生物数据集的出现挑战了现有的分析框架。大量的基因组分析数据赋予了将突变与特定组织中的基因表达变化联系起来的分子基础。EHR系统的广泛采用创造了丰富的表型数据,包括诊断代码,实验室测试和问卷调查。这些数据为开发新的机器学习方法提供了有希望的场所,以阐明引起表型差异和相互依赖的生物学机制。然而,由于缺乏可扩展的推理方法,现有的研究往往仅限于分析整个数据集的一小部分快照,无法解释数据的稀疏性、多模态性、纵向性、不规则采样性和非随机缺失性。 我们的长期愿景是开发新的机器学习方法,以人类可理解的方式破译基于遗传变异,细胞类型特异性,基因组调控元件,基因和途径功能及其与环境的相互作用的不同表型的病因。为了实现这一愿景,我们提出了四个短期目标。我们将开发:1。贝叶斯模型解释异质数据分布的多模态,并通过潜在表型主题将基因、组织、实验室结果、诊断代码相关联来预测复合生物标志物,2.生成模型来插补相关的非随机缺失的实验室结果和对患者的EHR中的自我报告问卷的回答以及新患者的不可接近的组织样本中的基因表达,3.基于纵向和不规则采样的门诊和住院病历推断不同患者健康状态的潜在轨迹的无监督模型,4.分层贝叶斯网络,利用从基因组数据推断的序列突变的功能影响,并联合推断从驱动遗传变异、致病基因和途径到表型的定向路径。我们提出的方法的关键创新在于,与现有的特设方法相比,我们同时学习我们提出的模型的所有组成部分(尽管它们很复杂),因此用互补信息协调不同的数据集。我们通过利用概率论和深度学习技术的可扩展变分推理算法来实现这一目标。拟议的研究将推进贝叶斯学习,用于挖掘大量异质数据,并在医学中具有影响力的应用,包括复合生物标志物发现,基于估算的临床建议,预测健康轨迹,个性化风险预测,用于推断因果突变和疾病风险的深度可解释模型。总之,我们提出了一个步骤,通过有效的贝叶斯集成的大量数据,从而提高我们的理解级联事件从基因突变到广泛的表型谱的基因组和表型之间的差距差距。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Li, Yue其他文献
Cotton Bsr-k1 modulates lignin deposition participating in plant resistance against Verticillium dahliae and Fusarium oxysporum
- DOI:
10.1007/s10725-021-00742-4 - 发表时间:
2021-08-18 - 期刊:
- 影响因子:4.2
- 作者:
Li, Yue;Zhou, Yaojun;Liu, Xiaodong - 通讯作者:
Liu, Xiaodong
Autophagy in renal fibrosis: Protection or promotion?
- DOI:
10.3389/fphar.2022.963920 - 发表时间:
2022 - 期刊:
- 影响因子:5.6
- 作者:
Dai, Rong;Zhang, Lei;Jin, Hua;Wang, Dong;Cheng, Meng;Sang, Tian;Peng, Chuyi;Li, Yue;Wang, Yiping - 通讯作者:
Wang, Yiping
An artificially-intelligent cornea with tactile sensation enables sensory expansion and interaction.
- DOI:
10.1038/s41467-023-42240-3 - 发表时间:
2023-11-07 - 期刊:
- 影响因子:16.6
- 作者:
Qu, Shangda;Sun, Lin;Zhang, Song;Liu, Jiaqi;Li, Yue;Liu, Junchi;Xu, Wentao - 通讯作者:
Xu, Wentao
Low-Molecular Weight Small Molecules Can Potently Bind RNA and Affect Oncogenic Pathways in Cells.
- DOI:
10.1021/jacs.2c08770 - 发表时间:
2022-11-16 - 期刊:
- 影响因子:15
- 作者:
Suresh, Blessy M.;Akahori, Yoshihiro;Taghavi, Amirhossein;Crynen, Gogce;Gibaut, Quentin M. R.;Li, Yue;Disney, Matthew D. - 通讯作者:
Disney, Matthew D.
Style of deep faulting and its control on the Paleogene basin evolved into the western Huimin depression, Eastern China
深断裂样式及其对古近纪盆地演化成中国东部惠民凹陷西部的控制
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Ni, Jin-Long;Liu, Jun-Lai;Lin, Yu-Xiang;Wang, Zhi-Min;Han, Zuo-Zhen;Li, Yue;Cao, Shu-Yun - 通讯作者:
Cao, Shu-Yun
Li, Yue的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Li, Yue', 18)}}的其他基金
Probabilistic methods towards understanding complex human phenotypes using genomic and healthcare data
使用基因组和医疗数据理解复杂人类表型的概率方法
- 批准号:
RGPIN-2019-06216 - 财政年份:2021
- 资助金额:
$ 2.84万 - 项目类别:
Discovery Grants Program - Individual
Probabilistic methods towards understanding complex human phenotypes using genomic and healthcare data
使用基因组和医疗数据理解复杂人类表型的概率方法
- 批准号:
RGPIN-2019-06216 - 财政年份:2020
- 资助金额:
$ 2.84万 - 项目类别:
Discovery Grants Program - Individual
Probabilistic methods towards understanding complex human phenotypes using genomic and healthcare data
使用基因组和医疗数据理解复杂人类表型的概率方法
- 批准号:
DGECR-2019-00253 - 财政年份:2019
- 资助金额:
$ 2.84万 - 项目类别:
Discovery Launch Supplement
Probabilistic methods towards understanding complex human phenotypes using genomic and healthcare data
使用基因组和医疗数据理解复杂人类表型的概率方法
- 批准号:
RGPIN-2019-06216 - 财政年份:2019
- 资助金额:
$ 2.84万 - 项目类别:
Discovery Grants Program - Individual
Multisensory integration training to improve working memory
多感觉统合训练改善工作记忆
- 批准号:
515134-2017 - 财政年份:2017
- 资助金额:
$ 2.84万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Master's
Discovery of RNA Biomarkers for Prostate Cancer using High-Throughput Sequencing Data
使用高通量测序数据发现前列腺癌的 RNA 生物标志物
- 批准号:
426531-2012 - 财政年份:2014
- 资助金额:
$ 2.84万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Discovery of RNA Biomarkers for Prostate Cancer using High-Throughput Sequencing Data
使用高通量测序数据发现前列腺癌的 RNA 生物标志物
- 批准号:
426531-2012 - 财政年份:2013
- 资助金额:
$ 2.84万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Discovery of RNA Biomarkers for Prostate Cancer using High-Throughput Sequencing Data
使用高通量测序数据发现前列腺癌的 RNA 生物标志物
- 批准号:
426531-2012 - 财政年份:2012
- 资助金额:
$ 2.84万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Doctoral
Machine-learning in finding correlates of immunity against pertussis
机器学习寻找百日咳免疫力的相关性
- 批准号:
393825-2010 - 财政年份:2010
- 资助金额:
$ 2.84万 - 项目类别:
Alexander Graham Bell Canada Graduate Scholarships - Master's
A probabilistic model for whole-proteome analyses
全蛋白质组分析的概率模型
- 批准号:
384849-2009 - 财政年份:2009
- 资助金额:
$ 2.84万 - 项目类别:
University Undergraduate Student Research Awards
相似国自然基金
复杂图像处理中的自由非连续问题及其水平集方法研究
- 批准号:60872130
- 批准年份:2008
- 资助金额:28.0 万元
- 项目类别:面上项目
Computational Methods for Analyzing Toponome Data
- 批准号:60601030
- 批准年份:2006
- 资助金额:17.0 万元
- 项目类别:青年科学基金项目
相似海外基金
MCA: Towards a Theory of Engineering Identity Development & Persistence of Minoritized Students with Imposter Feelings: A Longitudinal Mixed-methods Study of Developmental Networks
MCA:迈向工程身份发展理论
- 批准号:
2421846 - 财政年份:2024
- 资助金额:
$ 2.84万 - 项目类别:
Standard Grant
State Transformation in Historical Syria: Towards New Middle Eastern Area Studies with Mixed Methods
历史上叙利亚的国家转型:混合方法的新中东地区研究
- 批准号:
23H00043 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Grant-in-Aid for Scientific Research (A)
Towards treatment for the complex patient: investigations of low-intensity focused ultrasound.
针对复杂患者的治疗:低强度聚焦超声的研究。
- 批准号:
10775216 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Towards the understanding of how chaperones function and prevent amyloidogenic diseases
了解伴侣如何发挥作用并预防淀粉样蛋白形成疾病
- 批准号:
10734397 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Towards equitable early identification of autism spectrum disorders in females
实现女性自闭症谱系障碍的公平早期识别
- 批准号:
10722011 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
PTSD and Autoimmune Disease: Towards Causal Effects, Risk Factors, and Mitigators
创伤后应激障碍 (PTSD) 和自身免疫性疾病:因果效应、危险因素和缓解措施
- 批准号:
10696671 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Towards a neurobiology of "oromanual" motor control: behavioral analysis and neural mechanisms
走向“手动”运动控制的神经生物学:行为分析和神经机制
- 批准号:
10819032 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Towards an integrated analytics solution to creating a spatially-resolved single-cell multi-omics brain atlas
寻求集成分析解决方案来创建空间解析的单细胞多组学大脑图谱
- 批准号:
10724843 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
Towards personalized medicine: pathophysiologic contributions to post-stroke sleep apnea
迈向个性化医疗:中风后睡眠呼吸暂停的病理生理学贡献
- 批准号:
10654941 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:
On Demand Dissoluble Supramolecular Hydrogels: Towards Pain Free Burn Dressings
按需可溶性超分子水凝胶:迈向无痛烧伤敷料
- 批准号:
10658220 - 财政年份:2023
- 资助金额:
$ 2.84万 - 项目类别:














{{item.name}}会员




