SCH: INT: Collaborative Research: High-throughput Phenotyping on Electronic Health Records using Multi-Tensor Factorization

SCH:INT:协作研究:使用多张量分解对电子健康记录进行高通量表型分析

基本信息

  • 批准号:
    1418504
  • 负责人:
  • 金额:
    $ 69.3万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2014
  • 资助国家:
    美国
  • 起止时间:
    2014-09-01 至 2019-08-31
  • 项目状态:
    已结题

项目摘要

As the adoption of electronic health records (EHRs) has grown, EHRs are now composed of a diverse array of data, including structured information (e.g., diagnoses, medications, and lab results), molecular sequences, unstructured clinical progress notes, and social network information. There is mounting evidence that EHRs are a rich resource for clinical research, but they are notoriously difficult to leverage because of their orientation to healthcare business operations, heterogeneity across commercial systems, and high levels of missing or erroneous entries. Moreover, the interactions among different data sources within an EHR are challenging to model, hampering our ability to leverage traditional analytic frameworks. In recognition of this problem, various efforts have been undertaken to transform EHR data into concise and meaningful concepts, or phenotypes. Yet, to date, these efforts have been ad hoc and labor intensive, resulting in specific phenotypes for specific environments; e.g., type 2 diabetes in the EHR system at Vanderbilt University Medical Center (VUMC). There is an urgent need for scalable phenotyping methods, but several major challenges must be addressed, including: a) patient representation, b) high-throughput phenotype generation from EHRs, c) expert-guided phenotype refinement, and d) phenotype adaptation across institutions. The goal of this project is to address these challenges by developing a general computational framework for transforming EHR data into meaningful phenotypes with only modest levels of expert guidance. The PIs will develop novel courses on Healthcare Analytics as a Massive Open Online Course (MOOC) that covers cross-disciplinary topics at the confluence of computer science and medical informatics, while embellishing existing graduate courses on biomedical informatics. The PIs plan to deliver tutorials and organize workshops at relevant computer science and medical informatics conferences with the goal of sharing research results and developing a community. The PIs will develop outreach modules that focus on freshmen and under-represented students, as well as educational sessions for clinical researchers who are currently performing phenotyping in academic medical centers. Thus, the project has a significant component the integrates research and education as well as providing for new scientific insights.In support of this goal, the team plans to represent and analyze EHR data as inter-connected high-order relations i.e. tensors (e.g. tuples of patient-medication-diagnosis, patient-lab, and patient-symptoms). The proposed analytic framework generalizes several existing data mining methodologies, including dimensionality reduction, topic modeling and co-clustering, which all arise as limited special cases of analyzing second order tensors. It will also enable flexible refinement of candidates to adapt phenotypes from one healthcare institution to another, and will incorporate feedback from domain experts. The accompanying suite of algorithms and methods will enable the automation of high-throughput phenotype generation, refinement, adaptation and applications, in a broad range of health informatics settings and across multiple institutions. This project will integrate biomedical informaticists, computer scientists, and clinical experts. The significance of the resulting phenotypes in diverse clinical applications, including: a) cohort construction, where case and control patients are identified with respect to specific phenotype combinations; b) genome wide association studies (GWAS), where target phenotypes of patients are tested against DNA sequence variation for significant statistical associations; and c) clinical predictive modeling, where a model is developed to predict target phenotypes or diseases will be demonstrated. The framework will be developed with public accessible data from MIMIC-II and CMS and validate in real clinical environments at Northwestern Memorial Hospital and VUMC through several high-impact disease targets (including hypertension, type 2 diabetes, hypothyroidism, atrial fibrillation, rheumatoid arthritis, and multiple sclerosis). Additionally, the methodologies developed through this project will be integrated into existing software platforms that support the representation of EHR-derived phenotypes, but lack a data-driven component for the generation and refinement of candidates. Overall, the proposed framework is expected to have a major impact on translational clinical research including clinical trial design, predictive modeling, epidemiology studies and clinical decision support.
随着电子健康记录(EHR)的普及,EHR现在由各种各样的数据组成,包括结构化信息(例如,诊断、药物和实验室结果)、分子序列、非结构化临床进展笔记和社交网络信息。越来越多的证据表明,电子病历是临床研究的丰富资源,但众所周知,它们很难利用,因为它们面向医疗保健业务运营,跨商业系统的异质性,以及高度缺失或错误的条目。此外,EHR中不同数据源之间的交互建模具有挑战性,阻碍了我们利用传统分析框架的能力。认识到这一问题,已采取各种努力将电子健康记录数据转化为简明和有意义的概念或表型。然而,到目前为止,这些努力是临时的和劳动密集型的,导致了特定环境下的特定表型;例如,范德比尔特大学医学中心(VUMC)EHR系统中的2型糖尿病。迫切需要可扩展的表型方法,但必须解决几个主要挑战,包括:a)患者代表性,b)从EHR高通量产生表型,c)专家指导的表型改进,以及d)跨机构的表型适应。该项目的目标是通过开发一个通用的计算框架,将电子病历数据转化为有意义的表型,只需适度的专家指导,以应对这些挑战。PIS将开发新的医疗保健分析课程,作为一门大型在线开放课程(MOOC),涵盖计算机科学和医学信息学交汇的跨学科主题,同时完善现有的生物医学信息学研究生课程。PIS计划在相关的计算机科学和医学信息学会议上提供教程和组织研讨会,目的是分享研究成果和发展社区。PIS将开发以新生和代表性不足的学生为重点的外展模块,以及为目前在学术医学中心进行表型分析的临床研究人员举办的教育课程。因此,该项目有一个重要的组成部分,整合了研究和教育,并提供了新的科学见解。为了支持这一目标,该团队计划将EHR数据表示和分析为相互关联的高阶关系,即张量(例如,患者-药物-诊断、患者-实验室和患者-症状的元组)。所提出的分析框架概括了现有的几种数据挖掘方法,包括降维、主题建模和共同聚类,这些方法都是分析二阶张量的有限特例。它还将使候选人能够灵活地精炼,以适应从一个医疗机构到另一个医疗机构的表型,并将纳入领域专家的反馈。随附的一套算法和方法将在广泛的健康信息学环境中和跨多个机构实现高通量表型生成、改进、适应和应用的自动化。该项目将整合生物医学信息学家、计算机科学家和临床专家。所得到的表型在不同的临床应用中的重要性,包括:a)队列构建,其中根据特定的表型组合来确定病例和对照患者;b)全基因组关联研究(Gwas),其中对照DNA序列变异对患者的目标表型进行测试,以获得显著的统计学关联;以及c)临床预测建模,其中将展示开发用于预测目标表型或疾病的模型。该框架将利用MIMIC-II和CMS的公共可访问数据开发,并在西北纪念医院和VUMC的真实临床环境中通过几个高影响的疾病目标(包括高血压、2型糖尿病、甲状腺功能减退、房颤、类风湿性关节炎和多发性硬化症)进行验证。此外,通过该项目开发的方法将被整合到现有软件平台中,这些软件平台支持电子病历衍生表型的表示,但缺乏用于生成和完善候选者的数据驱动的组成部分。总体而言,拟议的框架预计将对转化性临床研究产生重大影响,包括临床试验设计、预测性建模、流行病学研究和临床决策支持。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Bradley Malin其他文献

Perceptions and Utilization of Online Peer Support Among Informal Dementia Caregivers: Survey Study
非正式痴呆症护理人员对在线同伴支持的看法和利用:调查研究
  • DOI:
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    4.9
  • 作者:
    Zhijun Yin;Lauren Stratton;Qingyuan Song;Congning Ni;Lijun Song;Patricia Commiskey;Qingxia Chen;Monica Moreno;Sam Fazio;Bradley Malin
  • 通讯作者:
    Bradley Malin
Risk, trust, and altruism in genetic data sharing
遗传数据共享中的风险、信任和利他主义
  • DOI:
    10.1111/jpet.12678
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    1.1
  • 作者:
    Zeeshan Samad;M. Wooders;Bradley Malin;Yevgeniy Vorobeychik
  • 通讯作者:
    Yevgeniy Vorobeychik
Digital Information Ecosystems in Modern Care Coordination and Patient Care Pathways and the Challenges and Opportunities for AI Solutions
现代护理协调和患者护理路径中的数字信息生态系统以及人工智能解决方案的挑战与机遇
  • DOI:
    10.2196/60258
  • 发表时间:
    2024-01-01
  • 期刊:
  • 影响因子:
    6.000
  • 作者:
    You Chen;Christoph U Lehmann;Bradley Malin
  • 通讯作者:
    Bradley Malin
Introducing JMIR AI
JMIR 人工智能简介
  • DOI:
  • 发表时间:
    2022
  • 期刊:
  • 影响因子:
    0
  • 作者:
    K. El Emam;Bradley Malin
  • 通讯作者:
    Bradley Malin
Trail Re-Identification: Learning Who You Are From Where You Have Been
踪迹重新识别:从你去过的地方了解你是谁
  • DOI:
  • 发表时间:
    2003
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Bradley Malin;Latanya Sweeney;Elaina Newton
  • 通讯作者:
    Elaina Newton

Bradley Malin的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Bradley Malin', 18)}}的其他基金

Collaborative Research: Workshop to Develop a Roadmap for Greater Public Use of Privacy-Sensitive Government Data
合作研究:制定路线图以扩大公众使用隐私敏感政府数据的研讨会
  • 批准号:
    2129909
  • 财政年份:
    2021
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
RAPID: Collaborative: A Privacy Risk Assessment Framework for Person-Level Data Sharing During Pandemics
RAPID:协作:大流行期间个人级数据共享的隐私风险评估框架
  • 批准号:
    2029651
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
CT: Collaborative Research: Experience-Based Access Management (EBAM) for Hospital Information Technology
CT:协作研究:医院信息技术的基于经验的访问管理 (EBAM)
  • 批准号:
    0964063
  • 财政年份:
    2010
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant

相似国自然基金

内源性逆转录病毒MER65-int调控人类胎 盘发育与子宫内膜重塑的功能研究
  • 批准号:
  • 批准年份:
    2025
  • 资助金额:
    10.0 万元
  • 项目类别:
    省市级项目
隐秘重组信号序列INT-RSS在T细胞受体基因Tcra重排中的功能和机制研究
  • 批准号:
    32370939
  • 批准年份:
    2023
  • 资助金额:
    50 万元
  • 项目类别:
    面上项目
HPV16 E7 通过 Int1 蛋白调控 Wnt 信号通路调节肿瘤局部树突状细胞活性
  • 批准号:
    LQ22H160033
  • 批准年份:
    2021
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
选择性PPARγ激动剂INT131调控适应性产热和AD-MSCs分化成棕色样脂肪细胞的机制研究
  • 批准号:
    81903680
  • 批准年份:
    2019
  • 资助金额:
    20.0 万元
  • 项目类别:
    青年科学基金项目
INT复合物调节U snRNA 3'加工的结构基础
  • 批准号:
    31800624
  • 批准年份:
    2018
  • 资助金额:
    28.0 万元
  • 项目类别:
    青年科学基金项目
沉默Int6基因的骨髓间充质干细胞复合生物支架构建血管化腹股沟疝补片及其促补片血管化机制
  • 批准号:
    81371698
  • 批准年份:
    2013
  • 资助金额:
    70.0 万元
  • 项目类别:
    面上项目
HIF/Int6调控迟发型EPC体外增殖的机制及其治疗重度子痫前期的可行性
  • 批准号:
    81100439
  • 批准年份:
    2011
  • 资助金额:
    22.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

SCH: INT: Collaborative Research: An Intelligent Pervasive Augmented reaLity therapy (iPAL) for Opioid Use Disorder and Recovery
SCH:INT:合作研究:针对阿片类药物使用障碍和恢复的智能普遍增强现实疗法 (iPAL)
  • 批准号:
    2343183
  • 财政年份:
    2023
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: DeepSense: Interpretable Deep Learning for Zero-effort Phenotype Sensing and Its Application to Sleep Medicine
SCH:INT:合作研究:DeepSense:零努力表型感知的可解释深度学习及其在睡眠医学中的应用
  • 批准号:
    2313481
  • 财政年份:
    2022
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: Context-Adaptive Multimodal Informatics for Psychiatric Discharge Planning
SCH:INT:合作研究:用于精神病出院计划的上下文自适应多模态信息学
  • 批准号:
    10573225
  • 财政年份:
    2021
  • 资助金额:
    $ 69.3万
  • 项目类别:
SCH: INT: Collaborative Research: Context-Adaptive Multimodal Informatics for Psychiatric Discharge Planning
SCH:INT:合作研究:用于精神病出院计划的上下文自适应多模态信息学
  • 批准号:
    10392429
  • 财政年份:
    2021
  • 资助金额:
    $ 69.3万
  • 项目类别:
SCH: INT: Collaborative Research: Using Multi-Stage Learning to Prioritize Mental Health
SCH:INT:协作研究:利用多阶段学习优先考虑心理健康
  • 批准号:
    2124270
  • 财政年份:
    2021
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: Privacy-Preserving Federated Transfer Learning for Early Acute Kidney Injury Risk Prediction
SCH:INT:合作研究:用于早期急性肾损伤风险预测的隐私保护联合迁移学习
  • 批准号:
    2014554
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: Privacy-Preserving Federated Transfer Learning for Early Acute Kidney Injury Risk Prediction
SCH:INT:合作研究:用于早期急性肾损伤风险预测的隐私保护联合迁移学习
  • 批准号:
    2014552
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: An intelligent Pervasive Augmented reaLity therapy (iPAL) for Opioid Use Disorder and Recovery
SCH:INT:合作研究:针对阿片类药物使用障碍和恢复的智能普遍增强现实疗法 (iPAL)
  • 批准号:
    2019389
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: An intelligent Pervasive Augmented reaLity therapy (iPAL) for Opioid Use Disorder and Recovery
SCH:INT:合作研究:针对阿片类药物使用障碍和恢复的智能普遍增强现实疗法 (iPAL)
  • 批准号:
    2013651
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
SCH: INT: Collaborative Research: An Intelligent Pervasive Augmented reaLity therapy (iPAL) for Opioid Use Disorder and Recovery
SCH:INT:合作研究:针对阿片类药物使用障碍和恢复的智能普遍增强现实疗法 (iPAL)
  • 批准号:
    2013122
  • 财政年份:
    2020
  • 资助金额:
    $ 69.3万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了