Subtyping complex phenotypes via constrastive learning by leveraging electronic health records
利用电子健康记录通过对比学习对复杂表型进行亚型分类
基本信息
- 批准号:10799083
- 负责人:
- 金额:$ 42.84万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-09-22 至 2025-08-31
- 项目状态:未结题
- 来源:
- 关键词:AddressAsthmaBreast Cancer PatientCharacteristicsClassificationClinicalComplexDataDatabasesDementiaDiabetes MellitusDiagnosisDimensionsDiseaseDrug InteractionsElectronic Health RecordEquityEstrogen ReceptorsEthnic OriginGeneticGenomicsGraves&apos DiseaseHashimoto DiseaseHealth SurveysHealthcareHeterogeneityHypertensionHyperthyroidismHypothyroidismIndividualInsulinLearningLocationMedicalMedicineMental disordersMethodsModelingObesityPatientsPatternPhenotypePopulationPopulation HeterogeneityPredispositionProbabilityProceduresProcessPrognosisRaceReportingRisk FactorsSourceStratificationStructureSurfaceTechniquesTimeTrainingUndifferentiatedVariantautoimmune thyroid diseasecell typeclinical heterogeneityclinical riskclinically relevantcohortcomputer frameworkdisorder subtypeestrophilinhigh dimensionalityimprovedlearning strategymultidimensional dataneoplastic cellnovelnovel strategiesphenotypic dataportabilityprecision medicinepredictive modelingpublic health relevanceracial populationresponsesuccess
项目摘要
Summary
A critical step towards realizing the promise of precision medicine is the identification of biologically- and
clinically-relevant disease subtypes. Disease subtypes are suspected yet unknown or not fully characterized
for many conditions, including obesity, diabetes, hypertension, asthma, dementia, and psychiatric disorders.
The existence of “phenotypic heterogeneity” has practical and clinical implications: undifferentiated cases of a
disease may represent the action of a variety of underlying causal processes, each of which may have a
different prognosis or respond to a different treatment. Existing phenotype subtyping methods predominantly
rely on the idea that applying clustering or dimensionality reduction techniques to high-dimensional data from
patients with a given condition may reveal explanatory patterns that correspond to disease subtypes. This
implicitly assumes that biologically meaningful subtypes can be captured by the dominant axes of variation in
the data. Yet, the most dominant sources of variation are expected to be independent of biologically
meaningful subtypes in many settings. In this project, a novel contrastive learning method is proposed for
learning a heterogeneity gradient of variation that is specific to cases of a given condition and cannot be found
in matched controls. Electronic health records (EHR) and survey information from the rich All of Us database is
expected to span the spectrum of clinical heterogeneity across common complex diseases, which can inform
the proposed method about meaningful sub-phenotypic variation for many diseases. The subtypes identified
will be evaluated within the All of Us database and replicated using three external EHR cohorts for subtype-
specific genetic effects, clinical risk factors, and clinical trajectories. Finally, EHR-based models are notoriously
known for their susceptibility to poor generalization on out-of-distribution data that represent locations,
populations, medical practices, or other factors that were not represented in the training data. This challenge
will be addressed by developing a domain generalization framework, which will allow learning disease
subtypes that are generalizable across demographic characteristics, including location, ancestry, ethnicity, and
race, which is essential to achieve equitable precision medicine and facilitate the integration of predictive
models in healthcare pipelines.
概括
实现精确医学承诺的关键一步是鉴定生物学上和
临床上与疾病相关的亚型。怀疑疾病亚型但未知或未完全表征
对于许多情况,包括肥胖,糖尿病,高血压,哮喘,痴呆和精神疾病。
“表型异质性”的存在具有实际和临床意义:未分化的病例
疾病可能代表各种基本因果过程的作用,每种过程可能都有
不同的预后或对不同治疗的反应。现有的表型亚型方法主要是
依靠这样的想法,即将群集或降低降低技术应用于高维数据
给定病情的患者可能揭示与疾病亚型相对应的剥夺模式。这
隐含地假设生物学上有意义的亚型可以通过主要的变化轴捕获
数据。然而,最主要的变异来源有望独立于生物学上
在许多设置中有意义的亚型。在这个项目中,提出了一种新颖的对比学习方法
学习特定于给定条件案例的异质性变异梯度,找不到
在匹配的控件中。电子健康记录(EHR)和来自我们所有数据库的富人的调查信息是
预计将跨越普通复杂疾病的临床异质性的范围,这可以告知
关于许多疾病的有意义的亚表型变异的建议方法。确定的子类型
将在我们所有的数据库中进行评估,并使用三个外部EHR队列进行子类型的复制
特定的遗传作用,临床危险因素和临床轨迹。最后,基于EHR的模型是众所周知的
以其对代表位置的分布数据的概括不佳的敏感性而闻名的
培训数据中未代表的人群,医疗实践或其他因素。这个挑战
将通过开发域泛化框架来解决,这将允许学习疾病
在人口特征上可以推广的亚型,包括位置,祖先,种族和
种族,这对于获得公平的精确医学和促进预测的整合至关重要
医疗管道中的模型。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Elior Rahmani其他文献
Elior Rahmani的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似国自然基金
PFKFB3介导的糖酵解通过诱导气道上皮细胞功能失调加重哮喘气道炎症和气道重塑的机制
- 批准号:82300041
- 批准年份:2023
- 资助金额:20 万元
- 项目类别:青年科学基金项目
NGF通过上调ASIC1a表达激活AMPK/FoxO3a信号通路促进气道上皮细胞自噬诱导支气管哮喘的机制研究
- 批准号:82360008
- 批准年份:2023
- 资助金额:32 万元
- 项目类别:地区科学基金项目
外泌体介导肥大细胞类糜蛋白酶调控哮喘气道平滑肌增殖和机制研究
- 批准号:82370033
- 批准年份:2023
- 资助金额:48 万元
- 项目类别:面上项目
CD207阳性树突状细胞在过敏性哮喘气道炎症中作用及机制研究
- 批准号:82370035
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
Neuropilin-1活化肺2型固有淋巴样细胞促进肺哮喘发生的机制研究
- 批准号:32300727
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Addressing the Biology of Health Disparities by Targeting Geographical Ancestry-driven Variants of Immunity
通过针对地理血统驱动的免疫变异来解决健康差异的生物学问题
- 批准号:
10625430 - 财政年份:2021
- 资助金额:
$ 42.84万 - 项目类别:
The Impact of Type II IL4R Signaling on Breast Cancer Brain Metastasis
II 型 IL4R 信号传导对乳腺癌脑转移的影响
- 批准号:
10454820 - 财政年份:2021
- 资助金额:
$ 42.84万 - 项目类别:
The Impact of Type II IL4R Signaling on Breast Cancer Brain Metastasis
II 型 IL4R 信号传导对乳腺癌脑转移的影响
- 批准号:
10673159 - 财政年份:2021
- 资助金额:
$ 42.84万 - 项目类别:
Addressing the Biology of Health Disparities by Targeting Geographical Ancestry-driven Variants of Immunity
通过针对地理血统驱动的免疫变异来解决健康差异的生物学问题
- 批准号:
10273703 - 财政年份:2021
- 资助金额:
$ 42.84万 - 项目类别:
Addressing the Biology of Health Disparities by Targeting Geographical Ancestry-driven Variants of Immunity
通过针对地理血统驱动的免疫变异来解决健康差异的生物学问题
- 批准号:
10491138 - 财政年份:2021
- 资助金额:
$ 42.84万 - 项目类别: