Collaborative Research: III: Medium: Knowledge discovery from highly heterogeneous, sparse and private data in biomedical informatics
合作研究:III:中:生物医学信息学中高度异构、稀疏和私有数据的知识发现
基本信息
- 批准号:2312863
- 负责人:
- 金额:$ 32.05万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2023
- 资助国家:美国
- 起止时间:2023-09-01 至 2027-08-31
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
In the United States, millions of people have chronic conditions, including Type 2 Diabetes and Heart Failure. It is important to screen patients for these illnesses as soon as possible. This research aims at mining health care data to find patients likely to develop these conditions and to develop a model for opportunistic screening in situations where the encounter with the patient may be unrelated to the specific diagnosis. Opportunistic screening is needed especially for minority and lower socio-economic status patients, who are less likely to seek regular care from primary care providers. This research will address many challenges. First, health records include different types of data, from text to numeric values, from continuous signals to images. Second, records comprise information collected at different timepoints, and with different frequencies: some patients may be seen once a year, and others, every few days. Third, the privacy of patients must be protected. Fourth, automatically derived models must be fair and unbiased, especially towards underprivileged groups. Finally, many powerful current Machine Learning models behave like black boxes: these models will be adopted in healthcare and other critical areas only if their conclusions can be explained. From a societal point of view, this project has the potential to positively impact the health of millions of people, and in particular, of minority and lower socio-economic status patients. As concerns education, this research will recruit underrepresented students at the University of Illinois Chicago, a federally-designated Minority-Serving Institution, and support the interdisciplinary development of a diverse cohort of PhD and undergraduate students. This project will explore new Machine Learning (ML) and Natural Language Processing approaches to uncover the earliest point in temporal sequence data, in which a patient can be screened for a certain chronic condition. The research will develop novel methods to integrate heterogeneous data, which features missing values and noise; de-identification approaches to protect privacy; new approaches to concept and temporal relation extraction; algorithms to improve fairness by addressing data heterogeneity and missing data; exploration of concept-level explainability. A robust assessment plan is an integral part of the proposed research. First, all algorithms will be evaluated according to current ML methodology. Additionally, a human-in-the-loop approach will be employed, in which the clinicians on the team will provide informal and formal evaluation of the algorithm predictions. The methods this research will uncover are likely applicable to other domains where heterogeneous, incomplete, identifiable, or biased temporal sequence data exist, for example predicting youth at risk, water resource monitoring, and supporting food safety.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
在美国,数百万人患有慢性疾病,包括2型糖尿病和心力衰竭。尽早对患者进行这些疾病的筛查是很重要的。本研究旨在挖掘医疗保健数据,以发现可能发展这些条件的患者,并在遇到患者可能与具体诊断无关的情况下开发机会性筛查模型。机会性筛查尤其需要少数民族和社会经济地位较低的患者,他们不太可能向初级保健提供者寻求定期护理。这项研究将解决许多挑战。首先,健康记录包括不同类型的数据,从文本到数值,从连续信号到图像。其次,记录包括在不同时间点和不同频率收集的信息:一些患者可能一年一次,而另一些则每隔几天一次。第三,必须保护患者的隐私。第四,自动导出的模型必须是公平和无偏见的,尤其是对弱势群体。最后,当前许多强大的机器学习模型表现得像黑盒子:只有在结论可以解释的情况下,这些模型才会被应用于医疗保健和其他关键领域。从社会角度来看,该项目有可能对数百万人的健康产生积极影响,特别是少数民族和社会经济地位较低的患者。在教育方面,这项研究将在伊利诺伊大学芝加哥分校招收代表性不足的学生,这是一所联邦政府指定的少数民族服务机构,并支持不同群体的博士和本科生的跨学科发展。该项目将探索新的机器学习(ML)和自然语言处理方法,以发现时间序列数据中的最早点,在这个点上,可以对患者进行某种慢性疾病的筛查。本研究将开发新的方法来整合具有缺失值和噪声特征的异构数据;保护隐私的去识别方法;概念和时间关系提取的新方法通过解决数据异构和缺失数据来提高公平性的算法;探索概念层面的可解释性。一个强有力的评估计划是拟议研究的一个组成部分。首先,所有算法将根据当前的机器学习方法进行评估。此外,将采用“人在循环”的方法,团队中的临床医生将对算法预测提供非正式和正式的评估。本研究将揭示的方法可能适用于存在异质性、不完整、可识别或有偏差的时间序列数据的其他领域,例如预测青年风险、水资源监测和支持食品安全。该奖项反映了美国国家科学基金会的法定使命,并通过使用基金会的知识价值和更广泛的影响审查标准进行评估,被认为值得支持。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mudassir Rashid其他文献
Prior-knowledge-embedded model predictive control for blood glucose regulation: Towards efficient and safe artificial pancreas
用于血糖调节的先验知识嵌入模型预测控制:迈向高效、安全的人工胰腺
- DOI:
10.1016/j.bspc.2022.104551 - 发表时间:
2023-04 - 期刊:
- 影响因子:5.1
- 作者:
Xiaoyu Sun;Ali Cinar;Jianchang Liu;Mudassir Rashid;Xia Yu - 通讯作者:
Xia Yu
Mudassir Rashid的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mudassir Rashid', 18)}}的其他基金
Collaborative Research: Designing Minimal Synthetic Cells Capable of Sensing and Self-Manipulation via Tunable Self-Assembly
合作研究:设计能够通过可调自组装进行传感和自我操纵的最小合成细胞
- 批准号:
2123593 - 财政年份:2021
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
相似国自然基金
Research on Quantum Field Theory without a Lagrangian Description
- 批准号:24ZR1403900
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
Cell Research
- 批准号:31224802
- 批准年份:2012
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research
- 批准号:31024804
- 批准年份:2010
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Cell Research (细胞研究)
- 批准号:30824808
- 批准年份:2008
- 资助金额:24.0 万元
- 项目类别:专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
- 批准号:10774081
- 批准年份:2007
- 资助金额:45.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: Conference: DESC: Type III: Eco Edge - Advancing Sustainable Machine Learning at the Edge
协作研究:会议:DESC:类型 III:生态边缘 - 推进边缘的可持续机器学习
- 批准号:
2342498 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: Conference: DESC: Type III: Eco Edge - Advancing Sustainable Machine Learning at the Edge
协作研究:会议:DESC:类型 III:生态边缘 - 推进边缘的可持续机器学习
- 批准号:
2342497 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
III : Medium: Collaborative Research: From Open Data to Open Data Curation
III:媒介:协作研究:从开放数据到开放数据管理
- 批准号:
2420691 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
协作研究:III:小型:现代数据库系统的高性能调度
- 批准号:
2322973 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: High-Performance Scheduling for Modern Database Systems
协作研究:III:小型:现代数据库系统的高性能调度
- 批准号:
2322974 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
合作研究:III:小型:一个梦想的主动对话系统
- 批准号:
2336769 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Small: A DREAM Proactive Conversational System
合作研究:III:小型:一个梦想的主动对话系统
- 批准号:
2336768 - 财政年份:2024
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Designing AI Systems with Steerable Long-Term Dynamics
合作研究:III:中:设计具有可操纵长期动态的人工智能系统
- 批准号:
2312865 - 财政年份:2023
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
合作研究:III:媒介:算法排序器的负责任设计和验证
- 批准号:
2312932 - 财政年份:2023
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant
Collaborative Research: III: Medium: Algorithms for scalable inference and phylodynamic analysis of tumor haplotypes using low-coverage single cell sequencing data
合作研究:III:中:使用低覆盖率单细胞测序数据对肿瘤单倍型进行可扩展推理和系统动力学分析的算法
- 批准号:
2415562 - 财政年份:2023
- 资助金额:
$ 32.05万 - 项目类别:
Standard Grant