Improving AI/ML-readiness of Synthetic Data in a Resource-Constrained Setting

在资源受限的环境中提高合成数据的 AI/ML 准备度

基本信息

  • 批准号:
    10841728
  • 负责人:
  • 金额:
    $ 25.44万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2021
  • 资助国家:
    美国
  • 起止时间:
    2021-09-15 至 2026-06-30
  • 项目状态:
    未结题

项目摘要

PROJECT SUMMARY / ABSTRACT The parent project, UZIMA-DS (UtiliZing Health Information for Meaningful Impact in East Africa through Data Science), aims to create a scalable and sustainable platform to apply novel approaches to data assimilation and advanced artificial intelligence (AI)/machine learning (ML)-based methods to improve health outcomes in two health domains: maternal, newborn and child health; and mental health. Led by the Aga Khan University in East Africa (AKU) and the University of Michigan, UZIMA-DS is a U54 Research Hub funded under the NIH Data Sci- ence for Health Discovery and Innovation in Africa Initiative. During these first two years, UZIMA-DS has focused on acquiring and harmonizing multimodal data sources. However, we and many other DS-I Africa awardees have encountered several barriers to efficiently and effectively creating AI-ready data sets, which include: 1) regulatory concerns around privacy and confidentiality, 2) heterogeneity in data laws across countries limiting the accessibil- ity of data, and 3) a lack of sufficient datasets not only for training ML models and validation but also for training students and early career investigators for capacity building. Synthetic data, or data that is generated artificially using computational techniques such as AI, is a promising technique that could address these barriers and ena- ble the broad sharing of AI-ready data sets. As part of this administrative supplement, we propose to create an AI-ready synthetic data set using one of our real UZIMA-DS data sets from Kenya: the Kaloleni-Rabai Health and Demographic Surveillance Systems (KRHDSS). KRHDSS is a population-based demographic and health surveil- lance system established in 2017 by AKU. Information is collected at least annually on ~40 demographic, health, social determinants of disease, and vital events from a resident population of about 99,000 individuals. Leverag- ing our preliminary work using a Microsoft Azure instance, we will create AI-ready synthetic datasets for research and training and evaluate whether causal relationships in real data are preserved in synthetic datasets. The overarching goal of this proposal is to “put data to work” by developing a roadmap for the curation and use of AI-ready synthetic data using FAIR principles (findable, accessible, interoperable, and re‑usable) that can be eas- ily accessed and shared for research and training purposes across the globe. Ultimately, this work has the poten- tial to promote more effective and efficient sharing of AI-ready data globally. Using cloud infrastructure and Health and Demographic Surveillance Systems data from rural Kenya as a use case, this work has immediate implications for how AI-ready data can be leveraged in resource-constrained settings to improve data driven health policy decisions for traditionally disadvantaged and marginalized groups.
项目摘要/摘要 上级项目UZIMA-DS(利用卫生信息通过数据在东非产生有意义的影响 科学),旨在创建一个可扩展和可持续的平台,以应用新的方法进行数据同化和 基于高级人工智能(AI)/机器学习(ML)的方法在两个方面改善健康结果 健康领域:孕产妇、新生儿和儿童健康;以及精神健康。由东部的阿加汗大学领导 非洲(AKU)和密歇根大学UZIMA-DS是由NIH数据科学资助的U54研究中心- 非洲卫生发现和创新倡议。在最初的两年里,UZIMA-DS专注于 关于多模式数据源的获取和协调。然而,我们和许多其他DS-I非洲奖获得者 在高效有效地创建支持人工智能的数据集方面遇到了几个障碍,其中包括:1)监管 对隐私和机密性的担忧,2)各国数据法律的异质性限制了访问- 数据的可靠性,以及3)缺乏足够的数据集,不仅用于训练ML模型和验证,而且还用于训练 学生和早期职业调查人员的能力建设。合成数据,或人工生成的数据 使用人工智能等计算技术是一种很有前途的技术,可以解决这些障碍和环境问题。 广泛共享支持人工智能的数据集。作为这项行政补充的一部分,我们建议创建一个 使用我们来自肯尼亚的一个真实的UZIMA-DS数据集:Kaloleni-Rabai Health和 人口监测系统(KRHDSS)。KRHDSS是一个以人口为基础的人口和健康监测机构- AKU于2017年建立了LANCE系统。至少每年收集约40人的人口、健康、 疾病的社会决定因素,以及来自常住人口约99,000人的重要事件。杠杆化- 在使用Microsoft Azure实例完成前期工作后,我们将创建用于研究的人工智能就绪的合成数据集 以及训练和评估真实数据中的因果关系是否被保存在合成数据集中。这个 这项提案的首要目标是通过制定管理和使用数据的路线图来“使数据发挥作用” 使用公平原则(可查找、可访问、可互操作和可重复使用)的人工智能准备好的合成数据,可以很容易地- 在全球范围内出于研究和培训目的方便地访问和共享。归根结底,这项工作有潜力- TAL,以促进更有效和高效地在全球共享人工智能就绪的数据。使用云基础设施和 来自肯尼亚农村的健康和人口监测系统数据作为用例,这项工作立即 对如何在资源受限的环境中利用AI就绪数据来改进数据驱动的影响 为传统上处于不利地位和边缘化群体的卫生政策决策。

项目成果

期刊论文数量(16)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Obesity and Risk of Hypertension in Preadolescent Urban School Children: Insights from a Developing Country.
青春期前城市学童的肥胖和高血压风险:来自发展中国家的见解。
  • DOI:
    10.21203/rs.3.rs-4213965/v1
  • 发表时间:
    2024
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Akhtar,Samina;Khan,Shahid;Aziz,Namra;Magsi,MuhammadImran;Samad,Zainab;Iqbal,Romaina;Almas,Aysha
  • 通讯作者:
    Almas,Aysha
COVID-19 vaccination refusal trends in Kenya over 2021.
  • DOI:
    10.1016/j.vaccine.2022.12.066
  • 发表时间:
    2023-01-27
  • 期刊:
  • 影响因子:
    5.5
  • 作者:
    Rego, Ryan T.;Kenney, Brooke;Ngugi, Anthony K.;Espira, Leon;Orwa, James;Siwo, Geoffrey H.;Sefa, Christabel;Shah, Jasmit;Weinheimer-Haus, Eileen;Delius, Antonia Johanna Sophie;Pape, Utz Johann;Irfan, Furqan B.;Abubakar, Amina;Shah, Reena;Wagner, Abram;Kolars, Joseph;Boulton, Matthew L.;Hofer, Timothy;Waljee, Akbar K.
  • 通讯作者:
    Waljee, Akbar K.
Mental health and psychological well-being of Kenyan adolescents from Nairobi and the Coast regions in the context of COVID-19.
  • DOI:
    10.1186/s13034-023-00613-y
  • 发表时间:
    2023-05-19
  • 期刊:
  • 影响因子:
    5.6
  • 作者:
    Mbithi, Gideon;Mabrouk, Adam;Sarki, Ahmed;Odhiambo, Rachel;Namuguzi, Mary;Dzombo, Judith Tumaini;Atukwatse, Joseph;Kabue, Margaret;Mwangi, Paul;Abubakar, Amina
  • 通讯作者:
    Abubakar, Amina
Comparison of logistic regression with regularized machine learning methods for the prediction of tuberculosis disease in people living with HIV: cross-sectional hospital-based study in Kisumu County, Kenya.
逻辑回归与正则化机器学习方法预测艾滋病毒感染者结核病的比较:肯尼亚基苏木县医院的横断面研究。
  • DOI:
    10.21203/rs.3.rs-3354948/v1
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Orwa,James;Oduor,Patience;Okelloh,Douglas;Gethi,Dickson;Agaya,Janet;Okumu,Albert;Wandiga,Steve
  • 通讯作者:
    Wandiga,Steve
Psychometric evaluation of PHQ-9 and GAD-7 among community health volunteers and nurses/midwives in Kenya following a nation-wide telephonic survey.
  • DOI:
    10.3389/fpsyt.2023.1123839
  • 发表时间:
    2023
  • 期刊:
  • 影响因子:
    4.7
  • 作者:
    Odero, Sabina Adhiambo;Mwangi, Paul;Odhiambo, Rachel;Nzioka, Brenda Mumbua;Shumba, Constance;Ndirangu-Mugo, Eunice;Abubakar, Amina
  • 通讯作者:
    Abubakar, Amina
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Amina Abubakar Ali其他文献

Amina Abubakar Ali的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Amina Abubakar Ali', 18)}}的其他基金

2/3 Akili: Phenotypic and genetic characterization of ADHD in Kenya and South Africa
2/3 Akili:肯尼亚和南非 ADHD 的表型和遗传特征
  • 批准号:
    10637187
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Eneza Data Science: Enhancing Data Science Capability and Tools for Health in East Africa
Eneza 数据科学:增强东非健康领域的数据科学能力和工具
  • 批准号:
    10713044
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
UZIMA-DS: UtiliZing health Information for Meaningful impact in East Africa through Data Science
UZIMA-DS:通过数据科学利用健康信息对东非产生有意义的影响
  • 批准号:
    10490293
  • 财政年份:
    2021
  • 资助金额:
    $ 25.44万
  • 项目类别:
UZIMA-DS: UtiliZing health Information for Meaningful impact in East Africa through Data Science
UZIMA-DS:通过数据科学利用健康信息对东非产生有意义的影响
  • 批准号:
    10659241
  • 财政年份:
    2021
  • 资助金额:
    $ 25.44万
  • 项目类别:
UZIMA-DS: UtiliZing health Information for Meaningful impact in East Africa through Data Science
UZIMA-DS:通过数据科学利用健康信息对东非产生有意义的影响
  • 批准号:
    10314084
  • 财政年份:
    2021
  • 资助金额:
    $ 25.44万
  • 项目类别:

相似海外基金

Proton-secreting epithelial cells as key modulators of epididymal mucosal immunity - Administrative Supplement
质子分泌上皮细胞作为附睾粘膜免疫的关键调节剂 - 行政补充
  • 批准号:
    10833895
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
A Longitudinal Qualitative Study of Fentanyl-Stimulant Polysubstance Use Among People Experiencing Homelessness (Administrative supplement)
无家可归者使用芬太尼兴奋剂多物质的纵向定性研究(行政补充)
  • 批准号:
    10841820
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
StrokeNet Administrative Supplement for the Funding Extension
StrokeNet 资助延期行政补充文件
  • 批准号:
    10850135
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
2023 NINDS Landis Mentorship Award - Administrative Supplement to NS121106 Control of Axon Initial Segment in Epilepsy
2023 年 NINDS 兰迪斯指导奖 - NS121106 癫痫轴突初始段控制的行政补充
  • 批准号:
    10896844
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Biomarkers of Disease in Alcoholic Hepatitis Administrative Supplement
酒精性肝炎行政补充剂中疾病的生物标志物
  • 批准号:
    10840220
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Administrative Supplement: Life-Space and Activity Digital Markers for Detection of Cognitive Decline in Community-Dwelling Older Adults: The RAMS Study
行政补充:用于检测社区老年人认知衰退的生活空间和活动数字标记:RAMS 研究
  • 批准号:
    10844667
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Administrative Supplement: Improving Inference of Genetic Architecture and Selection with African Genomes
行政补充:利用非洲基因组改进遗传结构的推断和选择
  • 批准号:
    10891050
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Power-Up Study Administrative Supplement to Promote Diversity
促进多元化的 Power-Up 研究行政补充
  • 批准号:
    10711717
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Administrative Supplement for Peer-Delivered and Technology-Assisted Integrated Illness Management and Recovery
同行交付和技术辅助的综合疾病管理和康复的行政补充
  • 批准号:
    10811292
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
Administrative Supplement: Genome Resources for Model Amphibians
行政补充:模型两栖动物基因组资源
  • 批准号:
    10806365
  • 财政年份:
    2023
  • 资助金额:
    $ 25.44万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了