Realistic quantification of potential privacy loss from genomic summary results

从基因组摘要结果中实际量化潜在隐私损失

基本信息

  • 批准号:
    10229615
  • 负责人:
  • 金额:
    $ 1.37万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-08-05 至 2022-01-02
  • 项目状态:
    已结题

项目摘要

PROJECT ABSTRACT With the surge of large genomics data, there is an immense increase in the breadth and depth of different genomics datasets and an increasing importance in the topic of privacy of individuals in genomic data science. Detailed genetic and environmental characterization of diseases and conditions relies on the large-scale mining of genotype-phenotype relationships; hence, there is great desire to share data as broadly as possible. The recent change in NIH policy of sharing genomic summary results is a great step towards making the data available to broader researchers. However, privacy studies inferring study participations is outdated compared to the pace of the technological advancements in genome sequencing. A key first step in reducing private information leakage is to measure the amount of information leakage, particularly under different scenarios. To this end, we propose to derive information- theoretic measures for private information leakage in different genomic data sharing scenarios, especially when the datasets are noisy and incomplete. We will also develop various risk assessment tools. We will approach the privacy analysis under three aims. First, we will develop statistical metrics that can be used to quantify the sensitive information leakage in different data sharing scenarios as well as under the conditions when the genotype data is imperfect. We will systematically analyze the risk of inference of study participation of a patient. Second, we will design a plausible privacy attack through an experimental study, in which different technologies will be used to sequence genomes from trace amount of samples such as touch objects or used glasses. This will allow us to study the plausible scenarios of surreptious DNA testing and its effect on genomic data sharing. Third, we will develop risk assessment tools for sharing genomic summary results. These tools will simulate hundreds of scenarios learned through simulations in aim 1 and real-life privacy attacks in aim 2 to quantify the risks before the release of the data. These tools will be implemented using cryptographic techniques to further reduce the private information leakage during risk assessment step. During the K99 phase, the aim of this project is to find minimum amount of genotyping information required and maximum amount of noise tolerated for detection of a genome in a mixture using simulations and wet-lab experiments. To accomplish this research goal, the K99 phase will involve training in molecular biology, genomics and privacy. This training will take place at Yale University in the department of Molecular Biophysics and Biochemistry, under the mentorship of Dr. Mark Gerstein (genomics and privacy) and Dr. Andrew Miranker (molecular biology). Building on the training during the K99, the goal of the R00 phase will be simulation of the results of the experimental training to increase the sample size and building privacy risk assessment tools with the results learned from the experiment and simulations and implementation of such tools using cryptographic techniques.
项目摘要 随着大量基因组学数据的激增,不同基因组学的广度和深度都有了巨大的增长。 基因组学数据集和基因组数据科学中个人隐私主题的重要性日益增加。 疾病和病症的详细遗传和环境特征依赖于大规模的 基因型-表型关系的挖掘;因此,非常希望尽可能广泛地共享数据。 NIH最近改变了分享基因组总结结果的政策,这是朝着使数据 提供给更广泛的研究人员。然而,隐私研究推断学习参与是过时的相比, 基因组测序技术进步的步伐。减少私营部门的关键第一步 信息泄漏是衡量信息泄漏的数量,特别是在不同的场景下。到 为此,我们提出了在不同的隐私信息泄漏的信息理论措施, 基因组数据共享场景,特别是当数据集是嘈杂和不完整的。我们还将开发 各种风险评估工具。我们将在三个目标下进行隐私分析。首先,我们将发展 可用于量化不同数据共享中敏感信息泄漏的统计指标 场景以及在基因型数据不完善的条件下。我们将系统分析 推断患者参与研究的风险。第二,我们将设计一个合理的隐私攻击, 通过一项实验性研究,将使用不同的技术对来自微量元素的基因组进行测序, 样本量,如触摸物体或用过的眼镜。这将使我们能够研究 秘密DNA测试及其对基因组数据共享的影响。第三,我们将开发风险评估工具 分享基因组总结结果这些工具将模拟数百个场景, 目标1中的模拟和目标2中的现实隐私攻击,以便在数据发布之前量化风险。 这些工具将使用加密技术来实现,以进一步减少私人信息 风险评估步骤中的泄漏。 在K99阶段,该项目的目的是找到所需的最少量基因分型信息, 使用模拟和湿实验室检测混合物中基因组所能容忍的最大噪声量 实验为了实现这一研究目标,K99阶段将涉及分子生物学培训, 基因组学和隐私这次培训将在耶鲁大学的分子生物物理学系进行 和生物化学,在Mark Gerstein博士(基因组学和隐私)和Andrew Miranker博士的指导下 (分子生物学)。在K99培训的基础上,R 00阶段的目标是模拟 增加样本量和建立隐私风险评估工具的实验培训结果, 从实验和模拟以及使用加密的这种工具的实现中学习到的结果 技术.

项目成果

期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Data Sanitization to Reduce Private Information Leakage from Functional Genomics.
  • DOI:
    10.1016/j.cell.2020.09.036
  • 发表时间:
    2020-11-12
  • 期刊:
  • 影响因子:
    64.5
  • 作者:
    Gürsoy G;Emani P;Brannon CM;Jolanki OA;Harmanci A;Strattan JS;Cherry JM;Miranker AD;Gerstein M
  • 通讯作者:
    Gerstein M
FANCY: fast estimation of privacy risk in functional genomics data.
  • DOI:
    10.1093/bioinformatics/btaa661
  • 发表时间:
    2021-01-29
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Gürsoy G;Brannon CM;Navarro FCP;Gerstein M
  • 通讯作者:
    Gerstein M
Privacy-preserving genotype imputation with fully homomorphic encryption.
通过完全同态加密保护隐私的基因型插补。
  • DOI:
    10.1016/j.cels.2021.10.003
  • 发表时间:
    2022-02-16
  • 期刊:
  • 影响因子:
    9.3
  • 作者:
    Gürsoy G;Chielle E;Brannon CM;Maniatakos M;Gerstein M
  • 通讯作者:
    Gerstein M
Recovering genotypes and phenotypes using allele-specific genes.
  • DOI:
    10.1186/s13059-021-02477-x
  • 发表时间:
    2021-09-07
  • 期刊:
  • 影响因子:
    12.3
  • 作者:
    Gürsoy G;Lu N;Wagner S;Gerstein M
  • 通讯作者:
    Gerstein M
Fast and Scalable Private Genotype Imputation Using Machine Learning and Partially Homomorphic Encryption.
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Gamze Gursoy其他文献

Gamze Gursoy的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Gamze Gursoy', 18)}}的其他基金

Delineating the functional impact of recurrent repeat expansions in ALS using integrative multiomic analysis
使用综合多组学分析描述 ALS 中反复重复扩增的功能影响
  • 批准号:
    10776994
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
Tools to Address the Challenges of Preserving Privacy in Sharing and Analysis of Biomedical Data
应对生物医学数据共享和分析中保护隐私挑战的工具
  • 批准号:
    10708820
  • 财政年份:
    2022
  • 资助金额:
    $ 1.37万
  • 项目类别:
Realistic quantification of potential privacy loss from genomic summary results
从基因组摘要结果中实际量化潜在隐私损失
  • 批准号:
    10540473
  • 财政年份:
    2022
  • 资助金额:
    $ 1.37万
  • 项目类别:
Realistic quantification of potential privacy loss from genomic summary results
从基因组摘要结果中实际量化潜在隐私损失
  • 批准号:
    10616768
  • 财政年份:
    2022
  • 资助金额:
    $ 1.37万
  • 项目类别:
Realistic quantification of potential privacy loss from genomic summary results
从基因组摘要结果中实际量化潜在隐私损失
  • 批准号:
    10053985
  • 财政年份:
    2020
  • 资助金额:
    $ 1.37万
  • 项目类别:

相似海外基金

REU Site: Summer Undergraduate Research in Chemistry and Biochemistry at Miami University
REU 网站:迈阿密大学化学与生物化学暑期本科生研究
  • 批准号:
    2349468
  • 财政年份:
    2024
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Continuing Grant
REU Site: Research Experiences for Community College Students in Chemistry and Biochemistry at Texas A&M University-Commerce
REU 网站:德克萨斯 A 社区学院化学和生物化学专业学生的研究经验
  • 批准号:
    2349522
  • 财政年份:
    2024
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Standard Grant
Supporting Talented, Low-Income Undergraduate and Graduate Students in Chemistry and Biochemistry through Career Explorations, Research Experiences, and Scholarships
通过职业探索、研究经验和奖学金支持化学和生物化学领域有才华的低收入本科生和研究生
  • 批准号:
    2322722
  • 财政年份:
    2024
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Standard Grant
Conference: Active Learning Communities in Biochemistry
会议:生物化学主动学习社区
  • 批准号:
    2411535
  • 财政年份:
    2024
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Standard Grant
Hydrogen and carbon dioxide biochemistry in the bacterial energy-transducing membrane.
细菌能量转换膜中的氢气和二氧化碳生物化学。
  • 批准号:
    BB/Y004302/1
  • 财政年份:
    2024
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Research Grant
Biochemistry of Eukaryotic Replication Fork and DNA Repair
真核复制叉的生物化学和 DNA 修复
  • 批准号:
    10550045
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
ORCC: The Role of Betaine Lipid Biochemistry in Coral Thermal Tolerance
ORCC:甜菜碱脂质生物化学在珊瑚耐热性中的作用
  • 批准号:
    2307516
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Continuing Grant
Supporting low-income student success in STEM through community, mentoring, and immersive research in biology and biochemistry
通过生物学和生物化学领域的社区、指导和沉浸式研究,支持低收入学生在 STEM 方面取得成功
  • 批准号:
    2221216
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
    Standard Grant
In vivo 2-photon imaging of retinal biochemistry before and after retinal organoid transplantation
视网膜类器官移植前后视网膜生物化学的体内2光子成像
  • 批准号:
    10643273
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
Biochemistry of Platelet Desialylation
血小板脱唾液酸化的生物化学
  • 批准号:
    10833844
  • 财政年份:
    2023
  • 资助金额:
    $ 1.37万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了