Data-driven Computational Modeling and Refinement of Protein Structures on Genomic Scales

数据驱动的计算建模和基因组尺度蛋白质结构的细化

基本信息

  • 批准号:
    10029150
  • 负责人:
  • 金额:
    $ 36.21万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2020
  • 资助国家:
    美国
  • 起止时间:
    2020-09-15 至 2025-07-31
  • 项目状态:
    未结题

项目摘要

PROJECT SUMMARY/ABSTRACT: A key remaining gap in our understanding of biological systems at the molecular level is how to structurally annotate the “dark” protein families—the portion of protein families unsolved by experimental structure determination techniques and inaccessible to homology modeling. Nearly a quarter of protein families are currently dark, where molecular conformation is completely unknown and this gap is likely to expand further with the rapid accumulation of new protein sequences without annotated structures. The key challenge is now how to bridge this gap to gain a comprehensive understanding of biology and disease, thereby paving the way to structure-based drug design at genomic scale. Computational protein modeling plays a key role in this effort due to its scalability and genome-wide applicability. My laboratory focuses on the development and application of novel data-driven computational modeling and refinement methods to increase accuracy and coverage of protein structure prediction on genomic scale irrespective of homology. Future research focuses on improving homology-free protein folding using multiscale de novo modeling driven by deep learning-based inter-residue interactions, enhancing low-homology threading or fold recognition by formulating new algorithms for remote template identification despite low evolutionary relatedness, and developing methods for high-resolution restrained structure refinement guided by generalized ensemble search for driving computational models to near-experimental accuracy. Proteome-wide computational modeling and refinement effort will be conducted, leveraging our unique access to large-scale supercomputing infrastructure, to build high-confidence models covering the dark protein families, which will be organized in a database for public access. This comprehensive database of structural annotations will shed light on the structures, functions, and interactions of the dark proteome, with broad implications in drug discovery and human health. Software and web servers will be freely disseminated to help worldwide community of biomedical researchers to apply these methods to their specific research problems, thus multiplying the impact of computational modeling on basic research in biology and medicine. My research program will involve close collaborations with other NIGMS-supported investigators, create training opportunities for the next generation of researchers including members from underrepresented groups, and foster future research advances in structural bioinformatics and computational biology.
项目摘要/摘要: 在我们对分子水平的生物系统的理解中,一个关键的剩余差距是如何从结构上 诠释“暗”蛋白质家族--实验结构未解决的蛋白质家族部分 确定技术和同源建模不可用。近四分之一的蛋白质家族是 目前是黑暗的,分子构象完全未知,这个缺口可能会进一步扩大 随着没有注释结构的新蛋白质序列的快速积累。关键的挑战是现在 如何弥合这一差距,以全面了解生物学和疾病,从而为 到基因组水平的基于结构的药物设计。计算蛋白质模型在这项工作中起着关键作用 由于它的可扩展性和全基因组的适用性。我的实验室专注于这一领域的开发和应用 新的数据驱动的计算建模和改进方法,以提高准确性和覆盖率 基因组水平上的蛋白质结构预测,与同源性无关。未来的研究重点是改进 基于深度学习的残基间多尺度从头建模的无同源蛋白质折叠 交互,通过制定新的远程算法来增强低同源性线程或折叠识别 尽管进化关联度较低的模板识别以及用于高分辨率的开发方法 广义集成搜索引导的约束结构精化驱动计算模型 接近实验精度。将进行蛋白质组范围的计算建模和改进工作, 利用我们对大规模超级计算基础设施的独特访问,构建高置信度模型 涵盖黑色蛋白质家族,将被组织在一个数据库中供公众查阅。这一全面的 结构注释数据库将阐明黑暗的结构、功能和相互作用 蛋白质组,在药物发现和人类健康方面具有广泛的影响。软件和Web服务器将免费 传播以帮助世界各地的生物医学研究人员将这些方法应用于他们的特定 研究问题,因此计算建模对生物学和基础研究的影响成倍增加 医药。我的研究计划将涉及与NIGMS支持的其他调查人员的密切合作, 为下一代研究人员创造培训机会,包括来自代表性不足的成员 并促进结构生物信息学和计算生物学的未来研究进展。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Debswapna Bhattacharya其他文献

Debswapna Bhattacharya的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Debswapna Bhattacharya', 18)}}的其他基金

Data-driven Computational Modeling and Refinement of Protein Structures on Genomic Scales
数据驱动的计算建模和基因组尺度蛋白质结构的细化
  • 批准号:
    10604529
  • 财政年份:
    2020
  • 资助金额:
    $ 36.21万
  • 项目类别:
Data-driven Computational Modeling and Refinement of Protein Structures on Genomic Scales
数据驱动的计算建模和基因组尺度蛋白质结构的细化
  • 批准号:
    10707069
  • 财政年份:
    2020
  • 资助金额:
    $ 36.21万
  • 项目类别:
Data-driven Computational Modeling and Refinement of Protein Structures on Genomic Scales
数据驱动的计算建模和基因组尺度蛋白质结构的细化
  • 批准号:
    10456948
  • 财政年份:
    2020
  • 资助金额:
    $ 36.21万
  • 项目类别:

相似海外基金

Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
  • 批准号:
    23K06972
  • 财政年份:
    2023
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
  • 批准号:
    23KJ0939
  • 财政年份:
    2023
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
  • 批准号:
    19K07013
  • 财政年份:
    2019
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
  • 批准号:
    16K05820
  • 财政年份:
    2016
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
  • 批准号:
    16K08426
  • 财政年份:
    2016
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
  • 批准号:
    25461010
  • 财政年份:
    2013
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
  • 批准号:
    23654177
  • 财政年份:
    2011
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
  • 批准号:
    22550105
  • 财政年份:
    2010
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
  • 批准号:
    NE/D00232X/1
  • 财政年份:
    2006
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
  • 批准号:
    16500189
  • 财政年份:
    2004
  • 资助金额:
    $ 36.21万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了