III: Medium: Collaborative Research: Multi-level computational approaches to protein function prediction
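Basic Information
- Award number: 1901191
- Principal investigator:
- Amount: $816,700
- Host institution:
- Host institution country: United States
- Award type: Continuing Grant
- Fiscal year: 2019
- Funding country: United States
- Period: 2019-08-01 to 2023-07-31
- Status: Completed
- Source:
- Keywords:
Project Abstract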
Proteins are the workhorse molecules of life, participating in nearly every cellular process, including signal transduction, enzyme catalysis, structural support, bodily movement, and defense against pathogens. Interpreting the specific functional role that each protein plays in the cell is thus critical for understanding the fundamental principles of biological processes and for designing new drug treatments that regulate those processes to improve human health. The task, however, is highly non-trivial in modern molecular biology. The most accurate way to determine a protein's biological function is through structural biology and biochemistry experiments, but such experiments are costly and too slow for large-scale application because they depend on manual skill and data processing. As a result, the functions of the majority of proteins in human and other important species remain unknown despite decades of effort. The lack of genome-wide protein function information has significantly impeded progress in systems biology studies that aim at a comprehensive understanding of life processes. In this project, the investigators plan to develop advanced computational methods for automatic yet reliable protein function annotation. The resulting methods and databases will be freely released to the scientific community for use in large-scale, genome-wide protein function annotation studies. The project will also provide opportunities to promote the participation of underrepresented groups, including women and African Americans, in computational biology education and method development.
Built on the assumption that similar sequences have similar functions, a routine approach to computational protein function annotation is comparative modeling, which deduces the functions of target proteins from known homologous proteins. However, the accuracy and coverage of this approach are limited by the diversity of gene evolution. Significant progress has recently been achieved in protein 3D structure prediction, and state-of-the-art algorithms can generate high-quality structures for distant-homology proteins with unprecedented capacity. This project seeks to explore new ideas for enhancing the accuracy of distant-homology protein function annotation by using 3D models from cutting-edge protein structure prediction, with a focus on ligand-protein binding interactions, gene ontology, and post-translational modifications. Thermal motion and intrinsic disorder of protein structures are also integrated into the pipelines for better function annotation. Although the proposed approaches are not expected to address all the fundamental issues that first-principles methods target, namely how and why proteins fold and function, the success of the studies should help establish a practical knowledge-based relation between structure and function that can be used for genome-scale applications, with models useful for guiding new experimental design, and thus significantly enhance the impact of protein structure modeling on biological studies.
This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
Project Outcomes
Journal articles (22)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction
- DOI: 10.1186/s12859-020-3342-z
- Published: 2020-04-23
- Journal:
- Impact factor: 3
- Authors: Thapa, Niraj; Chaudhari, Meenal; KC, Dukka B.
- Corresponding author: KC, Dukka B.
FUpred: detecting protein domains through deep-learning-based contact map prediction
- DOI: 10.1093/bioinformatics/btaa217
- Published: 2020-06-15
- Journal:
- Impact factor: 5.8
- Authors: Zheng, Wei; Zhou, Xiaogen; Zhang, Yang
- Corresponding author: Zhang, Yang
pLMSNOSite: an ensemble-based approach for predicting protein S-nitrosylation sites by integrating supervised word embedding and embedding from pre-trained protein language model.
- DOI: 10.1186/s12859-023-05164-9
- Published: 2023-02-08
- Journal:
- Impact factor: 3
- Authors:
- Corresponding author:
Protein Structure and Sequence Reanalysis of 2019-nCoV Genome Refutes Snakes as Its Intermediate Host and the Unique Similarity between Its Spike Protein Insertions and HIV-1
- DOI: 10.1021/acs.jproteome.0c00129
- Published: 2020-04-03
- Journal:
- Impact factor: 4.4
- Authors: Zhang, Chengxin; Zheng, Wei; Zhang, Yang
- Corresponding author: Zhang, Yang
The Human DNA Mismatch Repair Protein MSH3 Contains Nuclear Localization and Export Signals That Enable Nuclear-Cytosolic Shuttling in Response to Inflammation
- DOI: 10.1128/mcb.00029-20
- Published: 2020-07-01
- Journal:
- Impact factor: 5.3
- Authors: Tseng-Rogenski, Stephanie S.; Munakata, Koji; Carethers, John M.
- Corresponding author: Carethers, John M.
Other publications by Brian Athey
Valproic Acid Induces the NEUROD1 Transcriptional Program of Neurogenesis after Traumatic Brain Injury
- DOI: 10.1016/j.jamcollsurg.2016.06.347
- Published: 2016-10-01
- Journal:
- Impact factor:
- Authors: Patrick E. Georgoff; Gerald Higgins; Vahagn C. Nikolian; Ted Bambakidis; Simone E. Dekker; Martin Sillesen; Brian Athey; Hasan B. Alam
- Corresponding author: Hasan B. Alam
Similar Overseas Grants
III: Medium: Collaborative Research: From Open Data to Open Data Curation
- Award number: 2420691
- Fiscal year: 2024
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: Medium: Designing AI Systems with Steerable Long-Term Dynamics
- Award number: 2312865
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
- Award number: 2312932
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: Medium: Algorithms for scalable inference and phylodynamic analysis of tumor haplotypes using low-coverage single cell sequencing data
- Award number: 2415562
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
III: Medium: Collaborative Research: Integrating Large-Scale Machine Learning and Edge Computing for Collaborative Autonomous Vehicles
- Award number: 2348169
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Continuing Grant
Collaborative Research: III: Medium: VirtualLab: Integrating Deep Graph Learning and Causal Inference for Multi-Agent Dynamical Systems
- Award number: 2312501
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: Medium: Knowledge discovery from highly heterogeneous, sparse and private data in biomedical informatics
- Award number: 2312862
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: MEDIUM: Responsible Design and Validation of Algorithmic Rankers
- Award number: 2312930
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: III: Medium: New Machine Learning Empowered Nanoinformatics System for Advancing Nanomaterial Design
- Award number: 2347592
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
Collaborative Research: IIS: III: MEDIUM: Learning Protein-ish: Foundational Insight on Protein Language Models for Better Understanding, Democratized Access, and Discovery
- Award number: 2310113
- Fiscal year: 2023
- Funding amount: $816,700
- Award type: Standard Grant
