Computational Methods for Wrapping and Threading Remote Protein Homologs
包裹和穿线远程蛋白质同系物的计算方法
基本信息
- 批准号:8076912
- 负责人:
- 金额:$ 25.71万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2008
- 资助国家:美国
- 起止时间:2008-06-01 至 2013-05-31
- 项目状态:已结题
- 来源:
- 关键词:AllergensAmino Acid SequenceAmino AcidsBacterial AdhesinsBenchmarkingBotulismCoinComputing MethodologiesDependencyDetectionDistantElementsEmployee StrikesFamilyGenerationsHandHealthHomologous GeneHomologous ProteinHumanInternetInterventionLeadLibrariesMapsMedicalMethodsMetricModelingPathogenesisPectate lyasePenetrationPeptide Sequence DeterminationPertussisPollenProtein FamilyProteinsReportingRestRosaRunningSequence AlignmentSequence HomologySet proteinShapesSource CodeSpeedStructureSurfaceTestingToxinTraining ProgramsTrefoilValidationVertebral columnVirulenceWorkabstractingbasebeta pleated sheetcomparativedesignflexibilityimprovedmarkov modelmembermicrobialnovelnovel strategiespathogenprogramsprotein functionprotein structureprotein structure predictionresearch studythree dimensional structureweb site
项目摘要
DESCRIPTION (provided by applicant): The "twillight zone" is a term coined by Burkhard Rost to refer to remote protein homologs whose sequence similarity to proteins of known structure is sufficiently low that computational detection of the homology becomes quite challenging. We propose to construct new threading methods that extend protein structure prediction and sequence/structure alignments further into the "twilight zone". We attack the problem on two fronts: first, we intend to extend the "wrapping" methods we designed to successfully attack two hard special cases of this problem to other SCOP superfamilies. This involves casting the pairwise dependencies in the wrapping portion of the energy function into the general framework of Markov Random Fields, while still allowing them to wrap multiple aligned sequences to narrow the search space; then using a more sophisticated energy function based on a backbone-dependent rotamer library for sidechain packing. Second, our programs for the beta-helix and trefoil folds used human intervention to construct the core structural templates on which we are wrapping the sequences to predict whether they could fold into these structures or not. In order to construct a general threading program with reasonable fold library coverage for the PDB, we need to solve the challenge of automating the construction of a structural template from a set of proteins that, for example, all belong to the same SCOP superfamily. Most current threading programs train too closely to a backbone of one particular structure to be able to capture remote homologs. We propose a novel multiple structure alignment that adds geometric flexibility to capture similarities between more distant homologs, from which more general core templates can be abstracted. The applications of better computational protein structure prediction to speed up medical discovery are well-known. BetaWrap, our first beta-helix prediction program, already uncovered a previously-unknown relationship between the beta-helix fold and the virulence of microbial pathogens. A striking prediction of the BetaWrap program is that the beta-helix fold is predicted for many surface adhesins, toxins, and other recognition/penetration proteins of human pathogens. Our prediction that a major pollen allergen forms the beta-helix shape has just recently been confirmed experimentally. PUBLIC HEALTH RELEVANCE: Advances in computational protein structure prediction can help guide prediction of protein function, and thus speed medical discovery. This proposal is especially targeted at improving prediction of beta-structural motifs, which include many protein families that are important for bacterial pathogenesis, with representatives from whooping cough toxin (beta-helices) to the botulism toxin (beta-trefoils). BetaWrap, our first beta-helix prediction program, already uncovered a previously-unknown relationship between the beta-helix fold and the virulence of microbial pathogens. A striking prediction of the BetaWrap program is that the beta-helix fold is predicted for many surface adhesins, toxins, and other recognition/penetration proteins of human pathogens. Our prediction that a major pollen allergen forms the beta-helix shape has just recently been confirmed experimentally.
描述(由申请人提供):“暮光地带”是由Burkhard Rost创造的一个术语,指与已知结构的蛋白质序列相似性足够低,以至于同源性的计算检测变得相当具有挑战性的远程蛋白质同源物。我们建议构建新的线程方法,将蛋白质结构预测和序列/结构比对进一步扩展到“模糊区域”。我们从两个方面来解决这个问题:首先,我们打算将我们设计的“包装”方法扩展到其他SCOP超族,以成功地解决这个问题的两个硬特殊情况。这包括将能量函数包装部分的成对依赖关系转换到马尔可夫随机场的一般框架中,同时仍然允许它们包装多个对齐的序列以缩小搜索空间;然后使用基于依赖于主干的转子库的更复杂的能量函数进行侧链填充。其次,我们的β -螺旋和三叶折叠程序使用人工干预来构建核心结构模板,我们在其上包裹序列,以预测它们是否可以折叠成这些结构。为了为PDB构建一个具有合理折叠库覆盖的通用线程程序,我们需要解决从一组蛋白质中自动构建结构模板的挑战,例如,所有这些蛋白质都属于同一个SCOP超家族。大多数当前的线程程序训练过于接近一个特定结构的主干,无法捕获远程同源。我们提出了一种新的多结构对齐方法,增加了几何灵活性,以捕获更远的同源物之间的相似性,从中可以抽象出更通用的核心模板。更好的计算蛋白质结构预测在加速医学发现方面的应用是众所周知的。BetaWrap是我们的第一个β -螺旋预测程序,已经揭示了β -螺旋折叠与微生物病原体毒力之间以前未知的关系。BetaWrap程序的一个惊人的预测是,β -螺旋折叠被预测为许多表面粘附素、毒素和其他人类病原体的识别/渗透蛋白。我们的预测是一种主要的花粉过敏原形成了β -螺旋形状,这一预测最近得到了实验证实。公共卫生相关性:计算蛋白质结构预测的进展可以帮助指导蛋白质功能的预测,从而加快医学发现。该建议特别针对改善β结构基基的预测,其中包括许多对细菌发病机制很重要的蛋白质家族,从百日咳毒素(β -螺旋)到肉毒毒素(β -三叶)的代表。BetaWrap是我们的第一个β -螺旋预测程序,已经揭示了β -螺旋折叠与微生物病原体毒力之间以前未知的关系。BetaWrap程序的一个惊人的预测是,β -螺旋折叠被预测为许多表面粘附素、毒素和其他人类病原体的识别/渗透蛋白。我们的预测是一种主要的花粉过敏原形成了β -螺旋形状,这一预测最近得到了实验证实。
项目成果
期刊论文数量(12)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Augmented training of hidden Markov models to recognize remote homologs via simulated evolution.
- DOI:10.1093/bioinformatics/btp265
- 发表时间:2009-07-01
- 期刊:
- 影响因子:0
- 作者:Kumar A;Cowen L
- 通讯作者:Cowen L
Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data.
- DOI:10.1186/1471-2105-14-23
- 发表时间:2013-01-18
- 期刊:
- 影响因子:3
- 作者:Gallant A;Leiserson MD;Kachalov M;Cowen LJ;Hescott BJ
- 通讯作者:Hescott BJ
Going the distance for protein function prediction: a new distance metric for protein interaction networks.
- DOI:10.1371/journal.pone.0076339
- 发表时间:2013
- 期刊:
- 影响因子:3.7
- 作者:Cao M;Zhang H;Park J;Daniels NM;Crovella ME;Cowen LJ;Hescott B
- 通讯作者:Hescott B
SMURFLite: combining simplified Markov random fields with simulated evolution improves remote homology detection for beta-structural proteins into the twilight zone.
- DOI:10.1093/bioinformatics/bts110
- 发表时间:2012-05-01
- 期刊:
- 影响因子:0
- 作者:Daniels NM;Hosur R;Berger B;Cowen LJ
- 通讯作者:Cowen LJ
Compressive genomics for protein databases.
- DOI:10.1093/bioinformatics/btt214
- 发表时间:2013-07-01
- 期刊:
- 影响因子:0
- 作者:Daniels NM;Gallant A;Peng J;Cowen LJ;Baym M;Berger B
- 通讯作者:Berger B
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
LENORE Jennifer COWEN其他文献
LENORE Jennifer COWEN的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('LENORE Jennifer COWEN', 18)}}的其他基金
Computational Methods for Wrapping and Threading Remote Protein Homologs
包裹和穿线远程蛋白质同系物的计算方法
- 批准号:
7624601 - 财政年份:2008
- 资助金额:
$ 25.71万 - 项目类别:
Computational Methods for Wrapping and Threading Remote Protein Homologs
包裹和穿线远程蛋白质同系物的计算方法
- 批准号:
7851282 - 财政年份:2008
- 资助金额:
$ 25.71万 - 项目类别:
Computational Methods for Wrapping and Threading Remote Protein Homologs
包裹和穿线远程蛋白质同系物的计算方法
- 批准号:
7460514 - 财政年份:2008
- 资助金额:
$ 25.71万 - 项目类别:
相似海外基金
Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
- 批准号:
23K06972 - 财政年份:2023
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
- 批准号:
23KJ0939 - 财政年份:2023
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
- 批准号:
19K07013 - 财政年份:2019
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
- 批准号:
16K05820 - 财政年份:2016
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
- 批准号:
16K08426 - 财政年份:2016
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
- 批准号:
25461010 - 财政年份:2013
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
- 批准号:
23654177 - 财政年份:2011
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
- 批准号:
22550105 - 财政年份:2010
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
- 批准号:
NE/D00232X/1 - 财政年份:2006
- 资助金额:
$ 25.71万 - 项目类别:
Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
- 批准号:
16500189 - 财政年份:2004
- 资助金额:
$ 25.71万 - 项目类别:
Grant-in-Aid for Scientific Research (C)