UTILIZING TERAGRID TO DETECT REMOTE SIMILARITY PROTEIN SEQUENCES

利用 teragrid 检测远程相似性蛋白质序列

基本信息

  • 批准号:
    7723381
  • 负责人:
  • 金额:
    $ 0.05万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2008
  • 资助国家:
    美国
  • 起止时间:
    2008-08-01 至 2009-07-31
  • 项目状态:
    已结题

项目摘要

This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. The subproject and investigator (PI) may have received primary funding from another NIH source, and thus could be represented in other CRISP entries. The institution listed is for the Center, which is not necessarily the institution for the investigator. The structure of a protein is often a key to its function. However, significant time and cost is required to determine the structure of a protein by experimental methods, such as the X-ray crystallography or the Nuclear Magnetic Resonance. There are currently less than 50,000 protein structures deposited in the Protein Data Bank (PDB), of which about 80% are redundant. On the other hand, the genomic sequencing efforts, such as the Human Genome Project, have populated protein sequence databases with well over 5 million sequences. With the increasing gap between known sequences and experimentally determined structures, the computational methods capable of predicting the structure and function of proteins will play an increasing role in protein annotation studies. The ultimate goal of the research described in this proposal is to develop a new protein sequence homology detection method that leverages the growing body of protein sequence data in ways that existing methods do not. The increased sensitivity in recognizing relationships between amino acid sequences will be achieved through the applications of intermediate sequence search strategies and profile-profile techniques. To date, the progress in this area has been limited by the lack of the computational resources needed to perform the transitive profile-profile search. We propose to utilize the TeraGrid to develop and test the first intermediate profile-profile algorithm for detecting protein sequence similarities. The algorithm constructs a sequential profile for the input amino acid sequence (target) and uses it to transitively search the database of all representative profiles for sequences in nr. In the transitive search, the matches found after running the first sequence comparison are used as new queries against the database. The whole process is repeated, iteratively with these new matches. The similarity between the target profile and the profile from the database is established through the intermediate sequences. Our project will be carried out in two stages: 1. In the first stage we will generate the set of representative alignment profiles for sequences from the non-redundant protein sequence database nr. 2. In the second phase we will deploy and test our algorithm.
该子项目是利用该技术的众多研究子项目之一 资源由 NIH/NCRR 资助的中心拨款提供。子项目和 研究者 (PI) 可能已从 NIH 的另一个来源获得主要资金, 因此可以在其他 CRISP 条目中表示。列出的机构是 对于中心来说,它不一定是研究者的机构。 蛋白质的结构通常是其功能的关键。然而,通过X射线晶体学或核磁共振等实验方法确定蛋白质的结构需要大量的时间和成本。目前蛋白质数据库(PDB)中存有不到 50,000 个蛋白质结构,其中约 80% 是冗余的。另一方面,人类基因组计划等基因组测序工作已经在蛋白质序列数据库中填充了超过 500 万条序列。随着已知序列和实验确定的结构之间的差距越来越大,能够预测蛋白质结构和功能的计算方法将在蛋白质注释研究中发挥越来越重要的作用。该提案中描述的研究的最终目标是开发一种新的蛋白质序列同源性检测方法,以现有方法无法做到的方式利用不断增长的蛋白质序列数据。通过中间序列搜索策略和图谱技术的应用,可以提高识别氨基酸序列之间关系的灵敏度。迄今为止,由于缺乏执行传递轮廓搜索所需的计算资源,该领域的进展受到限制。我们建议利用 TeraGrid 开发和测试第一个用于检测蛋白质序列相似性的中间轮廓-轮廓算法。该算法为输入的氨基酸序列(目标)构建一个序列图谱,并使用它来传递性地搜索所有代表性图谱的数据库以查找 nr 中的序列。在传递搜索中,运行第一次序列比较后找到的匹配项将用作针对数据库的新查询。整个过程会通过这些新匹配迭代地重复进行。目标图谱与数据库中的图谱之间的相似性是通过中间序列建立的。我们的项目将分两个阶段进行: 1. 在第一阶段,我们将为来自非冗余蛋白质序列数据库 nr 的序列生成一组代表性的比对图谱。 2. 在第二阶段,我们将部署并测试我们的算法。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

MARK FIENUP其他文献

MARK FIENUP的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('MARK FIENUP', 18)}}的其他基金

UTILIZING TERAGRID TO DETECT REMOTE SIMILARITY PROTEIN SEQUENCES
利用 teragrid 检测远程相似性蛋白质序列
  • 批准号:
    7956240
  • 财政年份:
    2009
  • 资助金额:
    $ 0.05万
  • 项目类别:
REMOTE PROTEIN SEQUENCE HOMOLOGY DETECTION
远程蛋白质序列同源性检测
  • 批准号:
    7956227
  • 财政年份:
    2009
  • 资助金额:
    $ 0.05万
  • 项目类别:
REMOTE PROTEIN SEQUENCE HOMOLOGY DETECTION
远程蛋白质序列同源性检测
  • 批准号:
    7723368
  • 财政年份:
    2008
  • 资助金额:
    $ 0.05万
  • 项目类别:

相似海外基金

Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
  • 批准号:
    23K06972
  • 财政年份:
    2023
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
  • 批准号:
    23KJ0939
  • 财政年份:
    2023
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
  • 批准号:
    19K07013
  • 财政年份:
    2019
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
  • 批准号:
    16K05820
  • 财政年份:
    2016
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
  • 批准号:
    16K08426
  • 财政年份:
    2016
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
  • 批准号:
    25461010
  • 财政年份:
    2013
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
  • 批准号:
    23654177
  • 财政年份:
    2011
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
  • 批准号:
    22550105
  • 财政年份:
    2010
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
  • 批准号:
    NE/D00232X/1
  • 财政年份:
    2006
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
  • 批准号:
    16500189
  • 财政年份:
    2004
  • 资助金额:
    $ 0.05万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了