MARKOVIAN MODELS FOR PROTEIN IDENTIFICATION FROM TANDEM MASS SPECTROMETRY

串联质谱蛋白质鉴定的马尔可夫模型

基本信息

  • 批准号:
    8364375
  • 负责人:
  • 金额:
    $ 0.11万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2011
  • 资助国家:
    美国
  • 起止时间:
    2011-09-15 至 2013-07-31
  • 项目状态:
    已结题

项目摘要

This subproject is one of many research subprojects utilizing the resources provided by a Center grant funded by NIH/NCRR. Primary support for the subproject and the subproject's principal investigator may have been provided by other sources, including other NIH sources. The Total Cost listed for the subproject likely represents the estimated amount of Center infrastructure utilized by the subproject, not direct funding provided by the NCRR grant to the subproject or subproject staff. Biomedical Research has been revolutionized with technological advances leading to massive accumulation of data. All this data now needs to be mined in order to draw actionable insights into the various biological processes. Complex machine learning algorithms are being developed to perform automated analyses of these large datasets and to come up with robust models that explain the observed data. Such models are then used to identify patterns in data that enable solving of challenging decision problems like diagnosis and prognosis of disease. Our research involves one such class of algorithms called Hidden Markov Models, which are used extensively in sequential data mining problems in Biology. Our particular focus is on development of novel algorithms for identification and quantification of protein sequences in complex biological samples using data that comes out of mass spectrometers. Such analysis will lead to molecular characterization of target conditions like diseased states. Our algorithms involve learning models from large training datasets and are computationally intensive. Additionally, in order to learn a robust model that will perform well across a variety of future test data, we are proposing to perform large-scale experiments with different model topologies and features, and require learning of hundreds of different models worth many days of number-crunching work. However, the entire experimentation can be parallelized trivially since all the models can be learned independently from each other and hence, the need for computing machines that can run multiple jobs in parallel. Our algorithms (homegrown) have been implemented using Python programming language and can take advantage of presence of multiple processing units or cores. After speaking with consultants at PSC, we were suggested that the Blacklight machines are most suitable for our needs.
这个子项目是利用资源的许多研究子项目之一。 由NIH/NCRR资助的中心拨款提供。对子项目的主要支持 子项目的首席调查员可能是由其他来源提供的, 包括美国国立卫生研究院的其他来源。为子项目列出的总成本可能 表示该子项目使用的中心基础设施的估计数量, 不是由NCRR赠款提供给次级项目或次级项目工作人员的直接资金。 随着技术的进步,生物医学研究发生了革命性的变化,导致了大量数据的积累。现在需要挖掘所有这些数据,以便对各种生物过程得出可操作的见解。人们正在开发复杂的机器学习算法,以对这些大型数据集进行自动分析,并提出稳健的模型来解释观察到的数据。然后,这些模型被用来识别数据中的模式,这些模式能够解决具有挑战性的决策问题,如疾病的诊断和预后。我们的研究涉及这样一类算法,称为隐马尔可夫模型,它在生物学中的序列数据挖掘问题中被广泛使用。我们的重点是开发新的算法,利用质谱仪获得的数据识别和量化复杂生物样本中的蛋白质序列。这样的分析将导致目标条件的分子表征,如疾病状态。我们的算法涉及从大型训练数据集中学习模型,并且计算密集型。此外,为了学习一个健壮的模型,该模型将在未来的各种测试数据中表现良好,我们建议用不同的模型拓扑和功能进行大规模实验,并需要学习数百个不同的模型,这需要许多天的数值计算工作。然而,整个实验可以微不足道地并行化,因为所有的模型都可以相互独立地学习,因此需要能够并行运行多个作业的计算机器。我们的算法(自主开发的)是使用Python编程语言实现的,可以利用多个处理单元或核心的存在。在与PSC的顾问交谈后,我们被建议Blacklight机器最适合我们的需求。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Vanathi Gopalakrishnan其他文献

Vanathi Gopalakrishnan的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Vanathi Gopalakrishnan', 18)}}的其他基金

Transfer Rule Learning for Knowledge Based Biomarker Discovery and Predictive Bio
基于知识的生物标志物发现和预测生物的转移规则学习
  • 批准号:
    8711497
  • 财政年份:
    2012
  • 资助金额:
    $ 0.11万
  • 项目类别:
Transfer Rule Learning with Functional Mapping for Integrative Modeling of Panomics Data
具有功能映射的转移规则学习用于全景数据的集成建模
  • 批准号:
    9246538
  • 财政年份:
    2012
  • 资助金额:
    $ 0.11万
  • 项目类别:
Transfer Rule Learning with Functional Mapping for Integrative Modeling of Panomics Data
具有功能映射的转移规则学习用于全景数据的集成建模
  • 批准号:
    9111473
  • 财政年份:
    2012
  • 资助金额:
    $ 0.11万
  • 项目类别:
Transfer Rule Learning for Knowledge Based Biomarker Discovery and Predictive Bio
基于知识的生物标志物发现和预测生物的转移规则学习
  • 批准号:
    8549840
  • 财政年份:
    2012
  • 资助金额:
    $ 0.11万
  • 项目类别:
Transfer Rule Learning for Knowledge Based Biomarker Discovery and Predictive Bio
基于知识的生物标志物发现和预测生物的转移规则学习
  • 批准号:
    8373065
  • 财政年份:
    2012
  • 资助金额:
    $ 0.11万
  • 项目类别:
Bayesian Rule Learning Methods for Disease Prediction and Biomarker Discovery
用于疾病预测和生物标志物发现的贝叶斯规则学习方法
  • 批准号:
    8318619
  • 财政年份:
    2011
  • 资助金额:
    $ 0.11万
  • 项目类别:
Bayesian Rule Learning Methods for Disease Prediction and Biomarker Discovery
用于疾病预测和生物标志物发现的贝叶斯规则学习方法
  • 批准号:
    8024941
  • 财政年份:
    2011
  • 资助金额:
    $ 0.11万
  • 项目类别:
Intelligent Aids for Proteomic Data Mining
蛋白质组数据挖掘的智能辅助工具
  • 批准号:
    7089794
  • 财政年份:
    2004
  • 资助金额:
    $ 0.11万
  • 项目类别:
Intelligent Aids for Proteomic Data Mining
蛋白质组数据挖掘的智能辅助工具
  • 批准号:
    6811846
  • 财政年份:
    2004
  • 资助金额:
    $ 0.11万
  • 项目类别:
Intelligent Aids for Proteomic Data Mining
蛋白质组数据挖掘的智能辅助工具
  • 批准号:
    7460715
  • 财政年份:
    2004
  • 资助金额:
    $ 0.11万
  • 项目类别:

相似海外基金

Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
  • 批准号:
    23K06972
  • 财政年份:
    2023
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
  • 批准号:
    23KJ0939
  • 财政年份:
    2023
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
  • 批准号:
    19K07013
  • 财政年份:
    2019
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
  • 批准号:
    16K05820
  • 财政年份:
    2016
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
  • 批准号:
    16K08426
  • 财政年份:
    2016
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
  • 批准号:
    25461010
  • 财政年份:
    2013
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
  • 批准号:
    23654177
  • 财政年份:
    2011
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
  • 批准号:
    22550105
  • 财政年份:
    2010
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
  • 批准号:
    NE/D00232X/1
  • 财政年份:
    2006
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
  • 批准号:
    16500189
  • 财政年份:
    2004
  • 资助金额:
    $ 0.11万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了