SBIR PHASE I - TOPIC 407 - CLOUD-BASED SOFTWARE FOR THE CANCER RESEARCH DATA COMMONSPERIOD OF PERFORMANCE: 09/16/2020-06/15/2021
SBIR 第一阶段 - 主题 407 - 用于癌症研究数据公共性能的云软件:09/16/2020-06/15/2021
基本信息
- 批准号:10250982
- 负责人:
- 金额:$ 25.21万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2020
- 资助国家:美国
- 起止时间:2020-09-16 至 2021-06-15
- 项目状态:已结题
- 来源:
- 关键词:AlgorithmsAmino Acid SequenceCloud ComputingCodeComplexComputer softwareDataData SetFutureGenomicsHumanLinkMiningMutationPerformancePhaseProcessProtein IsoformsProteomeProteomicsSamplingScientistShotgunsSmall Business Innovation Research GrantTechniquesTimeanticancer researchcloud basedcloud platformmolecular sequence databaseneoantigensnovelproteogenomicssearch engineterabytetool
项目摘要
Shotgun proteomics is the most commonly used technique for understanding the proteome of complex human samples. There is terabytes of proteomics data available on NCI's proteomics cloud, but unfortunately most proteomics search engine search a canonical human sequence database along with a
handful of PTMs. Thus more than 50% of the MS/MS spectra remain uninterpreted. We have developed a cloud-based algorithm, Bolt, that can search almost more than 2.4 million protein sequences (canonical, isoforms and mutations) along with 41 PTMs and partial tryptic search in a matter of minutes. Bolt is able to search this 50 times larger search space in minutes whereas the traditional search engine need much longer even on a smaller search space. Utilizing the power of cloud computing, we are able to sequence 20% more MS/MS than traditional search engines. We propose adapting and integrating Bolt to the NCI's cloud platform, and linking it to the CRDC proteomics pipeline (for shotgun proteomics and neoantigen discovery) where scientists can analyse their existing and future data sets seamlessly while mining them much deeper than before. We also propose to link the genomic node to extract sample-specific information to futher mine proteogenomics data set for novel coding regions.
鸟枪蛋白质组学是了解复杂人类样本蛋白质组的最常用技术。在NCI的蛋白质组学云上有数TB的蛋白质组学数据,但不幸的是,大多数蛋白质组学搜索引擎沿着
一些PTM。因此,超过50%的MS/MS光谱仍然无法解释。我们已经开发了一种基于云的算法Bolt,它可以在几分钟内搜索超过240万个蛋白质序列(典型的,异构体和突变),沿着41个PTM和部分胰蛋白酶搜索。Bolt能够在几分钟内搜索50倍的搜索空间,而传统搜索引擎即使在较小的搜索空间上也需要更长的时间。利用云计算的力量,我们能够比传统搜索引擎多测序20%的MS/MS。我们建议将Bolt适应并集成到NCI的云平台,并将其连接到CRDC蛋白质组学管道(用于鸟枪蛋白质组学和新抗原发现),科学家可以无缝分析他们现有和未来的数据集,同时比以前更深入地挖掘它们。我们还建议将基因组节点连接起来,以提取样本特异性信息,进一步挖掘新的编码区的蛋白质基因组学数据集。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
AMOL PRAKASH其他文献
AMOL PRAKASH的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
相似海外基金
Cerebral infarction treatment strategy using collagen-like "triple helix peptide" containing functional amino acid sequence
含功能氨基酸序列的类胶原“三螺旋肽”治疗脑梗塞策略
- 批准号:
23K06972 - 财政年份:2023
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Establishment of a screening method for functional microproteins independent of amino acid sequence conservation
不依赖氨基酸序列保守性的功能性微生物蛋白筛选方法的建立
- 批准号:
23KJ0939 - 财政年份:2023
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Effects of amino acid sequence and lipids on the structure and self-association of transmembrane helices
氨基酸序列和脂质对跨膜螺旋结构和自缔合的影响
- 批准号:
19K07013 - 财政年份:2019
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Construction of electron-transfer amino acid sequence probe with an interaction for protein and cell
蛋白质与细胞相互作用的电子转移氨基酸序列探针的构建
- 批准号:
16K05820 - 财政年份:2016
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Development of artificial antibody of anti-bitter taste receptor using random amino acid sequence library
利用随机氨基酸序列库开发抗苦味受体人工抗体
- 批准号:
16K08426 - 财政年份:2016
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
The aa15-17 amino acid sequence in the terminal protein domain of HBV polymerase as a viral factor affect-ing in vivo as well as in vitro replication activity of the virus.
HBV聚合酶末端蛋白结构域中的aa15-17氨基酸序列作为影响病毒体内和体外复制活性的病毒因子。
- 批准号:
25461010 - 财政年份:2013
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Amino acid sequence analysis of fossil proteins using mass spectrometry
使用质谱法分析化石蛋白质的氨基酸序列
- 批准号:
23654177 - 财政年份:2011
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Precise hybrid synthesis of glycoprotein through amino acid sequence-specific introduction of oligosaccharide followed by enzymatic transglycosylation reaction
通过氨基酸序列特异性引入寡糖,然后进行酶促糖基转移反应,精确杂合合成糖蛋白
- 批准号:
22550105 - 财政年份:2010
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Estimating selection on amino-acid sequence polymorphisms in Drosophila
果蝇氨基酸序列多态性选择的估计
- 批准号:
NE/D00232X/1 - 财政年份:2006
- 资助金额:
$ 25.21万 - 项目类别:
Research Grant
Construction of a neural network for detecting novel domains from amino acid sequence information only
构建仅从氨基酸序列信息检测新结构域的神经网络
- 批准号:
16500189 - 财政年份:2004
- 资助金额:
$ 25.21万 - 项目类别:
Grant-in-Aid for Scientific Research (C)