Similarity-Based Indexing and Integration of Protein Sequence and Structure Databases
基于相似性的蛋白质序列和结构数据库的索引和集成
基本信息
- 批准号:0750891
- 负责人:
- 金额:$ 49.81万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2008
- 资助国家:美国
- 起止时间:2008-08-15 至 2012-07-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The Ohio State University is awarded a grant to develop database indexing and similarity search technologies to manage, analyze, and integrate protein sequence and structure databases. Searching for similar sequences and structures in genomic and proteomic databases is a fundamental task in bioinformatics. As the size of the available data increases rapidly, it is essential to build indexing schemes so that integrated maintenance and querying of both sequence and structure data can be achieved effectively. To address this challenge, this project uses a unified theme for both types of data: extracting key features and mapping them into compact feature vectors spaces to facilitate construction of integrated index structures with sensitive, accurate, and efficient querying capabilities. For the sequence data, the project will develop novel feature extraction that involve physiochemical properties of the amino acids and detect low level of similarities. For the structural data, the project will develop methods to capture local structural motifs using contact maps and spatial motifs. In both cases, compact representation of features will be constructed, as well as efficient structure to index them. The approach incorporates biochemical proteins of molecules into feature extraction to discover functional sites of proteins and to return biologically relevant query results. Finally, based on the unified feature representation and indexing framework, the project will develop methods to integrate sequence and structure data effectively at various levels. A holistic approach combining sequence and structure data would help to overcome the limitations of each, and provide more accurate query results. The results of the project will benefit a wide range of application areas in natural and health sciences, including: comparative and functional genomics, protein modeling and design, drug development, and preventative and personalized medicine. Software developed in this project will facilitate large-scale genome-wide research projects which require iterative and interactive querying of available sequence and structure databases. The novel representations and sensitive motif extraction methods developed are also applicable to biological data visualization, classification, and multiple alignment problems. The software and the results of this project will be available at the website: http://bio.cse.ohio-state.edu.
俄亥俄州州立大学获得一笔赠款,用于开发数据库索引和相似性搜索技术,以管理、分析和整合蛋白质序列和结构数据库。在基因组和蛋白质组数据库中寻找相似的序列和结构是生物信息学的一项基本任务。随着可用数据量的迅速增加,建立索引方案以便有效地实现对序列和结构数据的集成维护和查询是非常必要的。为了应对这一挑战,该项目对这两种类型的数据使用统一的主题:提取关键特征并将其映射到紧凑的特征向量空间,以促进构建具有敏感,准确和高效查询功能的集成索引结构。对于序列数据,该项目将开发涉及氨基酸理化特性的新型特征提取并检测低水平的相似性。对于结构数据,该项目将开发使用接触图和空间图案捕捉局部结构图案的方法。在这两种情况下,将构造特征的紧凑表示,以及索引它们的有效结构。该方法将生物化学蛋白质的分子特征提取,发现蛋白质的功能位点,并返回生物相关的查询结果。最后,基于统一的特征表示和索引框架,该项目将开发在各个级别上有效集成序列和结构数据的方法。结合序列和结构数据的整体方法将有助于克服各自的局限性,并提供更准确的查询结果。该项目的成果将使自然科学和健康科学的广泛应用领域受益,包括:比较和功能基因组学,蛋白质建模和设计,药物开发以及预防和个性化医疗。在这个项目中开发的软件将促进大规模的全基因组研究项目,需要迭代和交互式查询可用的序列和结构数据库。开发的新的表示和敏感的基序提取方法也适用于生物数据可视化,分类和多个对齐问题。该软件和该项目的结果将在以下网站上提供:http://bio.cse.ohio-state.edu。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Hakan Ferhatosmanoglu其他文献
MicroarrayDesigner: an online search tool and repository for near-optimal microarray experimental designs
- DOI:
10.1186/1471-2105-10-304 - 发表时间:
2009-09-22 - 期刊:
- 影响因子:3.300
- 作者:
Ahmet Sacan;Nilgun Ferhatosmanoglu;Hakan Ferhatosmanoglu - 通讯作者:
Hakan Ferhatosmanoglu
Hakan Ferhatosmanoglu的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Hakan Ferhatosmanoglu', 18)}}的其他基金
CAREER: Exploration of Dynamic Sequences in Scientific Databases
职业:探索科学数据库中的动态序列
- 批准号:
0546713 - 财政年份:2006
- 资助金额:
$ 49.81万 - 项目类别:
Continuing Grant
相似国自然基金
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Incentive and governance schenism study of corporate green washing behavior in China: Based on an integiated view of econfiguration of environmental authority and decoupling logic
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金项目
Exploring the Intrinsic Mechanisms of CEO Turnover and Market Reaction: An Explanation Based on Information Asymmetry
- 批准号:W2433169
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金项目
A study on prototype flexible multifunctional graphene foam-based sensing grid (柔性多功能石墨烯泡沫传感网格原型研究)
- 批准号:
- 批准年份:2020
- 资助金额:20 万元
- 项目类别:
基于tag-based单细胞转录组测序解析造血干细胞发育的可变剪接
- 批准号:81900115
- 批准年份:2019
- 资助金额:21.0 万元
- 项目类别:青年科学基金项目
应用Agent-Based-Model研究围术期单剂量地塞米松对手术切口愈合的影响及机制
- 批准号:81771933
- 批准年份:2017
- 资助金额:50.0 万元
- 项目类别:面上项目
Reality-based Interaction用户界面模型和评估方法研究
- 批准号:61170182
- 批准年份:2011
- 资助金额:57.0 万元
- 项目类别:面上项目
Multistage,haplotype and functional tests-based FCAR 基因和IgA肾病相关关系研究
- 批准号:30771013
- 批准年份:2007
- 资助金额:30.0 万元
- 项目类别:面上项目
差异蛋白质组技术结合Array-based CGH 寻找骨肉瘤分子标志物
- 批准号:30470665
- 批准年份:2004
- 资助金额:8.0 万元
- 项目类别:面上项目
GaN-based稀磁半导体材料与自旋电子共振隧穿器件的研究
- 批准号:60376005
- 批准年份:2003
- 资助金额:20.0 万元
- 项目类别:面上项目
相似海外基金
Indexing the Kansei value of traditional textiles based on the consumption process for sustainable fashion
基于可持续时尚的消费过程对传统纺织品的感性价值进行索引
- 批准号:
22K02180 - 财政年份:2022
- 资助金额:
$ 49.81万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Tagmentation-based Indexing for Methylation Sequencing as a novel method of high-throughput methylation clock measurement
基于标签的甲基化测序索引作为高通量甲基化时钟测量的新方法
- 批准号:
10273233 - 财政年份:2021
- 资助金额:
$ 49.81万 - 项目类别:
Indexing of reassurance, attachment, and healing effects based on visual and tactile evaluation of textiles
基于纺织品的视觉和触觉评估对安心、依恋和治愈效果进行索引
- 批准号:
20K02365 - 财政年份:2020
- 资助金额:
$ 49.81万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A Study on Automatic Indexing Based on Textual Mentions to Geographical Location in Story Archiving
故事归档中基于地理位置文本提及的自动索引研究
- 批准号:
18K11982 - 财政年份:2018
- 资助金额:
$ 49.81万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Semantics4Art&Architecture. Consolidation of a sustainable research infrastructure for ontology-based documentation and indexing of art and architecture
语义艺术
- 批准号:
401756994 - 财政年份:2018
- 资助金额:
$ 49.81万 - 项目类别:
Research data and software (Scientific Library Services and Information Systems)
String Indexing Based on Space-Optimal Grammar Compression and Its Application to Knowledge Discovery from Stream Data
基于空间最优语法压缩的字符串索引及其在流数据知识发现中的应用
- 批准号:
18K18111 - 财政年份:2018
- 资助金额:
$ 49.81万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
NSF Postdoctoral Fellowship in Biology FY 2017: Improving RNA-Seq Analysis through Graph-based Analysis and Computational Indexing
2017 财年 NSF 生物学博士后奖学金:通过基于图形的分析和计算索引改进 RNA-Seq 分析
- 批准号:
1711984 - 财政年份:2017
- 资助金额:
$ 49.81万 - 项目类别:
Fellowship Award
New Graph Indexing Structures based on Decomposition
基于分解的新图索引结构
- 批准号:
16K12393 - 财政年份:2016
- 资助金额:
$ 49.81万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
Development of an MEI and TEI based Model for Contextual Indexing of Music Documentation: Holdings of the Detmold Court Theatre (1825 - 1875)
开发基于 MEI 和 TEI 的音乐文献上下文索引模型:Detmold 宫廷剧院的藏品(1825 - 1875)
- 批准号:
257284058 - 财政年份:2014
- 资助金额:
$ 49.81万 - 项目类别:
Cataloguing and Digitisation (Scientific Library Services and Information Systems)
EAGER: Leveraging 3D structure estimates for photo collection based geo-localization and semantic indexing
EAGER:利用 3D 结构估计进行基于照片收集的地理定位和语义索引
- 批准号:
1349074 - 财政年份:2013
- 资助金额:
$ 49.81万 - 项目类别:
Standard Grant














{{item.name}}会员




