ABI Innovation: Next-Gen Clustering: Fast and Accurate Ways to Cluster Proteins

ABI 创新:下一代聚类:快速准确的蛋白质聚类方法

基本信息

  • 批准号:
    1262664
  • 负责人:
  • 金额:
    $ 66.7万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2013
  • 资助国家:
    美国
  • 起止时间:
    2013-09-01 至 2018-08-31
  • 项目状态:
    已结题

项目摘要

DNA sequencing technology has decreased the cost and increased the speed of genome sequencing, and this has led to an exponential growth in the availability of sequenced genomes. In order to take full advantage of the situation, researchers must have easy-to-use but powerful software tools. Unfortunately, it has been a struggle for providers of open-source software to keep pace with the immense volumes of biological data generated on a daily basis. As a result, many researchers are using tools for tasks the software was not meant to perform resulting in less than optimal results. Suitable open-source software tools are needed to process the quantities of genome data and obtain the knowledge needed for scientific breakthroughs. The intellectual merit of the proposed project is that it will fill one significant void, the need for a software program that can be used to cluster millions of protein sequences from sequenced genomes quickly and accurately using cloud computing. From these protein clusters, a phylogenomics approach can be used to predict the function of uncharacterized proteins. Several versions of the software tool pClust have already been developed, and results obtained are far superior to those generated by any other means. For this work a version of pClust will be developed for use in the cloud and with support for incremental clustering. In addition pClust will be used to cluster proteins from all available whole genome sequences for the entire bacterial phylum Proteobacteria (approximately 1770 species presently). This will be the most comprehensive study of the Proteobacteria phylum ever and will represent a scale that has never as yet been achieved.The broader impacts of this project will occur in several different areas. When made available via cloud computing and a user-friendly graphical user interface, the parallel configuration of the pClust program will allow scientists worldwide to analyze thousands of genomes at once, quickly and accurately clustering all the proteins within the genomes. Deployment of this software tool will have a significant impact on discoveries in science and medicine. In terms of education, both graduate and undergraduate students will be trained. Emphasis will be placed on recruiting and engaging female students into this interdisciplinary project, and the project team will participate in annual outreach programs to middle and high school students. Key research findings originating from this project will be published in peer-reviewed journals and conferences. Also, the software tools developed as part of this project will be published as open source at Google Code, and the results of the clustering, including the protein clusters themselves, phylogenetic trees, phylogenetic profiles, and percentage agreement among the phylogenetic profiles will be made freely available on the WSU School of Electrical Engineering and Computer Science's Bioinformatics and Computational Biology Web site.
DNA测序技术降低了基因组测序的成本并提高了测序的速度,这导致测序基因组的可用性呈指数增长。为了充分利用这种情况,研究人员必须拥有易于使用但功能强大的软件工具。不幸的是,开源软件提供商一直在努力跟上每天产生的大量生物数据的步伐。因此,许多研究人员正在使用工具来执行软件不打算执行的任务,从而导致不太理想的结果。需要适当的开放源码软件工具来处理大量的基因组数据并获得科学突破所需的知识。该项目的智力价值在于,它将填补一个重要的空白,即需要一个软件程序,可以使用云计算快速准确地从测序的基因组中聚类数百万个蛋白质序列。从这些蛋白质簇中,可以使用一种基因组学方法来预测未表征蛋白质的功能。已经开发了几个版本的软件工具pClust,所获得的结果远远优于任何其他手段产生的结果上级。对于这项工作,将开发一个版本的pClust用于云计算,并支持增量集群。此外,pClust将用于对整个细菌门变形菌门(目前约1770个物种)的所有可用全基因组序列的蛋白质进行聚类。这将是变形菌门有史以来最全面的研究,并将代表一个规模,从来没有达到。该项目的更广泛的影响将发生在几个不同的领域。当通过云计算和用户友好的图形用户界面提供时,pClust程序的并行配置将使世界各地的科学家能够同时分析数千个基因组,快速准确地对基因组中的所有蛋白质进行聚类。该软件工具的部署将对科学和医学的发现产生重大影响。在教育方面,将培养研究生和本科生。重点将放在招募和吸引女学生参与这个跨学科项目上,项目团队将参与针对初中和高中生的年度外展计划。该项目的主要研究成果将发表在同行评审的期刊和会议上。此外,作为该项目的一部分开发的软件工具将在Google Code上作为开源发布,聚类结果,包括蛋白质簇本身,系统发育树,系统发育概况以及系统发育概况之间的百分比一致性将在WSU电气工程和计算机科学学院的生物信息学和计算生物学网站上免费提供。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Shira Broschat其他文献

Shira Broschat的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Shira Broschat', 18)}}的其他基金

Integration of Wave Theory and Statistical Pattern Classification for Ultrasound Imaging of Breast Tumors
波理论与统计模式分类相结合用于乳腺肿瘤超声成像
  • 批准号:
    9711051
  • 财政年份:
    1997
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Standard Grant
Presidential Faculty Fellow: Numerical Studies in Wave Propagation and Scattering
总统教职研究员:波传播和散射的数值研究
  • 批准号:
    9253547
  • 财政年份:
    1992
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Continuing Grant
NeXT Workstation Laboratory for Undergraduate Electromagnetics
NeXT 本科生电磁学工作站实验室
  • 批准号:
    9050682
  • 财政年份:
    1991
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Standard Grant
Presidential Young Investigator Award
总统青年研究员奖
  • 批准号:
    9058186
  • 财政年份:
    1990
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Continuing Grant

相似海外基金

Baycrest Academy for Research and Education Summer Program in Aging (SPA): Strengthening research competencies, cultivating empathy, building interprofessional networks and skills, and fostering innovation among the next generation of healthcare workers t
Baycrest Academy for Research and Education Summer Program in Aging (SPA):加强研究能力,培养同理心,建立跨专业网络和技能,并促进下一代医疗保健工作者的创新
  • 批准号:
    498310
  • 财政年份:
    2024
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Operating Grants
HEALTH INNOVATION NEXT GENERATION PAYMENT&PRICING MODELS: HI- PRIX PROJECT
健康创新下一代支付
  • 批准号:
    10062152
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Health Innovation Next Generation Payment & Pricing Models (HI-PRIX)
健康创新下一代支付
  • 批准号:
    10058086
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
HEALTH INNOVATION NEXT GENERATION PAYMENT & PRICING MODELS
健康创新下一代支付
  • 批准号:
    10062415
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Boosting innovation in breeding for the next generation of legume crops for Europe
促进欧洲下一代豆类作物育种创新
  • 批准号:
    10083603
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Boosting innovation in breeding for the next generation of legume crops for Europe
促进欧洲下一代豆类作物育种创新
  • 批准号:
    10077978
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Legume Generation: Boosting innovation in breeding for the next generation of legume crops for Europe
豆类世代:促进欧洲下一代豆类作物育种创新
  • 批准号:
    10079743
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Boosting innovation in breeding for the next generation of legume crops for Europe
促进欧洲下一代豆类作物育种创新
  • 批准号:
    10078428
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
Legume Generation: Boosting innovation in breeding for the next generation of legume crops for Europe
豆类世代:促进欧洲下一代豆类作物育种创新
  • 批准号:
    10073595
  • 财政年份:
    2023
  • 资助金额:
    $ 66.7万
  • 项目类别:
    EU-Funded
IUCRC Planning Grant Virginia Tech: Center for Wireless Innovation towards Secure, Pervasive, Efficient and Resilient Next G Networks (WISPER)
IUCRC 规划拨款弗吉尼亚理工大学:实现安全、普遍、高效和有弹性的下一代网络 (WISPER) 的无线创新中心
  • 批准号:
    2209662
  • 财政年份:
    2022
  • 资助金额:
    $ 66.7万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了