The Dundee Resource for Sequence Analysis and Structure Prediction (DRSASP) - 2023 and Beyond

邓迪序列分析和结构预测资源 (DRSASP) - 2023 年及以后

基本信息

  • 批准号:
    BB/X018628/1
  • 负责人:
  • 金额:
    $ 88.13万
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Research Grant
  • 财政年份:
    2023
  • 资助国家:
    英国
  • 起止时间:
    2023 至 无数据
  • 项目状态:
    未结题

项目摘要

This resource application is focused continuing to support computer tools and techniques developed at the University of Dundee that are in daily use by thousands of biological research scientists and students throughout the UK and the world. The resource will not only ensure that these tools are readily available to all, but also improve the ability of scientists and students to use them through better interfaces and via training videos and other on-line materials. The tools focus on the analysis of protein sequences and structures which are briefly introduced here. The plans to make a plant, animal or micro-organism are encoded as the molecule DNA and known as its genome. The genome can be represented as a long word made up of four different letters (A, C, G, T). The genome may be a few thousand letters long for a virus, to several billion letters for plants and animals. The genome is divided up into regions called genes which are translated by complex molecular machines into other molecules such as proteins. Humans and other animals have 20-30,000 genes that code for proteins and each protein made up of a sequence of 20 different amino acid types joined together in a chain. Protein sequences from an organism vary in length from a few amino acids, to several thousand and can be represented as a word made up of 20 different letter types. The protein chain folds up into a complex three-dimensional shape that is defined primarily by its sequence. The shape of the protein, its "conformation", dictates the biological function of the protein, so understanding the conformation of a protein is vitally important to understanding the protein function. Over recent years there have been huge advances in technology to sequence DNA and so the genomes of many different organisms have been determined. As a consequence, the sequences of several million proteins are now known but less than 150,000 have had their detailed three-dimensional structures worked out. The computational tools that will make up this resource help to bridge this information gap by classifying protein sequences and making predictions of protein structure available in easy to interpret way that can guide biologists to design more efficient and effective experiments. This proposal will provide support, maintenance and training for the popular JPred protein structure prediction server which performs around 30,000 predictions monthly for scientists in >100 countries and other techniques that we have developed. It will enhance JPred to include structure predictions from the latest methods to predict protein three-dimensional structures. Web sites are good for humans to interact with, but less useful for computer software to interface to. Since our tools are useful for large analyses that might be done on many thousands of proteins, the resource also supports a novel "web services" interface to the tools using technology called 'Slivka' developed in Dundee that makes it easy to add new services. Web services allow a program or application to be run remotely from within a program. For example, I might have a program running on my desktop computer but call for an intensive calculation to be done on a remote high-performance computer system. Our services are complex computer systems but we will use technologies called docker and conda to package them in a way that other institutions can install and run them. This will help ensure the services remain available even if those at Dundee fail.
该资源应用程序的重点是继续支持邓迪大学开发的计算机工具和技术,这些工具和技术被英国和世界各地成千上万的生物研究科学家和学生日常使用。该资源不仅将确保所有人都能随时使用这些工具,而且还将通过更好的界面以及培训视频和其他在线材料,提高科学家和学生使用这些工具的能力。这些工具专注于蛋白质序列和结构的分析,这里简要介绍。制造植物、动物或微生物的计划被编码为DNA分子,称为其基因组。基因组可以表示为由四个不同字母(A,C,G,T)组成的长单词。病毒的基因组可能只有几千个字母,而动植物的基因组可能有几十亿个字母。基因组被分成称为基因的区域,这些区域被复杂的分子机器翻译成其他分子,如蛋白质。人类和其他动物有20- 30,000个编码蛋白质的基因,每个蛋白质由20种不同氨基酸类型的序列组成。生物体的蛋白质序列长度从几个氨基酸到几千个氨基酸不等,可以表示为由20种不同字母类型组成的单词。蛋白质链折叠成复杂的三维形状,主要由其序列定义。蛋白质的形状,它的“构象”,决定了蛋白质的生物学功能,因此了解蛋白质的构象对于了解蛋白质功能至关重要。近年来,DNA测序技术取得了巨大进步,许多不同生物体的基因组已经确定。结果,现在已知了几百万种蛋白质的序列,但只有不到15万种蛋白质的详细三维结构得到了确定。构成这一资源的计算工具有助于弥合这一信息差距,方法是对蛋白质序列进行分类,并以易于解释的方式对蛋白质结构进行预测,从而指导生物学家设计更高效和更有效的实验。该提案将为流行的JPred蛋白质结构预测服务器提供支持,维护和培训,该服务器每月为超过100个国家的科学家和我们开发的其他技术进行约30,000次预测。它将增强JPred,包括最新方法的结构预测,以预测蛋白质三维结构。网站对于人类的互动是很好的,但对于计算机软件的接口就不那么有用了。由于我们的工具对于可能对数千种蛋白质进行的大型分析非常有用,因此该资源还支持使用邓迪开发的名为“Slivka”的技术的工具的新型“Web服务”界面,从而可以轻松添加新服务。Web服务允许程序或应用程序从程序内部远程运行。例如,我可能有一个程序在我的台式计算机上运行,但需要在远程高性能计算机系统上进行密集计算。我们的服务是复杂的计算机系统,但我们将使用称为Docker和Conda的技术来打包它们,以便其他机构可以安装和运行它们。这将有助于确保即使邓迪的服务失败,这些服务仍然可用。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Geoffrey Barton其他文献

Geoffrey Barton的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Geoffrey Barton', 18)}}的其他基金

The Dundee Resource for Sequence Analysis and Structure Prediction
邓迪序列分析和结构预测资源
  • 批准号:
    BB/R014752/1
  • 财政年份:
    2018
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant
The Jalview Resource for Sequence Analysis and Annotation
用于序列分析和注释的 Jalview 资源
  • 批准号:
    BB/L020742/1
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant
The Dundee Resource for Protein Structure Prediction and Sequence Analysis
邓迪蛋白质结构预测和序列分析资源
  • 批准号:
    BB/J019364/1
  • 财政年份:
    2013
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant
The Jalview Resource for Sequence Analysis and Annotation - www.jalview.org
用于序列分析和注释的 Jalview 资源 - www.jalview.org
  • 批准号:
    BB/G022682/1
  • 财政年份:
    2009
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant

相似海外基金

REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
  • 批准号:
    10267371
  • 财政年份:
    2021
  • 资助金额:
    $ 88.13万
  • 项目类别:
Development of a sequence-indexed collection of transposon tagged maize mutants as a novel public resource and its application in root developmental biology
转座子标记玉米突变体序列索引集合的开发作为新型公共资源及其在根发育生物学中的应用
  • 批准号:
    415165461
  • 财政年份:
    2019
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grants
The Dundee Resource for Sequence Analysis and Structure Prediction
邓迪序列分析和结构预测资源
  • 批准号:
    BB/R014752/1
  • 财政年份:
    2018
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant
REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
  • 批准号:
    9024852
  • 财政年份:
    2016
  • 资助金额:
    $ 88.13万
  • 项目类别:
REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
  • 批准号:
    9215682
  • 财政年份:
    2016
  • 资助金额:
    $ 88.13万
  • 项目类别:
UniProt: A centralized protein sequence and function resource
UniProt:集中的蛋白质序列和功能资源
  • 批准号:
    9114369
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
The Jalview Resource for Sequence Analysis and Annotation
用于序列分析和注释的 Jalview 资源
  • 批准号:
    BB/L020742/1
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Research Grant
UniProt: A Protein Sequence and Function Resource for Biomedical Science
UniProt:生物医学的蛋白质序列和功能资源
  • 批准号:
    10267787
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
A Sequence-Indexed Reverse Genetics Resource for Maize: A Set of Lines with Single Ds-GFP Insertions Spread throughout the Genome
玉米的序列索引反向遗传学资源:一组具有单个 Ds-GFP 插入的品系,分布在整个基因组中
  • 批准号:
    1339238
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
    Continuing Grant
UniProt: A centralized protein sequence and function resource
UniProt:集中的蛋白质序列和功能资源
  • 批准号:
    8739769
  • 财政年份:
    2014
  • 资助金额:
    $ 88.13万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了