The Dundee Resource for Sequence Analysis and Structure Prediction
邓迪序列分析和结构预测资源
基本信息
- 批准号:BB/R014752/1
- 负责人:
- 金额:$ 101.28万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2018
- 资助国家:英国
- 起止时间:2018 至 无数据
- 项目状态:未结题
- 来源:
- 关键词:
项目摘要
This resource application is focused on supporting and maintaining computer tools and techniques developed at the University of Dundee that are in daily use by thousands of biological scientists throughout the UK and the world. The resource will not only ensure that these tools are readily available to all scientists, but also improve the ability of scientists and students to use them through better interfaces and via regular face-to-face training courses and other on-line materials. The tools focus on the analysis of protein sequences and structures which are briefly introduced here. The plans to make a plant, animal or micro-organism are encoded as the molecule DNA and known as its genome. The genome can be represented as a long word made up of four different letters (A, C, G, T). The genome may be a few thousand letters long for a virus, to several billion letters for plants and animals. The genome is divided up into regions called genes which are translated by complex molecular machines into other molecules such as proteins. Humans and other animals have 20-30,000 genes that code for proteins and each protein made up of a sequence of 20 different amino acid types joined together in a chain. Protein sequences from an organism vary in length from a few amino acids, to several thousand and can be represented as a word made up of 20 different letter types. The protein chain folds up into a complex three-dimensional shape that is defined primarily by its sequence. The shape of the protein, its "conformation", dictates the biological function of the protein, so understanding the conformation of a protein is vitally important to understanding the protein function. Over recent years there have been huge advances in technology to sequence DNA and so the genomes of many different organisms have been determined. As a consequence, the sequences of several million proteins are now known but less than 150,000 have had their detailed three-dimensional structures worked out. The computational tools that will make up this resource help to bridge this information gap by classifying protein sequences and making predictions of protein structure that can guide biologists to design more efficient and effective experiments. A major objective of the proposal are to provide support, maintenance and training for the popular JPred protein structure prediction server which performs up to 500,000 predictions monthly for scientists in 200 countries and other techniques that we have developed. Web sites are good for humans to interact with, but less useful for computer software to interface to. Since our tools are useful for large analyses that might be done on many thousands of proteins, the new resource will also support a novel "web services" interface to the tools. Web services allow a program or application to be run remotely from within a program. For example, I might have a program running on my desktop computer, but call for an intensive calculation to be done on a remote high-performance computer system. We will develop our new framework for web services called "Slivka" that makes installation of web services easier. A key part of the new resource will be to store the results of analyses and predictions for many organisms in an innovative database called the ProteoCache.
该资源应用程序的重点是支持和维护邓迪大学开发的计算机工具和技术,这些工具和技术被英国和世界各地成千上万的生物科学家日常使用。该资源不仅将确保所有科学家都能随时使用这些工具,而且还将通过更好的界面和定期的面对面培训课程及其他在线材料,提高科学家和学生使用这些工具的能力。这些工具专注于蛋白质序列和结构的分析,这里简要介绍。制造植物、动物或微生物的计划被编码为DNA分子,称为其基因组。基因组可以表示为由四个不同字母(A、C、G、T)组成的长词。病毒的基因组可能只有几千个字母,而动植物的基因组可能有几十亿个字母。基因组被分成称为基因的区域,这些区域被复杂的分子机器翻译成其他分子,如蛋白质。人类和其他动物有20- 30,000个编码蛋白质的基因,每个蛋白质由20种不同氨基酸类型的序列组成。生物体的蛋白质序列长度从几个氨基酸到几千个氨基酸不等,可以表示为由20种不同字母类型组成的单词。蛋白质链折叠成复杂的三维形状,主要由其序列定义。蛋白质的形状,它的“构象”,决定了蛋白质的生物学功能,因此了解蛋白质的构象对于了解蛋白质功能至关重要。近年来,DNA测序技术取得了巨大进步,许多不同生物体的基因组已经确定。结果,现在已知了几百万种蛋白质的序列,但只有不到15万种蛋白质的详细三维结构得到了确定。构成这一资源的计算工具有助于通过对蛋白质序列进行分类和对蛋白质结构进行预测来弥合这一信息鸿沟,从而指导生物学家设计更高效和更有效的实验。该提案的一个主要目标是为流行的JPred蛋白质结构预测服务器提供支持,维护和培训,该服务器每月为200个国家的科学家和我们开发的其他技术进行多达50万次预测。网站对于人类的互动是很好的,但对于计算机软件的接口就不那么有用了。由于我们的工具对于可能对数千种蛋白质进行的大型分析非常有用,因此新资源还将支持工具的新型“Web服务”接口。Web服务允许程序或应用程序从程序内部远程运行。例如,我可能有一个程序在我的台式计算机上运行,但需要在远程高性能计算机系统上进行密集计算。我们将为Web服务开发新的框架,称为“Slivka”,使Web服务的安装更容易。新资源的一个关键部分将是将许多生物的分析和预测结果存储在一个名为ProteoCache的创新数据库中。
项目成果
期刊论文数量(10)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Effects of common mutations in the SARS-CoV-2 Spike RBD domain and its ligand the human ACE2 receptor on binding affinity and kinetics
- DOI:10.1101/2021.05.18.444646
- 发表时间:2021-05
- 期刊:
- 影响因子:0
- 作者:Michael I. MacGowan;Stuart Kutuzov;Mikhail Dushek;O. Barton;Geoffrey J. Merwe;P. Anton;Michael I. Barton;Stuart A. MacGowan;M. Kutuzov;Omer Dushek;G. Barton;A. V. D. Merwe
- 通讯作者:Michael I. MacGowan;Stuart Kutuzov;Mikhail Dushek;O. Barton;Geoffrey J. Merwe;P. Anton;Michael I. Barton;Stuart A. MacGowan;M. Kutuzov;Omer Dushek;G. Barton;A. V. D. Merwe
A unified approach to evolutionary conservation and population constraint in protein domains highlights structural features and pathogenic sites
蛋白质领域进化保护和种群限制的统一方法突出了结构特征和致病位点
- DOI:10.21203/rs.3.rs-3160340/v1
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:MacGowan S
- 通讯作者:MacGowan S
Effects of common mutations in the SARS-CoV-2 Spike RBD and its ligand, the human ACE2 receptor on binding affinity and kinetics.
- DOI:10.7554/elife.70658
- 发表时间:2021-08-26
- 期刊:
- 影响因子:7.7
- 作者:Barton MI;MacGowan SA;Kutuzov MA;Dushek O;Barton GJ;van der Merwe PA
- 通讯作者:van der Merwe PA
Missense variants in human ACE2 strongly affect binding to SARS-CoV-2 Spike providing a mechanism for ACE2 mediated genetic risk in Covid-19: A case study in affinity predictions of interface variants.
- DOI:10.1371/journal.pcbi.1009922
- 发表时间:2022-03
- 期刊:
- 影响因子:4.3
- 作者:MacGowan SA;Barton MI;Kutuzov M;Dushek O;van der Merwe PA;Barton GJ
- 通讯作者:Barton GJ
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Geoffrey Barton其他文献
Geoffrey Barton的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Geoffrey Barton', 18)}}的其他基金
The Dundee Resource for Sequence Analysis and Structure Prediction (DRSASP) - 2023 and Beyond
邓迪序列分析和结构预测资源 (DRSASP) - 2023 年及以后
- 批准号:
BB/X018628/1 - 财政年份:2023
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
The Jalview Resource for Sequence Analysis and Annotation
用于序列分析和注释的 Jalview 资源
- 批准号:
BB/L020742/1 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
The Dundee Resource for Protein Structure Prediction and Sequence Analysis
邓迪蛋白质结构预测和序列分析资源
- 批准号:
BB/J019364/1 - 财政年份:2013
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
The Jalview Resource for Sequence Analysis and Annotation - www.jalview.org
用于序列分析和注释的 Jalview 资源 - www.jalview.org
- 批准号:
BB/G022682/1 - 财政年份:2009
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
相似海外基金
The Dundee Resource for Sequence Analysis and Structure Prediction (DRSASP) - 2023 and Beyond
邓迪序列分析和结构预测资源 (DRSASP) - 2023 年及以后
- 批准号:
BB/X018628/1 - 财政年份:2023
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
- 批准号:
10267371 - 财政年份:2021
- 资助金额:
$ 101.28万 - 项目类别:
Development of a sequence-indexed collection of transposon tagged maize mutants as a novel public resource and its application in root developmental biology
转座子标记玉米突变体序列索引集合的开发作为新型公共资源及其在根发育生物学中的应用
- 批准号:
415165461 - 财政年份:2019
- 资助金额:
$ 101.28万 - 项目类别:
Research Grants
REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
- 批准号:
9024852 - 财政年份:2016
- 资助金额:
$ 101.28万 - 项目类别:
REDfly: The regulatory sequence resource for Drosophila and other insects
REDfly:果蝇和其他昆虫的调控序列资源
- 批准号:
9215682 - 财政年份:2016
- 资助金额:
$ 101.28万 - 项目类别:
UniProt: A centralized protein sequence and function resource
UniProt:集中的蛋白质序列和功能资源
- 批准号:
9114369 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别:
The Jalview Resource for Sequence Analysis and Annotation
用于序列分析和注释的 Jalview 资源
- 批准号:
BB/L020742/1 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别:
Research Grant
UniProt: A Protein Sequence and Function Resource for Biomedical Science
UniProt:生物医学的蛋白质序列和功能资源
- 批准号:
10267787 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别:
A Sequence-Indexed Reverse Genetics Resource for Maize: A Set of Lines with Single Ds-GFP Insertions Spread throughout the Genome
玉米的序列索引反向遗传学资源:一组具有单个 Ds-GFP 插入的品系,分布在整个基因组中
- 批准号:
1339238 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别:
Continuing Grant
UniProt: A centralized protein sequence and function resource
UniProt:集中的蛋白质序列和功能资源
- 批准号:
8739769 - 财政年份:2014
- 资助金额:
$ 101.28万 - 项目类别: