Keeping pace with protein sequence annotation; consolidating and enhancing Pfam and InterPro's methodologies for functional prediction
与蛋白质序列注释保持同步;
基本信息
- 批准号:BB/L024136/1
- 负责人:
- 金额:$ 69.49万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2014
- 资助国家:英国
- 起止时间:2014 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
New technologies, developed in the last few years, have greatly increased the amount of biological sequence information that it is possible for laboratories to produce. As a result, there is now a very large and ever-growing amount of sequence data entering public databases. The overwhelming majority of these sequences have not been examined by scientists, nor is there any experimental information to suggest what their function might be. The Pfam and InterPro resources help plug this gap, using probabilistic models to predict the function of proteins by examining their amino acid sequences. Pfam is arguably the most well-known and one of the largest producers of such models. InterPro, meanwhile, does not produce models directly, but takes them from Pfam and 10 other complementary databases, integrating them together and adding functional information. InterPro is regularly run against the full contents of the main public repository for protein sequences, the UniProt Knowledgebase, so that its functional predictions can be transferred.In order that InterPro and Pfam can continue to cover the growing number of sequences and remain accurate in their predictions, new models need to be made and integrated, existing models need to be checked and the proteins that they match evaluated. One aim of the project is to support this effort. Another aim is to look at other prediction methods, not currently used by either Pfam or InterPro, that identify the individual amino acids in a protein sequence that are responsible for the protein's functions. We will add this functionality to the resources and use it to make their predictions more accurate. This will in turn improve the quality of information associated with large numbers of proteins in the UniProt Knowledgebase. Adding to the resources in this way will require changes to some of the underlying software. At the same time, we will update the InterPro and Pfam web sites, so that users can easily see the new and improved data, and understand what it means. Finally, we will prepare and organise training materials and courses to introduce new users to the resources and educate existing users about the new and updated features.
过去几年发展的新技术极大地增加了实验室可能产生的生物序列信息量。因此,现在有非常大量且不断增长的序列数据进入公共数据库。这些序列中的绝大多数还没有被科学家研究过,也没有任何实验信息表明它们的功能可能是什么。Pfam和InterPro资源有助于填补这一空白,它们使用概率模型通过检查蛋白质的氨基酸序列来预测蛋白质的功能。Pfam可以说是这类车型最知名、最大的生产商之一。与此同时,InterPro并不直接制作模型,而是从Pfam和其他10个互补的数据库中提取模型,将它们集成在一起,并添加功能信息。InterPro定期根据蛋白质序列的主要公共存储库-UniProt知识库的全部内容运行,以便其功能预测可以转移。为了使InterPro和Pfam能够继续涵盖不断增长的序列数量并保持预测的准确性,需要建立新的模型并进行集成,需要检查现有模型并评估它们匹配的蛋白质。该项目的一个目标是支持这一努力。另一个目标是寻找目前Pfam或InterPro都没有使用的其他预测方法,这些方法可以识别蛋白质序列中负责蛋白质功能的个别氨基酸。我们将向资源添加此功能,并使用它来使他们的预测更准确。这将反过来提高与UniProt知识库中大量蛋白质相关的信息质量。以这种方式添加资源将需要对一些底层软件进行更改。同时,我们将更新InterPro和Pfam的网站,以便用户可以轻松地看到新的和改进的数据,并了解其含义。最后,我们将准备和组织培训材料和课程,向新用户介绍这些资源,并向现有用户介绍新功能和最新功能。
项目成果
期刊论文数量(10)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam.
- DOI:10.1093/bib/bbu053
- 发表时间:2015-09
- 期刊:
- 影响因子:9.5
- 作者:Chiang Z;Vastermark A;Punta M;Coggill PC;Mistry J;Finn RD;Saier MH Jr
- 通讯作者:Saier MH Jr
The Pfam protein families database: towards a more sustainable future.
- DOI:10.1093/nar/gkv1344
- 发表时间:2016-01-04
- 期刊:
- 影响因子:14.9
- 作者:Finn RD;Coggill P;Eberhardt RY;Eddy SR;Mistry J;Mitchell AL;Potter SC;Punta M;Qureshi M;Sangrador-Vegas A;Salazar GA;Tate J;Bateman A
- 通讯作者:Bateman A
InterPro in 2017-beyond protein family and domain annotations.
- DOI:10.1093/nar/gkw1107
- 发表时间:2017-01-04
- 期刊:
- 影响因子:14.9
- 作者:Finn RD;Attwood TK;Babbitt PC;Bateman A;Bork P;Bridge AJ;Chang HY;Dosztányi Z;El-Gebali S;Fraser M;Gough J;Haft D;Holliday GL;Huang H;Huang X;Letunic I;Lopez R;Lu S;Marchler-Bauer A;Mi H;Mistry J;Natale DA;Necci M;Nuka G;Orengo CA;Park Y;Pesseat S;Piovesan D;Potter SC;Rawlings ND;Redaschi N;Richardson L;Rivoire C;Sangrador-Vegas A;Sigrist C;Sillitoe I;Smithers B;Squizzato S;Sutton G;Thanki N;Thomas PD;Tosatto SC;Wu CH;Xenarios I;Yeh LS;Young SY;Mitchell AL
- 通讯作者:Mitchell AL
Gene Ontology Consortium: going forward.
- DOI:10.1093/nar/gku1179
- 发表时间:2015-01
- 期刊:
- 影响因子:14.9
- 作者:Gene Ontology Consortium
- 通讯作者:Gene Ontology Consortium
The Gene Ontology resource: enriching a GOld mine.
- DOI:10.1093/nar/gkaa1113
- 发表时间:2021-01-08
- 期刊:
- 影响因子:14.9
- 作者:Gene Ontology Consortium
- 通讯作者:Gene Ontology Consortium
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Alex Bateman其他文献
Bioinformatics Applications Note Databases and Ontologies Codex: Exploration of Semantic Changes between Ontology Versions
生物信息学应用笔记数据库和本体法典:本体版本之间语义变化的探索
- DOI:
- 发表时间:
- 期刊:
- 影响因子:0
- 作者:
Michael Hartung;Anika Groß;E. Rahm;Alex Bateman - 通讯作者:
Alex Bateman
Bioinformatics Advance Access published May 31, 2007
生物信息学高级访问发表于 2007 年 5 月 31 日
- DOI:
10.1007/s10015-009-0735-5 - 发表时间:
2007 - 期刊:
- 影响因子:0.9
- 作者:
Alex Bateman - 通讯作者:
Alex Bateman
Alex Bateman的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Alex Bateman', 18)}}的其他基金
Improving accuracy, coverage, and sustainability of functional protein annotation in InterPro, Pfam and FunFam using Deep Learning methods
使用深度学习方法提高 InterPro、Pfam 和 FunFam 中功能蛋白注释的准确性、覆盖范围和可持续性
- 批准号:
BB/X018660/1 - 财政年份:2024
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
UKRI/BBSRC-NSF/BIO: Unifying Pfam protein sequence and ECOD structural classifications with structure models
UKRI/BBSRC-NSF/BIO:通过结构模型统一 Pfam 蛋白质序列和 ECOD 结构分类
- 批准号:
BB/X012492/1 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
Exploiting data driven computational approaches for understanding protein structure and function in InterPro and Pfam
利用数据驱动的计算方法来理解 InterPro 和 Pfam 中的蛋白质结构和功能
- 批准号:
BB/S020381/1 - 财政年份:2019
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
Rfam: The community resource for RNA families
Rfam:RNA 家族的社区资源
- 批准号:
BB/S020462/1 - 财政年份:2019
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
RNAcentral, the RNA sequence database
RNAcentral,RNA 序列数据库
- 批准号:
BB/N019199/1 - 财政年份:2017
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
Rfam: Towards a sustainable resource for understanding the genomic functional ncRNA repertoire
Rfam:寻找了解基因组功能 ncRNA 库的可持续资源
- 批准号:
BB/M011690/1 - 财政年份:2015
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
The RNAcentral database of non-coding RNAs
非编码RNA的RNA中央数据库
- 批准号:
BB/J019232/1 - 财政年份:2012
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
Embracing new technologies to streamline improve and sustain InterPro and its contributing databases
采用新技术来简化、改进和维护 InterPro 及其贡献数据库
- 批准号:
BB/F010435/1 - 财政年份:2008
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
相似海外基金
M-PACE: Establishing an Urban PACE towards Cultivating Healthy Diets for All Communities
M-PACE:建立城市 PACE,为所有社区培养健康饮食
- 批准号:
BB/Z514408/1 - 财政年份:2024
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
WHyGro-in-Me: Waste-based Hybrid Growing Media for PACE Horticulture using biobased polyurethane binders and biowaste filler
WHyGro-in-Me:使用生物基聚氨酯粘合剂和生物废物填料的 PACE 园艺废物基混合生长介质
- 批准号:
BB/Z514433/1 - 财政年份:2024
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
InSPACE-VT_Development and Validation of Virtual Pace Mapping to Guide Catheter Ablation of Ventricular Tachycardia
InSPACE-VT_虚拟起搏测绘的开发和验证以指导室性心动过速导管消融
- 批准号:
EP/Z001145/1 - 财政年份:2024
- 资助金额:
$ 69.49万 - 项目类别:
Fellowship
Target Infusion Project: Geospatial Data Systems-Policy Analysis Curriculum Enhancement Project (GDS-PACE)
目标注入项目:地理空间数据系统-政策分析课程增强项目(GDS-PACE)
- 批准号:
2306533 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Standard Grant
FER (H&L) AMR PACE (A-0438) grant funding agreement
费率(H
- 批准号:
107541 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Collaborative R&D
Are the current practices of dispute resolution in financial transactions keeping pace with the future of financial markets?
目前金融交易中争议解决的做法是否与金融市场的未来保持同步?
- 批准号:
2886800 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Studentship
FER (H&L) AMR PACE (A-0438) consortium agreement
费率(H
- 批准号:
107535 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Collaborative R&D
Changing the pace of change: Disability inclusion in development responses to sexual violence for women with disabilities through arts & humanities
改变变革的步伐:通过艺术将残疾纳入针对残疾妇女性暴力的发展对策
- 批准号:
AH/X009505/1 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
Research Grant
1/4-American Consortium of Early Liver Transplantation-Prospective Alcohol-associated liver disease Cohort Evaluation (ACCELERATE-PACE)
1/4-美国早期肝移植联盟-前瞻性酒精相关性肝病队列评估(ACCELERATE-PACE)
- 批准号:
10711811 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别:
4/4-American Consortium of Early Liver Transplantation-Prospective Alcohol-associated liver disease Cohort Evaluation (ACCELERATE-PACE)
4/4-美国早期肝移植联盟-前瞻性酒精相关性肝病队列评估(ACCELERATE-PACE)
- 批准号:
10711018 - 财政年份:2023
- 资助金额:
$ 69.49万 - 项目类别: