PIT-DB: A Resource for Sharing, Annotating and Analysing Translated Genomic Elements
PIT-DB:用于共享、注释和分析翻译基因组元素的资源
基本信息
- 批准号:BB/M020118/1
- 负责人:
- 金额:$ 15.64万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2015
- 资助国家:英国
- 起止时间:2015 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The publication of the human genome in 2001 was rightly hailed as a major scientific achievement, but over a decade later we are still far from a complete understanding of the structure of the genome and the role of the various elements within it. While protein coding regions of the genome were identified and used to annotate the genome soon after it was sequenced, many more exotic genomic elements have subsequently attracted interest including pseudogenes, non-coding RNAs and short open reading frames (sORFs).In recent years, post-genomic bioanalytical techniques such as RNA-seq transcriptomics (which tells us which genomic elements are expressed) and mass spectrometry based proteomics (which tells us which of the expressed elements are translated into peptides or proteins) have helped refine our understanding of the human genome at a fundamental level. Just this year, two proteomics studies published in Nature caused a stir by showing that no experimental evidence could be found for the expression of several genomic elements widely accepted to code for protein, while other regions of the genome that were not previously thought to be protein coding were in fact found to produce proteins. If this is the situation for the intensively studied human genome, we must assume that the genome annotations for less studied species (so called non-model organisms) are even less accurate.We recently developed (and tested, and published) a methodology called proteomics informed by transcriptomics (PIT) that rapidly generates large numbers of genome annotations underpinned by multiple sources of experimental evidence. In PIT, every sample is analysed using both RNA-seq and proteomic mass spectrometry and the data from these two analyses integrated to provide a list of observed proteins and any other translated genomic elements (TGEs), together with the detailed transcriptomic and spectral evidence that underpins these observations. The beauty of PIT compared with traditional proteomics is that no prior sequence knowledge is needed, so novel TGEs (be they proteins or other more exotic features) can be detected. RNA-seq can be used by itself to rapidly generate genome annotations without prior knowledge, but without PIT's mass spectrometry step the confidence in these annotations is limited and there is no guarantee that transcribed elements actually get translated.In a recent BBSRC TRDF project we developed easy to use web-based software workflows, implement in the popular Galaxy platform, to process the data from PIT experiments in a repeatable way with uniformly formatted output files. This has proven very useful for answering individual biological questions, but there is currently no meaningful way to share the results of PIT experiments. In this project we propose to plug this gap by developing PIT-DB, a web-accessible database of results produced by PIT. This publicly available database will immediately be populated with data from experiments conducted on various species at the University of Bristol, but other groups will be actively encouraged to submission their own data.Having data from multiple PIT experiments in one database will deliver exciting new scientific insights. As well as simply allowing researchers to share their results from individual PIT experiments, PIT-DB will pool information about individual novel TGEs from multiple experiments so evidence can be accumulated for each individual TGE. Improving the quality of results by using data from replicate experiments is a fundamental concept in science and the utility of doing this on a community-wide basis has been repeatedly demonstrated by other bioinformatics databases such as Ensembl, UniProt and PRIDE. As well as being of interest individually, the well evidenced TGEs in PIT-DB will provide large numbers of experimentally derived (as opposed to computationally predicted) genome annotations for all of the species for which data is present in the database.
2001年人类基因组的发表被理所当然地誉为一项重大的科学成就,但十多年过去了,我们对基因组的结构和其中各种元素的作用仍远未完全了解。虽然基因组的蛋白质编码区在测序后不久就被识别并用于注释基因组,但许多更多的外来基因组元件随后引起了人们的兴趣,包括假基因、非编码rna和短开放阅读框(sorf)。近年来,后基因组生物分析技术,如RNA-seq转录组学(它告诉我们哪些基因组元件被表达)和基于质谱的蛋白质组学(它告诉我们哪些表达的元件被翻译成肽或蛋白质),已经帮助我们在基础水平上完善了我们对人类基因组的理解。就在今年,发表在《自然》(Nature)杂志上的两项蛋白质组学研究引起了轰动,因为它们表明,没有找到实验证据证明被广泛接受为蛋白质编码的几个基因组元素的表达,而基因组中以前不被认为是蛋白质编码的其他区域实际上被发现可以产生蛋白质。如果这是被深入研究的人类基因组的情况,我们必须假设对研究较少的物种(所谓的非模式生物)的基因组注释更不准确。我们最近开发(并测试并发表)了一种称为转录组学(PIT)的蛋白质组学方法,该方法可以快速生成大量的基因组注释,并得到多种实验证据来源的支持。在PIT中,每个样本都使用RNA-seq和蛋白质组质谱分析,并将这两种分析的数据整合在一起,提供观察到的蛋白质和任何其他翻译基因组元件(TGEs)的列表,以及支持这些观察结果的详细转录组学和光谱证据。与传统的蛋白质组学相比,PIT的优点在于不需要事先了解序列,因此可以检测到新的tge(无论是蛋白质还是其他更奇特的特征)。RNA-seq本身可以在没有先验知识的情况下快速生成基因组注释,但没有PIT的质谱步骤,对这些注释的信心是有限的,并且不能保证转录的元件实际上被翻译。在最近的BBSRC TRDF项目中,我们开发了易于使用的基于web的软件工作流,在流行的Galaxy平台上实现,以可重复的方式处理PIT实验数据,并使用统一格式的输出文件。事实证明,这对于回答个别生物学问题非常有用,但目前还没有有意义的方法来分享PIT实验的结果。在这个项目中,我们建议通过开发PIT- db来填补这一空白,这是一个可以通过网络访问的PIT结果数据库。这个公开可用的数据库将立即被布里斯托尔大学对不同物种进行的实验数据填充,但其他团体将积极鼓励提交他们自己的数据。在一个数据库中拥有来自多个PIT实验的数据将提供令人兴奋的新的科学见解。除了简单地允许研究人员分享他们从单个PIT实验中获得的结果外,PIT- db还将汇集来自多个实验的单个新颖TGE的信息,以便为每个TGE积累证据。通过使用来自重复实验的数据来提高结果的质量是科学中的一个基本概念,其他生物信息学数据库(如Ensembl、UniProt和PRIDE)反复证明了在社区范围内这样做的效用。除了对个体感兴趣外,PIT-DB中充分证明的TGEs将为数据库中存在数据的所有物种提供大量实验衍生(而不是计算预测)的基因组注释。
项目成果
期刊论文数量(4)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Novel application of heuristic optimisation enables the creation and thorough evaluation of robust support vector machine ensembles for machine learning applications.
- DOI:10.1007/s11306-015-0894-4
- 发表时间:2016
- 期刊:
- 影响因子:0
- 作者:Chatzimichali EA;Bessant C
- 通讯作者:Bessant C
Proteomics technique opens new frontiers in mobilome research.
- DOI:10.1080/2159256x.2017.1362494
- 发表时间:2017
- 期刊:
- 影响因子:0
- 作者:Davidson AD;Matthews DA;Maringer K
- 通讯作者:Maringer K
Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti.
蛋白质组学通过转录组学告知,以表征伊蚊中的主动转座元件和基因组注释。
- DOI:10.1186/s12864-016-3432-5
- 发表时间:2017-01-19
- 期刊:
- 影响因子:4.4
- 作者:Maringer K;Yousuf A;Heesom KJ;Fan J;Lee D;Fernandez-Sesma A;Bessant C;Matthews DA;Davidson AD
- 通讯作者:Davidson AD
PITDB: a database of translated genomic elements.
- DOI:10.1093/nar/gkx906
- 发表时间:2018-01-04
- 期刊:
- 影响因子:14.9
- 作者:Saha S;Chatzimichali EA;Matthews DA;Bessant C
- 通讯作者:Bessant C
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Conrad Bessant其他文献
Deriving Meaningful Aspects of Health Related to Physical Activity in Chronic Disease: Concept Elicitation Using Machine Learning–Assisted Coding of Online Patient Conversations
- DOI:
10.1016/j.jval.2023.01.022 - 发表时间:
2023-07-01 - 期刊:
- 影响因子:
- 作者:
Bill Byrom;Conrad Bessant;Fabrizio Smeraldi;Maryam Abdollahyan;Yasemin Bridges;Marzana Chowdhury;Asiyya Tahsin - 通讯作者:
Asiyya Tahsin
Conrad Bessant的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Conrad Bessant', 18)}}的其他基金
Proteomics Goes Viral: Novel Resources for Identification and Quantification of Virus Proteins
蛋白质组学病毒式传播:用于病毒蛋白鉴定和定量的新资源
- 批准号:
BB/L018438/1 - 财政年份:2014
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
An Integrated Open Source Software Resource for Quantitative Proteomics
用于定量蛋白质组学的集成开源软件资源
- 批准号:
BB/I001131/2 - 财政年份:2013
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
Galaxy Workflows for Proteomics Informed by Transcriptomics (PIT)
Galaxy 转录组学蛋白质组学工作流程 (PIT)
- 批准号:
BB/K016075/1 - 财政年份:2013
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
An Integrated Open Source Software Resource for Quantitative Proteomics
用于定量蛋白质组学的集成开源软件资源
- 批准号:
BB/I001131/1 - 财政年份:2010
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
X-tracker: a generic quantitation tool for MS-based proteomics:
X-tracker:基于 MS 的蛋白质组学通用定量工具:
- 批准号:
BB/F016107/1 - 财政年份:2008
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
Further Development of the Genome Annotating Proteomic Pipeline
基因组注释蛋白质组管道的进一步发展
- 批准号:
BB/E01237X/1 - 财政年份:2007
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
Bioinformatics for High Throughput Proteomics (Short Course)
高通量蛋白质组学生物信息学(短期课程)
- 批准号:
BB/D007216/1 - 财政年份:2006
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
相似国自然基金
有氧运动介导Wnt/β-catenin 信号通路抑制神经炎症改善db/db小鼠认知功能障碍的作用与机制
- 批准号:2025JJ70587
- 批准年份:2025
- 资助金额:0.0 万元
- 项目类别:省市级项目
黄精多糖调控 AR 和 STAT-Snail 通路重塑前
列腺上皮细胞表型改善 db/db 小鼠糖尿病性
BPH
- 批准号:TGY24H050012
- 批准年份:2024
- 资助金额:0.0 万元
- 项目类别:省市级项目
DB白矮星形成的新通道及其吸积污染的研究
- 批准号:12163005
- 批准年份:2021
- 资助金额:37 万元
- 项目类别:地区科学基金项目
ROS/TRPM2/Calpain介导自噬流受阻在布比卡因加重db/db小鼠神经损伤中的作用机制
- 批准号:81974187
- 批准年份:2019
- 资助金额:55.0 万元
- 项目类别:面上项目
过氧化还原酶4抑制剂Db2337通过ROS/ERs通路诱导肺癌细胞焦亡的作用和机制研究
- 批准号:81903074
- 批准年份:2019
- 资助金额:20.5 万元
- 项目类别:青年科学基金项目
运动联合二甲双胍“错峰”干预对db/db小鼠肝脏糖脂代谢的分子机制研究
- 批准号:
- 批准年份:2019
- 资助金额:0.0 万元
- 项目类别:省市级项目
0-dB功率回退超高速QAM发射机硅基芯片集成技术研究
- 批准号:
- 批准年份:2019
- 资助金额:10.0 万元
- 项目类别:省市级项目
基于补气固摄理论探讨芪参益气滴丸对db/db小鼠蛋白尿的治疗作用
- 批准号:81803858
- 批准年份:2018
- 资助金额:21.0 万元
- 项目类别:青年科学基金项目
MiR-320通过抑制PI3K-AKt-CREB信号通路调控糖尿病db/db小鼠认知功能障碍的研究
- 批准号:2018JJ2347
- 批准年份:2018
- 资助金额:0.0 万元
- 项目类别:省市级项目
微流体芯片应用于Db轻同族元素Nb,Ta的萃取方法研究
- 批准号:11705243
- 批准年份:2017
- 资助金额:23.0 万元
- 项目类别:青年科学基金项目
相似海外基金
4K高速カメラによる中小スパン橋梁の全視野センシング及びインフラ維持管理DBの構築
利用4K高速摄像机对中小跨桥梁进行全场感知并构建基础设施维护管理数据库
- 批准号:
23K26177 - 财政年份:2024
- 资助金额:
$ 15.64万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
高齢者の歯科受診の状況と影響:多国間比較研究と介護DBを用いた施設介入効果の検証
老年人看牙医的现状和影响:利用跨国比较研究和护理数据库验证设施干预效果
- 批准号:
24KJ0065 - 财政年份:2024
- 资助金额:
$ 15.64万 - 项目类别:
Grant-in-Aid for JSPS Fellows
Targeting non-small cell lung cancer with personalised doggybone DNA vaccines (db-PCV)
使用个性化狗骨 DNA 疫苗 (db-PCV) 靶向非小细胞肺癌
- 批准号:
MR/X030342/1 - 财政年份:2023
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
Analysing the effectiveness of the DB Gold cylinder passivation treatment using ultra-low amount fraction gas stability studies
使用超低含量分数气体稳定性研究分析 DB Gold 气缸钝化处理的有效性
- 批准号:
10039291 - 财政年份:2022
- 资助金额:
$ 15.64万 - 项目类别:
Collaborative R&D
Design, build and test, scaled prototype of an electro-mechanical Superplastic Forming/Diffusion Bonding (SPF/DB) Press
设计、构建和测试机电超塑性成型/扩散粘合 (SPF/DB) 压力机的缩放原型
- 批准号:
10035570 - 财政年份:2022
- 资助金额:
$ 15.64万 - 项目类别:
BEIS-Funded Programmes
NDBと介護DBに対応したAI-ベイジアンネット型の持続的リスク評価システム構築
构建兼容NDB和护理DB的基于AI-贝叶斯网络的持续风险评估系统
- 批准号:
21K12708 - 财政年份:2021
- 资助金额:
$ 15.64万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Effects of acute exercise on insulin secretion capacity in db/db mice
急性运动对db/db小鼠胰岛素分泌能力的影响
- 批准号:
20K19636 - 财政年份:2020
- 资助金额:
$ 15.64万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
ゼニゴケゲノムDBの拡張開発と仮想化技術を応用した高可搬性ゲノムDBの開発
利用虚拟化技术扩展地草基因组数据库并开发高度可移植的基因组数据库
- 批准号:
20K15783 - 财政年份:2020
- 资助金额:
$ 15.64万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
PhosphoX-db: A web-based bioinformatics platform for studying non-canonical phosphorylation
PhosphoX-db:用于研究非规范磷酸化的基于网络的生物信息学平台
- 批准号:
BB/R02216X/1 - 财政年份:2018
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant
A Chemical Imaging Platform for Discovery Biosciences (CIP-DB).
Discovery Biosciences 的化学成像平台 (CIP-DB)。
- 批准号:
BB/R013667/1 - 财政年份:2018
- 资助金额:
$ 15.64万 - 项目类别:
Research Grant