Development of integrated web interfaces for Bioconductor genomic data analysis annotation and visualization tools

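Basic information

  • Grant number:
    BB/E001653/1
  • Principal investigator:
  • Amount:
    $115,500
  • Host institution:
  • Host institution country:
    United Kingdom
  • Project category:
    Research Grant
  • Fiscal year:
    2006
  • Funding country:
    United Kingdom
  • Start and end dates:
    2006 to (no data)
  • Project status:
    Completed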

Project abstract

Genomic data, particularly from microarray expression profiling studies, come in the form of huge matrices of numbers, anywhere from 10,000 to 6,000,000 rows by hundreds to thousands of columns. These data need to be transformed, standardized, visualized, and annotated. The rows of these matrices report the activity (expression levels) of genes under various conditions. The huge data volume, as well as the complexity involved in describing such experimental data, has led to the creation of a few major public repositories for array-based high-throughput genomics data: GEO (NCBI, USA) and ArrayExpress (EBI, Cambridge, UK). Our group at the EBI has also developed Expression Profiler (EP), a web-based platform for exploratory data analysis, which can provide some basic insights into the public data in ArrayExpress. The scientific community's work on tools for dealing with such large-scale data has concentrated on a set of open-source, command-line-driven tools collectively called Bioconductor. These tools, or 'packages', are developed by leaders in specialized areas of application: normalization (mathematical methods of making data from different laboratories comparable), signalling pathway analysis, clustering analysis, meta-analysis, etc., and are therefore the de facto standard for cutting-edge functional genomics analysis. At the same time, by and large the only users of Bioconductor remain sophisticated bioinformaticians, while wet-lab biologists (the experimentalists who produce the actual data) find the learning curve of the R environment too steep, the R language too complex to master, and the details of command-line flexibility too daunting. Moreover, even within Bioconductor, different packages offer different, often incompatible, paradigms for data input, output and interchange.
There is a clear need to give biologists involved in functional genomics and proteomics experimental research easy access to the power of Bioconductor. This project proposes to utilise the EP analytical framework to develop a set of standard web-based interfaces, with a unified look and feel, to core Bioconductor modules; these interfaces will also make use of the ArrayExpress database. The proposed system will enable biologists to securely upload their experimental data, analyse them with the best available Bioconductor algorithms, and compare or analyse them together with related public high-throughput data in the repository. The data analysis routines will take advantage of the high-power computing infrastructure available at the EBI, and the results will be stored within the system, accessible from anywhere in the world via a web browser. A further unique advantage is provided by the integration of Bioconductor packages within a set of web interfaces: the interfaces can also be accessed as Web Services, i.e. they can be incorporated into automatic data analysis workflows. In other words, even sophisticated bioinformaticians are likely to find this system useful (see attached letters of support).

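Project outputs

Number of journal articles (7)
Number of monographs (0)
Number of research awards (0)
Number of conference papers (0)
Number of patents (0)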
Gene Expression Atlas update--a value-added database of microarray and sequencing-based functional genomics experiments.
  • DOI:
    10.1093/nar/gkr913
  • Publication date:
    2012-01
  • Journal:
  • Impact factor:
    14.9
  • Authors:
    Kapushesky M;Adamusiak T;Burdett T;Culhane A;Farne A;Filippov A;Holloway E;Klebanov A;Kryvych N;Kurbatova N;Kurnosov P;Malone J;Melnichuk O;Petryszak R;Pultsin N;Rustici G;Tikhonov A;Travillian RS;Williams E;Zorin A;Parkinson H;Brazma A
  • Corresponding author:
    Brazma A
A pipeline for RNA-seq data processing and quality assessment.
  • DOI:
    10.1093/bioinformatics/btr012
  • Publication date:
    2011-03-15
  • Journal:
  • Impact factor:
    0
  • Authors:
    Goncalves A;Tikhonov A;Brazma A;Kapushesky M
  • Corresponding author:
    Kapushesky M
ArrayExpress update--trends in database growth and links to data analysis tools.
  • DOI:
    10.1093/nar/gks1174
  • Publication date:
    2013-01
  • Journal:
  • Impact factor:
    14.9
  • Authors:
    Rustici G;Kolesnikov N;Brandizi M;Burdett T;Dylag M;Emam I;Farne A;Hastings E;Ison J;Keays M;Kurbatova N;Malone J;Mani R;Mupo A;Pedro Pereira R;Pilicheva E;Rung J;Sharma A;Tang YA;Ternent T;Tikhonov A;Welter D;Williams E;Brazma A;Parkinson H;Sarkans U
  • Corresponding author:
    Sarkans U
REMBI: Recommended Metadata for Biological Images-enabling reuse of microscopy data in biology.
  • DOI:
    10.1038/s41592-021-01166-8
  • Publication date:
    2021-12
  • Journal:
  • Impact factor:
    48
  • Authors:
    Sarkans U;Chiu W;Collinson L;Darrow MC;Ellenberg J;Grunwald D;Hériché JK;Iudin A;Martins GG;Meehan T;Narayan K;Patwardhan A;Russell MRG;Saibil HR;Strambio-De-Castillia C;Swedlow JR;Tischer C;Uhlmann V;Verkade P;Barlow M;Bayraktar O;Birney E;Catavitello C;Cawthorne C;Wagner-Conrad S;Duke E;Paul-Gilloteaux P;Gustin E;Harkiolaki M;Kankaanpää P;Lemberger T;McEntyre J;Moore J;Nicholls AW;Onami S;Parkinson H;Parsons M;Romanchikova M;Sofroniew N;Swoger J;Utz N;Voortman LM;Wong F;Zhang P;Kleywegt GJ;Brazma A
  • Corresponding author:
    Brazma A

Other publications by Alvis Brazma

Reuse of public genome-wide gene expression data
  • DOI:
    10.1038/nrg3394
  • Publication date:
    2012-12-27
  • Journal:
  • Impact factor:
    52.000
  • Authors:
    Johan Rung;Alvis Brazma
  • Corresponding author:
    Alvis Brazma
Transparency and reproducibility in artificial intelligence
  • DOI:
    10.1038/s41586-020-2766-y
  • Publication date:
    2020-10-14
  • Journal:
  • Impact factor:
    48.500
  • Authors:
    Benjamin Haibe-Kains;George Alexandru Adam;Ahmed Hosny;Farnoosh Khodakarami;Levi Waldron;Bo Wang;Chris McIntosh;Anna Goldenberg;Anshul Kundaje;Casey S. Greene;Tamara Broderick;Michael M. Hoffman;Jeffrey T. Leek;Keegan Korthauer;Wolfgang Huber;Alvis Brazma;Joelle Pineau;Robert Tibshirani;Trevor Hastie;John P. A. Ioannidis;John Quackenbush;Hugo J. W. L. Aerts
  • Corresponding author:
    Hugo J. W. L. Aerts
Alleviating batch effects in cell type deconvolution with SCCAF-D
  • DOI:
    10.1038/s41467-024-55213-x
  • Publication date:
    2024-12-30
  • Journal:
  • Impact factor:
    15.700
  • Authors:
    Shuo Feng;Liangfeng Huang;Anna Vathrakokoili Pournara;Ziliang Huang;Xinlu Yang;Yongjian Zhang;Alvis Brazma;Ming Shi;Irene Papatheodorou;Zhichao Miao
  • Corresponding author:
    Zhichao Miao
Standards for systems biology
  • DOI:
    10.1038/nrg1922
  • Publication date:
    2006-08-01
  • Journal:
  • Impact factor:
    52.000
  • Authors:
    Alvis Brazma;Maria Krestyaninova;Ugis Sarkans
  • Corresponding author:
    Ugis Sarkans
Visualization of large microarray experiments with space maps
  • DOI:
    10.1186/1471-2105-10-s13-o7
  • Publication date:
    2009-10-19
  • Journal:
  • Impact factor:
    3.300
  • Authors:
    Nils Gehlenborg;Alvis Brazma
  • Corresponding author:
    Alvis Brazma

Other grants held by Alvis Brazma

BioStudies and the Image Data Resource: Expanding Imaging Datasets, Linkage, Metadata, and Value
  • Grant number:
    BB/R015082/1
  • Fiscal year:
    2018
  • Funding amount:
    $115,500
  • Project category:
    Research Grant
VBO - A Tool for Bridging Vertebrate Anatomy Ontologies
  • Grant number:
    BB/G022755/1
  • Fiscal year:
    2010
  • Funding amount:
    $115,500
  • Project category:
    Research Grant
MICheckout: Supporting compliance with consensus reporting requirements
  • Grant number:
    BB/G000638/1
  • Fiscal year:
    2009
  • Funding amount:
    $115,500
  • Project category:
    Research Grant

Similar grants from the National Natural Science Foundation of China

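Greenwashing behavior in China: Based on an integrated view of reconfiguration of environmental authority and decoupling logic
  • Grant number:
  • Approval year:
    2024
  • Funding amount:
    (no data)
  • Project category:
    Research Fund for International Scientists
Construction of an integrated-mode and fine-grained behavioral evaluation system for mouse models of anxiety disorder
  • Grant number:
  • Approval year:
    2024
  • Funding amount:
    CNY 0
  • Project category:
    Provincial or municipal project
Development of a HER2-specific dual-epitope-recognizing theranostic probe and preclinical study of its diagnostic and therapeutic efficacy
  • Grant number:
    82372014
  • Approval year:
    2023
  • Funding amount:
    CNY 480,000
  • Project category:
    General Program
Theoretical study of the integrated optimal design of urban stormwater pipe networks based on a Bayesian network reliability evolution model
  • Grant number:
    51008191
  • Approval year:
    2010
  • Funding amount:
    CNY 200,000
  • Project category:
    Young Scientists Fund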

Similar overseas grants

DEVELOPMENT OF A FULLY-INTEGRATED CHATBOT TO IMPROVE MEMBER ENGAGEMENT IN ONLINE ADDICTION TREATMENT PROGRAMS
  • Grant number:
    10261142
  • Fiscal year:
    2021
  • Funding amount:
    $115,500
  • Project category:
Integrated Global Health on Child Health and Development
  • Grant number:
    10416018
  • Fiscal year:
    2020
  • Funding amount:
    $115,500
  • Project category:
Integrated Global Health on Child Health and Development
  • Grant number:
    10627816
  • Fiscal year:
    2020
  • Funding amount:
    $115,500
  • Project category:
Development and evaluation of integrated Web support tool that promotes positive coping and life reconstruction of HIV-positive people
  • Grant number:
    17H02168
  • Fiscal year:
    2017
  • Funding amount:
    $115,500
  • Project category:
    Grant-in-Aid for Scientific Research (B)
Development and Usability Testing of HEARTPAIN: An Integrated Smartphone and Web-Based Intervention for Women with Cardiac Pain
  • Grant number:
    369382
  • Fiscal year:
    2017
  • Funding amount:
    $115,500
  • Project category:
    Operating Grants
Swift.ai: research and development of an integrated platform for machine-assisted research synthesis
  • Grant number:
    10428382
  • Fiscal year:
    2017
  • Funding amount:
    $115,500
  • Project category:
Swift.ai: research and development of an integrated platform for machine-assisted research synthesis
  • Grant number:
    10259172
  • Fiscal year:
    2017
  • Funding amount:
    $115,500
  • Project category:
Disseminating Child and Youth Mental Health Practice Guidelines: The Development of a User-Informed, Social Media Integrated, Mobile Website
  • Grant number:
    343428
  • Fiscal year:
    2016
  • Funding amount:
    $115,500
  • Project category:
    Miscellaneous Programs
Development of an integrated pipeline of ortholog identification for comparative genome analyses of vertebrates
  • Grant number:
    15K07172
  • Fiscal year:
    2015
  • Funding amount:
    $115,500
  • Project category:
    Grant-in-Aid for Scientific Research (C)