'Omics Data Sharing: the Investigation / Study / Assay (ISA) Infrastructure
组学数据共享:调查/研究/分析 (ISA) 基础设施
基本信息
- 批准号:BB/I000771/1
- 负责人:
- 金额:$ 104.45万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2010
- 资助国家:英国
- 起止时间:2010 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
There is a pressing and recognized need in the biological domain for improved data sharing and unified access to data from a wide range of sources. The use of 'omics technologies (such as genomics, metagenomics, transcriptomics, proteomics and metabolomics) is now wide-spread and the rate at which these technologies generate data is revolutionizing the scientific landscape. This massive influx of data brings both unprecedented scientific opportunities and a range of challenges that must be met if these data, and the public investment in science that they represent, are to be fully exploited. While there are many obstacles to overcome if we are to realize large-scale multi-omic data sharing at the community level, solutions are now possible due to the activities of a range of grass-roots standardisation projects including the 'Minimum Information for Biological and Biomedical Investigations' (MIBBI) project (http://mibbi.org/) and the Open Biological Ontologies (OBO) Foundry (http://obofoundry.org/). We propose to make more widely available our 'omics data sharing software based on the 'Investigation / Study / Assay' (ISA) concept (http://isatab.sf.net). The ISA concept allows the description of any 'Investigation' comprising one or more 'Studies' in which biological samples have been studied using one or more 'Assays' (technologies). The ISA concept is supported by the MIBBI community and has been used to structure a universal file format, ISA-Tab. The ISA-Tab file format leverages biologists' familiarity with, and trust of spreadsheet-based input and manipulation of information. Descriptive experimental information (metadata) captured in ISA-Tab format is made compliant with MIBBI-registered standards (for transcriptomics, MIAME; for proteomics, MIAPE; and for genomics, MIGS/MIMS) using pre-defined extensions. ISA-Tab can be configured to hold additional fields allowing users to comply with emerging standards as well. The availability of this universal file format has enabled the creation of a set of tools and a database to hold data sets captured in it. The current pilot-stage ISA Infrastructure provides a complete solution for managing multi-omic metadata at the community level. A core aspect of the design of the ISA Infrastructure is its integral use of OBO Foundry ontologies to describe investigations, rendering data descriptions unambiguous and computationally accessible. In the course of this proposed project, we will extend the current ISA Infrastructure implementation and work with identified research communities and their bioinformatic service providers to set up 'ISA Networks' in the UK and around the globe, covering a wide range of data types. These portals will serve as 'one-stop shops' for the aggregation and display of relevant datasets at the community level. The metadata captured will support searching and data discovery across organisms, technologies and data types. The shared use of minimum information standards, ontologies and a single file format will support exchange of data between communities and the transfer of data to and from public repositories. At the international level, we will work closely with the MIBBI and OBO Foundry communities to further unify MIBBI checklists and OBO Foundry ontologies to support descriptions of multi-omic investigations. The development of the ISA Infrastructure must be consensus-driven and is therefore best developed under the auspices of an international working group. We will therefore formalise the collaboration between ISA Networks and work within the data standardisation community to increase linkages between currently separated groups by launching the BioSharing Consortium (http://biosharing.org).
在生物学领域,迫切需要改进数据共享和统一获取来自广泛来源的数据。组学技术(如基因组学、宏基因组学、转录组学、蛋白质组学和代谢组学)的使用现在已经广泛传播,这些技术生成数据的速度正在彻底改变科学格局。大量数据的涌入既带来了前所未有的科学机遇,也带来了一系列挑战,如果要充分利用这些数据及其所代表的公共科学投资,就必须应对这些挑战。如果我们要在社区一级实现大规模的多组学数据共享,还有许多障碍需要克服,但由于一系列基层标准化项目的活动,包括“生物和生物医学调查最低信息”(MIBBI)项目(http://mibbi.org/)和开放生物本体论(OBO)铸造(http://obofoundry.org/),解决方案现在是可能的。我们建议更广泛地提供基于“调查/研究/分析”(伊萨)概念的“组学数据共享软件”(http:isatab.sf.net)。伊萨概念允许描述任何“调查”,包括一项或多项“研究”,其中使用一项或多项“测定”(技术)对生物样本进行研究。伊萨的概念得到了MIBBI社区的支持,并被用于构建一种通用的文件格式ISA-Tab。ISA-Tab文件格式利用了生物学家对基于电子表格的信息输入和操作的熟悉和信任。使用预定义的扩展,以ISA-Tab格式捕获的描述性实验信息(元数据)符合MIBBI注册标准(转录组学,MIAME;蛋白质组学,MIAPE;基因组学,MIGS/MIMS)。ISA-Tab可以配置为包含额外的字段,允许用户遵守新兴的标准。这种通用文件格式的可用性使得能够创建一套工具和一个数据库来保存在其中捕获的数据集。目前的试点阶段伊萨基础设施提供了一个完整的解决方案,用于在社区层面管理多组元数据。伊萨基础设施设计的一个核心方面是其整体使用OBO Foundry本体来描述调查,使数据描述明确且可计算访问。在这个拟议项目的过程中,我们将扩大目前的伊萨基础设施的实施,并与确定的研究团体及其生物信息学服务提供商合作,在英国和地球仪周围建立“伊萨网络”,覆盖广泛的数据类型。这些门户网站将作为“一站式商店”,在社区一级汇集和展示相关数据集。捕获的元数据将支持跨生物体、技术和数据类型的搜索和数据发现。共享使用最低信息标准、本体和单一文件格式将支持社区之间的数据交换以及公共储存库之间的数据转移。在国际层面,我们将与MIBBI和OBO Foundry社区密切合作,进一步统一MIBBI清单和OBO Foundry本体,以支持多组学研究的描述。伊萨基础设施的发展必须以共识为驱动力,因此最好在国际工作组的主持下发展。因此,我们将正式确定伊萨网络之间的合作,并在数据标准化社区内开展工作,通过启动BioSharing Consortium(http://www.example.com)来增加目前分离的团体之间的联系。biosharing.org
项目成果
期刊论文数量(10)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
COPO - bridging the gap from data to publication in plant science
COPO - 弥合植物科学从数据到出版的差距
- DOI:10.7490/f1000research.1111380.1
- 发表时间:2016
- 期刊:
- 影响因子:0
- 作者:Anthony Etuk
- 通讯作者:Anthony Etuk
Consent insufficient for data release-Response.
同意不足以发布数据-响应。
- DOI:10.1126/science.aax7509
- 发表时间:2019
- 期刊:
- 影响因子:0
- 作者:Amann RI
- 通讯作者:Amann RI
Modeling biomedical experimental processes with OBI.
- DOI:10.1186/2041-1480-1-s1-s7
- 发表时间:2010-06-22
- 期刊:
- 影响因子:1.9
- 作者:Brinkman RR;Courtot M;Derom D;Fostel JM;He Y;Lord P;Malone J;Parkinson H;Peters B;Rocca-Serra P;Ruttenberg A;Sansone SA;Soldatova LN;Stoeckert CJ Jr;Turner JA;Zheng J;OBI consortium
- 通讯作者:OBI consortium
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Susanna Sansone其他文献
The 15th Genomic Standards Consortium meeting
- DOI:
10.4056/sigs.3457 - 发表时间:
2013-01-01 - 期刊:
- 影响因子:5.400
- 作者:
Lynn Schriml;Ilene Mizrachi;Peter Sterk;Dawn Field;Lynette Hirschman;Tatiana Tatusova;Susanna Sansone;Jack Gilbert;David Schindel;Neil Davies;Chris Meyer;Folker Meyer;George Garrity;Lita Proctor;M. H. Medema;Yemin Lan;Anna Klindworth;Frank Oliver Glöckner;Tonia Korves;Antonia Gonzalez;Peter Dwayndt;Markus Göker;Anjette Johnston;Evangelos Pafilis;Susanne Schneider;K. Baker;Cynthia Parr;G. Sutton;H. H. Creasy;Nikos Kyrpides;K. Eric Wommack;Patricia L. Whetzel;Daniel Nasko;Hilmar Lapp;Takamoto Fujisawa;Adam M. Phillippy;Renzo Kottman;Judith A. Blake;Junhua Li;Elizabeth M. Glass;Petra ten Hoopen;Rob Knight;Susan Holmes;Curtis Huttenhower;Steven L. Salzberg;Bing Ma;Owen White - 通讯作者:
Owen White
Meeting Report: Metagenomics, Metadata and MetaAnalysis (M3) at ISMB 2010
会议报告:ISMB 2010 上的宏基因组学、元数据和元分析 (M3)
- DOI:
- 发表时间:
2010 - 期刊:
- 影响因子:0
- 作者:
Dawn Field;Susanna Sansone;Edward F. DeLong;P. Sterk;Iddo Friedberg;R. Kottmann;L. Hirschman;George Garrity;Guy Cochrane;J. Wooley;F. Meyer;Sarah Hunter;Owen White - 通讯作者:
Owen White
Susanna Sansone的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Susanna Sansone', 18)}}的其他基金
BioSharing and the National Bioscience Database Center - joining forces to better serve the research community worldwide
BioSharing 和国家生物科学数据库中心 - 联手更好地服务全球研究界
- 批准号:
BB/P025943/1 - 财政年份:2017
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
COpenPlantOmics (COPO): a Collaborative Bioinformatics Plant Science Platform
COpenPlantOmics (COPO):协作生物信息学植物科学平台
- 批准号:
BB/L024101/1 - 财政年份:2015
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
Establishing common standards and curation practices: towards real world biosharing.
建立共同标准和管理实践:实现现实世界的生物共享。
- 批准号:
BB/J020265/1 - 财政年份:2012
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
Building a global metagenomics portal ('MGportal') to handle next-generation sequencing data and associated metadata
建立全球宏基因组学门户(“MGportal”)来处理下一代测序数据和相关元数据
- 批准号:
BB/I025840/1 - 财政年份:2011
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
Omics Data Standards: synergy and implementations
组学数据标准:协同作用和实施
- 批准号:
BB/E025080/1 - 财政年份:2007
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
- 批准号:
- 批准年份:2020
- 资助金额:40 万元
- 项目类别:
基于Linked Open Data的Web服务语义互操作关键技术
- 批准号:61373035
- 批准年份:2013
- 资助金额:77.0 万元
- 项目类别:面上项目
Molecular Interaction Reconstruction of Rheumatoid Arthritis Therapies Using Clinical Data
- 批准号:31070748
- 批准年份:2010
- 资助金额:34.0 万元
- 项目类别:面上项目
高维数据的函数型数据(functional data)分析方法
- 批准号:11001084
- 批准年份:2010
- 资助金额:16.0 万元
- 项目类别:青年科学基金项目
染色体复制负调控因子datA在细胞周期中的作用
- 批准号:31060015
- 批准年份:2010
- 资助金额:25.0 万元
- 项目类别:地区科学基金项目
Computational Methods for Analyzing Toponome Data
- 批准号:60601030
- 批准年份:2006
- 资助金额:17.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: Frameworks: MobilityNet: A Trustworthy CI Emulation Tool for Cross-Domain Mobility Data Generation and Sharing towards Multidisciplinary Innovations
协作研究:框架:MobilityNet:用于跨域移动数据生成和共享以实现多学科创新的值得信赖的 CI 仿真工具
- 批准号:
2411152 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
CAREER: Theory and Practice of Privacy-Utility Tradeoffs in Enterprise Data Sharing
职业:企业数据共享中隐私与效用权衡的理论与实践
- 批准号:
2338772 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Continuing Grant
Collaborative Research: Frameworks: MobilityNet: A Trustworthy CI Emulation Tool for Cross-Domain Mobility Data Generation and Sharing towards Multidisciplinary Innovations
协作研究:框架:MobilityNet:用于跨域移动数据生成和共享以实现多学科创新的值得信赖的 CI 仿真工具
- 批准号:
2411153 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
Judicial Decision Data Gathering, Encoding and Sharing
司法决策数据收集、编码和共享
- 批准号:
EP/Y035992/1 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Research Grant
Collaborative Research: Frameworks: MobilityNet: A Trustworthy CI Emulation Tool for Cross-Domain Mobility Data Generation and Sharing towards Multidisciplinary Innovations
协作研究:框架:MobilityNet:用于跨域移动数据生成和共享以实现多学科创新的值得信赖的 CI 仿真工具
- 批准号:
2411151 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
SWIFT-SAT: Observational Data Sharing
SWIFT-SAT:观测数据共享
- 批准号:
2332422 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
Research Coordination Network (RCN) for Privacy Preserving Data Sharing and Analytics
用于隐私保护数据共享和分析的研究协调网络 (RCN)
- 批准号:
2413978 - 财政年份:2024
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
Collaborative Research: IMR: MM-1B: Privacy-Preserving Data Sharing for Mobile Internet Measurement and Traffic Analytics
合作研究:IMR:MM-1B:移动互联网测量和流量分析的隐私保护数据共享
- 批准号:
2319486 - 财政年份:2023
- 资助金额:
$ 104.45万 - 项目类别:
Continuing Grant
Workshops on Smart Manufacturing with Open and Scaled Data Sharing in Semiconductor and Microelectronics Manufacturing; Virtual and In-Person; Washington, DC; October/November 2023
半导体和微电子制造中开放和规模化数据共享的智能制造研讨会;
- 批准号:
2334590 - 财政年份:2023
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant
EAGER: SMART-DMSP: Streamlining Metadata, Automation, and Research Tracking for Data Management and Sharing Plans
EAGER:SMART-DMSP:简化数据管理和共享计划的元数据、自动化和研究跟踪
- 批准号:
2332353 - 财政年份:2023
- 资助金额:
$ 104.45万 - 项目类别:
Standard Grant