II: Data Cooperatives: Rapid and Incremental Data Sharing with Applications to Bioinformatics
II:数据合作社:快速增量数据共享及其在生物信息学中的应用
基本信息
- 批准号:0513778
- 负责人:
- 金额:$ 129.53万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2005
- 资助国家:美国
- 起止时间:2005-07-15 至 2009-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Generic tools and technologies for creating and maintaining data cooperatives- confederations whose purpose is distributed data sharing-will be developed to overcome the difficultiess encountered in the sharing of information in life sciences, specifically in bioinformatics.The vision of large-scale data sharing has been a long-time goal of the bioinformatics field, much of it proceeding through data integration efforts. However, conventional approaches to data integration do not have the necessary flexibility and adaptability to make the existing and future plethora of data accessible and usable to typical biologists, while keeping it rapidly extensible to new concepts, domains, and types of queries, and thus fostering new research developments. The main reasons are that (1) different biologists work with different types of data and at differing levels of abstraction; (2) schemas in the bioinformatics world are typically large and complex; (3) queries and mappings may "break" without warning because of asynchronous updates; (4) it is logistically, economically and politically difficult to operate centralized data integration facilities. In response to these difficulties data cooperatives emphasize: decentralization for both scalability and flexibility, incremental development of resources such as schemas, mappings, and queries, rapid discovery mechanisms for finding the resources relevant to a topic, and tolerance for intermittent participation of members and for approximate consistency of mappings.More specifically, the technical goals of the proposal include: (1)collaboratively developed yellow pages of biological topics; (2) schema templates, capturing the part of the structure of data pertaining to a specific interest and functioning also as visual templates from which a query form created; (3) incremental specification of mappings; (4) reasoning about uncertainty in mappings by measuring with statistical tools their degree of reliability and using it in query answering; (5) multi-path answering for queries with caching and replication in a large-scale data cooperative where the participation of individual members may not always be assured.Data cooperatives will have broader impact through applications in a variety of scientific and industrial fields, but it is in the field of bioinformatics that they are likely to have an immediate and significant impact. Therefore, a specific data cooperative as a biological testbed for evaluating the proposed technologies. This testbed is based on a small set of databases which are already collaborating and exchanging data related to Plasmodium falciparum. Broader impact will be also be achieved through the proposed educational initiatives, specifically through a "compu-tational orchestra" bioinformatics course which will expose students to data integration issues through project work, and a workshop for the Greater Philadelphia Bioinformatics Alliance (GPBA). Minority involvement will also be encouraged through a GPBA internship program.
通用的工具和技术,用于创建和维护数据合作社-联盟,其目的是分布式数据共享-将被开发,以克服遇到的困难,在生命科学,特别是在bioinformations.The大规模的数据共享的愿景信息共享一直是生物信息学领域的一个长期目标,其中大部分是通过数据集成的努力进行。然而,传统的数据集成方法不具有必要的灵活性和适应性,使现有的和未来的大量数据可访问和可用的典型的生物学家,同时保持它快速扩展到新的概念,域和类型的查询,从而促进新的研究发展。主要原因是:(1)不同的生物学家使用不同类型的数据和不同的抽象层次;(2)生物信息学世界中的模式通常是庞大而复杂的;(3)由于异步更新,查询和映射可能会在没有警告的情况下“中断”;(4)在逻辑上,经济上和政治上难以操作集中的数据集成设施。针对这些困难,数据合作社强调:分散化的可伸缩性和灵活性,资源的增量开发,如模式,映射和查询,快速发现机制,寻找与主题相关的资源,以及容忍成员的间歇性参与和映射的近似一致性。更具体地说,该提案的技术目标包括:(1)合作开发的生物主题黄页;(2)模式模板,捕获与特定兴趣有关的数据结构的一部分,并且还用作创建查询表单的视觉模板;(3)映射的增量规范;(4)通过统计工具测量映射的可靠性程度来推理映射中的不确定性,并将其用于查询回答;(5)在大规模的网络中,利用缓存和复制对查询进行多路径应答,大规模的数据合作社,其中个别成员的参与可能并不总是得到保证。数据合作社将通过在各种科学和工业领域的应用产生更广泛的影响,但在生物信息学领域,它们可能会产生直接和重大的影响。因此,一个特定的数据合作社作为生物试验平台,用于评估拟议的技术。该试验平台以一小部分数据库为基础,这些数据库已经在合作和交换与恶性疟原虫有关的数据。 更广泛的影响也将通过拟议的教育举措,特别是通过一个“计算管弦乐队”生物信息学课程,这将使学生接触数据集成问题,通过项目工作,并为大费城生物信息学联盟(GPBA)的研讨会。还将通过GPBA实习计划鼓励少数族裔参与。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Susan Davidson其他文献
Cause and Effect: The Relationship Between Acne and Self-Esteem in the Adolescent Years
- DOI:
10.1016/j.nurpra.2008.01.021 - 发表时间:
2008-09-01 - 期刊:
- 影响因子:
- 作者:
Sandra L. Hedden;Susan Davidson;Christine B. Smith - 通讯作者:
Christine B. Smith
"Conversations: Rauschenberg in China"
《对话:劳森伯格在中国》
- DOI:
- 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Hiroko Ikegami;David White;Helen Hsu;Susan Davidson - 通讯作者:
Susan Davidson
Proceedings of Patient Reported Outcome Measure’s (PROMs) Conference Oxford 2017: Advances in Patient Reported Outcomes Research
- DOI:
10.1186/s12955-017-0757-y - 发表时间:
2017-10-01 - 期刊:
- 影响因子:3.400
- 作者:
Galina Velikova;Jose M. Valderas;Caroline Potter;Laurie Batchelder;Christine A’Court;Matthew Baker;Jennifer Bostock;Angela Coulter;Ray Fitzpatrick;Julien Forder;Diane Fox;Louise Geneen;Elizabeth Gibbons;Crispin Jenkinson;Karen Jones;Laura Kelly;Michele Peters;Brendan Mulhern;Alexander Labeit;Donna Rowen;Keith Meadows;Jackie Elliott;John Brazier;Emma Knowles;Anju Keetharuth;John Brazier;Janice Connell;Jill Carlton;Lizzie Taylor Buck;Thomas Ricketts;Michael Barkham;Pushpendra Goswami;Sam Salek;Tatyana Ionova;Esther Oliva;Adele K. Fielding;Marina Karakantza;Saad Al-Ismail;Graham P. Collins;Stewart McConnell;Catherine Langton;Daniel M. Jennings;Roger Else;Jonathan Kell;Helen Ward;Sophie Day;Elizabeth Lumley;Patrick Phillips;Rosie Duncan;Helen Buckley-Woods;Ahmed Aber;Gerogina Jones;Jonathan Michaels;Ian Porter;Jaheeda Gangannagaripalli;Antoinette Davey;Ignacio Ricci-Cabello;Kirstie Haywood;Stine Thestrup Hansen;Jose Valderas;Deb Roberts;Anil Gumber;Bélène Podmore;Andrew Hutchings;Jan van der Meulen;Ajay Aggarwal;Sujith Konan;Andrew Price;William Jackson;Nick Bottomley;Michael Philiips;Toby Knightley-Day;David Beard;Elizabeth Gibbons;Ray Fitzpatrick;Joanne Greenhalgh;Kate Gooding;Elizabeth Gibbons;Chema Valderas;Judy Wright;Sonia Dalkin;David Meads;Nick Black;Carol Fawkes;Robert Froud;Dawn Carnes;Andrew Price;Jonathan Cook;Helen Dakin;James Smith;Sujin Kang;David Beard;Catrin Griffiths;Ella Guest;Diana Harcourt;Mairead Murphy;Sandra Hollinghurst;Chris Salisbury;Jill Carlton;Jackie Elliott;Donna Rowen;Anqi Gao;Andrew Price;David Beard;Agnieszka Lemanska;Tao Chen;David P. Dearnaley;Rajesh Jena;Matthew Sydes;Sara Faithfull;A. E. Ades;Daphne Kounali;Guobing Lu;Ines Rombach;Alastair Gray;Crispin Jenkinson;Oliver Rivero-Arias;Patricia Holch;Marie Holmes;Zoe Rodgers;Sarah Dickinson;Beverly Clayton;Susan Davidson;Jacqui Routledge;Julia Glennon;Ann M. Henry;Kevin Franks;Galina Velikova;Roma Maguire;Lisa McCann;Teresa Young;Jo Armes;Jenny Harris;Christine Miaskowski;Grigorios Kotronoulas;Morven Miller;Emma Ream;Elizabeth Patiraki;Alexander Geiger;Geir V. Berg;Adrian Flowerday;Peter Donnan;Paul McCrone;Kathi Apostolidis;Patricia Fox;Eileen Furlong;Nora Kearney;Chris Gibbons;Felix Fischer;Chris Gibbons;Joel Coste;Jose Valderas Martinez;Matthias Rose;Alain Leplege;Sarah Shingler;Natalie Aldhouse;Tamara Al-Zubeidi;Andrew Trigg;Helen Kitchen;Antoinette Davey;Ian Porter;Colin Green;Jose M. Valderas;Joanna Coast;Sarah Smith;Jolijn Hendriks;Nick Black;Koonal Shah;Oliver Rivero-Arias;Juan-Manuel Ramos-Goni;Simone Kreimeier;Mike Herdman;Nancy Devlin;Aureliano Paolo Finch;John E. Brazier;Clara Mukuria;Bernarda Zamora;David Parkin;Yan Feng;Andrew Bateman;Mike Herdman;Nancy Devlin;Thomas Patton;Nils Gutacker;Koonal Shah - 通讯作者:
Koonal Shah
世界建築史15講』
世界建筑史15讲》
- DOI:
- 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Hiroko Ikegami;David White;Helen Hsu;Susan Davidson;加治屋健司;Hiroko Ikegami;辻泰岳 - 通讯作者:
辻泰岳
Past Disquiet: Artists, International Solidarity and Museums-in-Exile
过去的不安:艺术家、国际团结和流亡博物馆
- DOI:
- 发表时间:
2018 - 期刊:
- 影响因子:0
- 作者:
Hiroko Ikegami;David White;Helen Hsu;Susan Davidson;加治屋健司;Hiroko Ikegami;辻泰岳;Hiroko Ikegami;Izumi Nakajima - 通讯作者:
Izumi Nakajima
Susan Davidson的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Susan Davidson', 18)}}的其他基金
III: Medium: Collaborative Research: Citing Structured and Evolving Data
III:媒介:协作研究:引用结构化和不断变化的数据
- 批准号:
1302212 - 财政年份:2013
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
BPC-DP: Penn COMP-ACT: A College Service Learning Course to Promote and Increase COMPutational Thinking and ACTivities in Afterschool and Summer Programs
BPC-DP:宾夕法尼亚大学 COMP-ACT:大学服务学习课程,旨在促进和提高课后和暑期项目中的计算思维和活动
- 批准号:
0940511 - 财政年份:2010
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
III-COR-Medium: Providing Provenance through Workflows and Database Transformations
III-COR-Medium:通过工作流程和数据库转换提供来源
- 批准号:
0803524 - 财政年份:2008
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
SEIII: Workshop on Information Integration
SEIII:信息集成研讨会
- 批准号:
0632541 - 财政年份:2006
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
Collaborative Research: SEI+II ProtocolDB: Archiving and Querying Scientific Protocols, Data and Provenance
合作研究:SEI II ProtocolDB:归档和查询科学协议、数据和来源
- 批准号:
0612177 - 财政年份:2006
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
Preserving Constraints in XML Data Exchange
保留 XML 数据交换中的约束
- 批准号:
0415810 - 财政年份:2005
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
DLI Phase-2: Data Provenance
DLI 第 2 阶段:数据来源
- 批准号:
9817444 - 财政年份:1999
- 资助金额:
$ 129.53万 - 项目类别:
Continuing Grant
A Deterministic Model for Semistructured and Structured Data
半结构化和结构化数据的确定性模型
- 批准号:
9977408 - 财政年份:1999
- 资助金额:
$ 129.53万 - 项目类别:
Standard Grant
Mediated Access to Biological Databases and Applications
对生物数据库和应用程序的介导访问
- 批准号:
9402292 - 财政年份:1994
- 资助金额:
$ 129.53万 - 项目类别:
Continuing Grant
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
- 批准号:
- 批准年份:2020
- 资助金额:40 万元
- 项目类别:
基于Linked Open Data的Web服务语义互操作关键技术
- 批准号:61373035
- 批准年份:2013
- 资助金额:77.0 万元
- 项目类别:面上项目
Molecular Interaction Reconstruction of Rheumatoid Arthritis Therapies Using Clinical Data
- 批准号:31070748
- 批准年份:2010
- 资助金额:34.0 万元
- 项目类别:面上项目
高维数据的函数型数据(functional data)分析方法
- 批准号:11001084
- 批准年份:2010
- 资助金额:16.0 万元
- 项目类别:青年科学基金项目
染色体复制负调控因子datA在细胞周期中的作用
- 批准号:31060015
- 批准年份:2010
- 资助金额:25.0 万元
- 项目类别:地区科学基金项目
Computational Methods for Analyzing Toponome Data
- 批准号:60601030
- 批准年份:2006
- 资助金额:17.0 万元
- 项目类别:青年科学基金项目
相似海外基金
An innovative platform using ML/AI to analyse farm data and deliver insights to improve farm performance, increasing farm profitability by 5-10%
An%20innovative%20platform%20using%20ML/AI%20to%20analysis%20farm%20data%20and%20deliver%20insights%20to%20improv%20farm%20performance,%20increasing%20farm%20profitability%20by%205-10%
- 批准号:
10093235 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
Collaborative R&D
Seamless integration of Financial data into ESG data
将财务数据无缝集成到 ESG 数据中
- 批准号:
10099890 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
Collaborative R&D
Patient Lifestyle and Disease Data Interactium (PaLaDIn)
患者生活方式和疾病数据交互 (PaLaDIn)
- 批准号:
10103989 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
EU-Funded
Patient Lifestyle and Disease Data Interactium (PaLaDIn)
患者生活方式和疾病数据交互 (PaLaDIn)
- 批准号:
10105921 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
EU-Funded
Treecle - data and automation to unlock woodland creation in the UK to achieve net zero
Treecle - 数据和自动化解锁英国林地创造以实现净零排放
- 批准号:
10111492 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
SME Support
NEMO - Net zero events using multiple open data sources
NEMO - 使用多个开放数据源的净零事件
- 批准号:
10114096 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
SME Support
Facilitating circular construction practices in the UK: A data driven online marketplace for waste building materials
促进英国的循环建筑实践:数据驱动的废弃建筑材料在线市场
- 批准号:
10113920 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
SME Support
Quantum Machine Learning for Financial Data Streams
金融数据流的量子机器学习
- 批准号:
10073285 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
Feasibility Studies
N2Vision+: A robot-enabled, data-driven machine vision tool for nitrogen diagnosis of arable soils
N2Vision:一种由机器人驱动、数据驱动的机器视觉工具,用于耕地土壤的氮诊断
- 批准号:
10091423 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
Collaborative R&D
Tracking flood waters over Australia using space gravity data
使用空间重力数据跟踪澳大利亚的洪水
- 批准号:
DP240102399 - 财政年份:2024
- 资助金额:
$ 129.53万 - 项目类别:
Discovery Projects