QUINTON -- QUerying and INTegrating Over Nested data

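Basic information

  • Grant number:
    EP/T022124/1
  • Principal investigator:
  • Amount:
    $1.3249 million
  • Host institution:
  • Host institution country:
    United Kingdom
  • Project type:
    Research Grant
  • Fiscal year:
    2021
  • Funding country:
    United Kingdom
  • Duration:
    2021 to (no data)
  • Project status:
    Ongoing

Project abstract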

It has long been recognized that nested data models -- in which information is modelled as collections of tuples whose attributes may in turn take values that are collections -- are the most natural modelling formalism for a wide variety of information management scenarios. Query languages that support nested data were developed decades ago. But even as emerging applications have made querying nested data more crucial, and even as many of the most important big data management frameworks assume programmatic interfaces based on nested data, processing large-scale nested data remains extremely cumbersome, radically more so than in the case of flat data. Our research hypothesis is that fundamental problems in querying and integrating nested data need to be resolved for this situation to change.
This project will provide new foundations for both querying and integrating nested data. On the querying side, we will establish a standard processing pipeline for queries over nested data. This will include a foundational study of the basic transformations involved in any such pipeline, such as the "shredding" of nested queries into relational queries. It will also include the development of algorithms and tools that implement this pipeline, working on top of scalable infrastructure for flat data, such as the Apache Spark project. On the integration side, we will establish the foundations of specifying and querying virtual data sources consisting of nested data, and develop middleware that can implement queries over virtual data on top of heterogeneous nested data sources.
The impact of QUINTON is both practical and foundational. We will build infrastructure for querying and integration, but we will also investigate the fundamental problems of scalable querying over materialized and virtual data sources, providing foundations that can guide the research community in future implementations.
We will also drill down into a particularly compelling and timely application of nested data integration and management, working with an industrial partner to build components and novel analyses in the area of biomedical data management. Our partner deals with unified interfaces to diverse biomedical data sources -- clinical, imaging, and genomic data -- and their use cases are a perfect fit for the technology we are developing.
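To make the idea of "shredding" mentioned in the abstract more concrete, here is a minimal Python sketch. It is not the project's pipeline or algorithm: the patient/sample records, the field names, and the functions `shred`, `unshred`, and `high_quality` are all hypothetical, chosen only to illustrate how a nested collection can be split into flat relations linked by a surrogate key, queried with an ordinary flat selection, and then re-nested.

```python
from itertools import count

# Hypothetical nested input: each patient record holds a collection of samples.
PATIENTS = [
    {"name": "P1", "samples": [{"gene": "BRCA1", "quality": 0.9},
                               {"gene": "TP53", "quality": 0.4}]},
    {"name": "P2", "samples": []},
    {"name": "P3", "samples": [{"gene": "TP53", "quality": 0.8}]},
]


def shred(patients):
    """Split nested records into two flat relations linked by a surrogate key."""
    ids = count(1)
    patient_rows = []  # (patient_id, name)
    sample_rows = []   # (patient_id, gene, quality)
    for patient in patients:
        pid = next(ids)
        patient_rows.append((pid, patient["name"]))
        for sample in patient["samples"]:
            sample_rows.append((pid, sample["gene"], sample["quality"]))
    return patient_rows, sample_rows


def unshred(patient_rows, sample_rows):
    """Rebuild the nested records from the two flat relations."""
    grouped = {}
    for pid, gene, quality in sample_rows:
        grouped.setdefault(pid, []).append({"gene": gene, "quality": quality})
    return [{"name": name, "samples": grouped.get(pid, [])}
            for pid, name in patient_rows]


def high_quality(patients, threshold):
    """Nested query evaluated via flat relations: keep samples above a threshold."""
    patient_rows, sample_rows = shred(patients)
    # The inner filter becomes an ordinary selection on the flat sample relation.
    kept = [row for row in sample_rows if row[2] >= threshold]
    return unshred(patient_rows, kept)


if __name__ == "__main__":
    print(high_quality(PATIENTS, 0.5))
```

In this sketch the flat relations are plain Python lists of tuples; in a scalable setting they would presumably be tables in a flat-data engine such as Apache Spark, which is the kind of infrastructure the abstract refers to.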

Project outputs

Journal papers (10)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Balancing Expressiveness and Inexpressiveness in View Design
Functional Collection Programming with Semi-Ring Dictionaries
  • DOI:
    10.48550/arxiv.2103.06376
  • Published:
    2021
  • Journal:
  • Impact factor:
    0
  • Authors:
    Shaikhha A
  • Corresponding author:
    Shaikhha A
Embedded Finite Models beyond Restricted Quantifier Collapse
  • DOI:
    10.1109/lics56636.2023.10175804
  • Published:
    2023
  • Journal:
  • Impact factor:
    0
  • Authors:
    Benedikt M
  • Corresponding author:
    Benedikt M
Rewriting the infinite chase
Synthesizing Nested Relational Queries from Implicit Specifications
  • DOI:
    10.1145/3584372.3588653
  • Published:
    2023
  • Journal:
  • Impact factor:
    0
  • Authors:
    Benedikt M
  • Corresponding author:
    Benedikt M

Other publications by Michael Benedikt

Form Filling Based on Constraint Solving
Monadic Datalog, Tree Validity, and Limited Access Containment
Verification of Two-Variable Logic Revisited
XPath leashed
  • DOI:
    10.1145/1456650.1456653
  • Published:
    2009
  • Journal:
  • Impact factor:
    0
  • Authors:
    Michael Benedikt;Christoph Koch
  • Corresponding author:
    Christoph Koch
The FCC feasibility study

Other grants held by Michael Benedikt

PDQ: Proof-driven Query Planning
  • Grant number:
    EP/M005852/1
  • Fiscal year:
    2015
  • Funding amount:
    $1.3249 million
  • Project type:
    Fellowship
Query-driven Data Acquisition from Web-based Data Sources
  • Grant number:
    EP/H017690/1
  • Fiscal year:
    2010
  • Funding amount:
    $1.3249 million
  • Project type:
    Research Grant
Enforcement of Constraints on XML Streams
  • Grant number:
    EP/G004021/1
  • Fiscal year:
    2009
  • Funding amount:
    $1.3249 million
  • Project type:
    Research Grant
Describing and Perceiving Space in Architectural Environments
  • Grant number:
    7817451
  • Fiscal year:
    1979
  • Funding amount:
    $1.3249 million
  • Project type:
    Standard Grant

Similar overseas grants

CICI:UCSS: ARMOR: Secure Querying of Massive Scientific Datasets
  • Grant number:
    2232813
  • Fiscal year:
    2023
  • Funding amount:
    $1.3249 million
  • Project type:
    Standard Grant
CAREER: Algorithmic Aspects of Pan-genomic Data Modeling, Indexing and Querying
  • Grant number:
    2316691
  • Fiscal year:
    2023
  • Funding amount:
    $1.3249 million
  • Project type:
    Continuing Grant
BRAIN CONNECTS: Rapid and Cost‐effective Connectomics with Intelligent Image Acquisition, Reconstruction, and Querying
  • Grant number:
    10663654
  • Fiscal year:
    2023
  • Funding amount:
    $1.3249 million
  • Project type:
Querying Heterogeneous Data for Non-Expert Users
  • Grant number:
    RGPIN-2021-03819
  • Fiscal year:
    2022
  • Funding amount:
    $1.3249 million
  • Project type:
    Discovery Grants Program - Individual
Mining Web Data Sources for Integrated Informative Querying and Recommendation
  • Grant number:
    RGPIN-2019-04565
  • Fiscal year:
    2022
  • Funding amount:
    $1.3249 million
  • Project type:
    Discovery Grants Program - Individual
Building and Querying Knowledge Graphs from Text Corpora
  • Grant number:
    RGPIN-2018-04270
  • Fiscal year:
    2022
  • Funding amount:
    $1.3249 million
  • Project type:
    Discovery Grants Program - Individual
Querying and Mining Dynamics in Evolving Graphs and Networks
  • Grant number:
    RGPIN-2020-04506
  • Fiscal year:
    2022
  • Funding amount:
    $1.3249 million
  • Project type:
    Discovery Grants Program - Individual
CAREER: Algorithmic Aspects of Pan-genomic Data Modeling, Indexing and Querying
  • Grant number:
    2146003
  • Fiscal year:
    2022
  • Funding amount:
    $1.3249 million
  • Project type:
    Continuing Grant
CAREER: Brain Imaging Genetics via multimodal modular structure querying
  • Grant number:
    2045848
  • Fiscal year:
    2021
  • Funding amount:
    $1.3249 million
  • Project type:
    Continuing Grant
Querying Heterogeneous Data for Non-Expert Users
  • Grant number:
    DGECR-2021-00212
  • Fiscal year:
    2021
  • Funding amount:
    $1.3249 million
  • Project type:
    Discovery Launch Supplement