Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
基本信息
- 批准号:RGPIN-2018-05581
- 负责人:
- 金额:$ 4.08万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2022
- 资助国家:加拿大
- 起止时间:2022-01-01 至 2023-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
As the size of the data has grown rapidly in recent years, many techniques that were useful for small, older systems have become outdated for large, modern applications, since they occupy too much space to fit into faster levels of memory hierarchy. Most of this space is not the raw data, but structural information added to improve search efficiency. Succinct data structures were proposed to address this problem, so that the information in large systems can be retrieved quickly, but the space requirement is little more than that of the raw data. My main research interests are in the design of succinct data structures to represent fundamental structures such as strings, binary relations, trees and graphs, as well as the design of space-efficient solutions to geometric query problems, text indexing and classic query problems over trees and arrays. My research also involves other algorithmic techniques that can be applied to large data sets. When data are too big to fit in internal memory, I/O-efficient algorithms can be used to minimize the data transfer between internal memory and secondary storage which often dominates computational cost. The idea of implicit data structures is to encode a data structure as a permutation of its data elements if possible, so that no additional space is required. Adaptive algorithms aim at improving query efficiency for data with more inherent sortedness. Parallel algorithms use multiple processors to achieve speedup. Our work shows that these techniques and succinct structures can be combined to provide more efficient solutions for large data sets, and the advancement in one technique may lead to new results for another.To provide theoretical and practical solutions to modern systems that process large data sets such as web search engines, geographic information systems and bioinformatics applications, this program will extend the research on succinct data structures, and start new research directions on this subject. The proposed research will use succinct data structures to develop new solutions to fundamental problems in algorithms and computational geometry such as text search and range search. Not only will this research yield new data structures that are more space-efficient than those designed in previous work, it will also improve the query and update efficiency of standard data structures; the latter is achieved by exploiting the compactness of succinct data structures to store more structural information to speed up operations without increasing the space cost. We will also start a new line of research by designing succinct data structures for bioinformatics applications.
近年来,随着数据量的快速增长,许多对小型旧系统有用的技术对于大型现代应用程序来说已经过时,因为它们占用太多空间,无法适应更快的内存层次结构。这些空间中的大部分不是原始数据,而是为了提高搜索效率而添加的结构化信息。为了解决这个问题,人们提出了简洁的数据结构,这样可以快速地检索大型系统中的信息,但所需的空间比原始数据多一点。我的主要研究兴趣是设计简洁的数据结构来表示基本结构,如字符串,二元关系,树和图,以及设计几何查询问题,文本索引和经典查询问题的空间高效解决方案。我的研究还涉及其他可以应用于大型数据集的算法技术。当数据太大而无法容纳在内部存储器中时,可以使用I/O高效算法来最小化内部存储器和辅助存储器之间的数据传输,这通常主导计算成本。隐式数据结构的思想是尽可能将数据结构编码为其数据元素的排列,这样就不需要额外的空间。自适应算法的目的是提高查询效率的数据具有更多的内在排序。并行算法使用多个处理器来实现加速。我们的工作表明,这些技术和简洁结构可以结合起来,为大数据集提供更有效的解决方案,一种技术的进步可能会导致另一种技术的新结果。为了为处理大数据集的现代系统提供理论和实践解决方案,如网络搜索引擎,地理信息系统和生物信息学应用,该计划将扩展简洁数据结构的研究,并在这个课题上开始新的研究方向。拟议中的研究将使用简洁的数据结构来开发新的解决方案,以解决算法和计算几何中的基本问题,如文本搜索和范围搜索。这项研究不仅会产生新的数据结构,比以前的工作中设计的空间效率更高,它也将提高查询和更新效率的标准数据结构;后者是通过利用简洁的数据结构的紧凑性来存储更多的结构信息,以加快操作,而不增加空间成本。我们还将通过为生物信息学应用设计简洁的数据结构来开始新的研究路线。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
He, Meng其他文献
Magnetoelectric transport and quantum interference effect in ultrathin manganite films
超薄锰酸盐薄膜中的磁电输运和量子干涉效应
- DOI:
10.1063/1.4873337 - 发表时间:
2014-04 - 期刊:
- 影响因子:4
- 作者:
Zhao, Rui-qiang;Guo, Hai-zhong;He, Meng;Yang, Guo-zhen - 通讯作者:
Yang, Guo-zhen
Highly sensitive and selective H2S gas sensors based on flower-like WO3/CuO composites operating at low/room temperature
- DOI:
10.1016/j.jallcom.2019.01.349 - 发表时间:
2019-06-05 - 期刊:
- 影响因子:6.2
- 作者:
He, Meng;Xie, Lili;Zhu, Zhi-Gang - 通讯作者:
Zhu, Zhi-Gang
MiR-142-3p as an Indicator of OSA Severity Predicts Prognosis in Lung Adenocarcinoma with OSA.
- DOI:
10.2147/nss.s385755 - 发表时间:
2022 - 期刊:
- 影响因子:3.4
- 作者:
Yang, Ting;He, Fang;Zhang, Mingxiang;Ai, Li;He, Meng;Liu, Xin;Li, Yongxia - 通讯作者:
Li, Yongxia
Moisture and solvent responsive cellulose/SiO2 nanocomposite materials
- DOI:
10.1007/s10570-014-0527-5 - 发表时间:
2015-02-01 - 期刊:
- 影响因子:5.7
- 作者:
He, Meng;Duan, Bo;Zhang, Lina - 通讯作者:
Zhang, Lina
Thickness-dependent surface morphology of La0.9Sr0.1MnO3 ultrathin films
La0.9Sr0.1MnO3 超薄膜的厚度相关表面形貌
- DOI:
10.1016/j.apsusc.2007.01.011 - 发表时间:
2007-05-15 - 期刊:
- 影响因子:6.7
- 作者:
He, Meng;Qiu, Jie;Jin, Kui-Juan - 通讯作者:
Jin, Kui-Juan
He, Meng的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('He, Meng', 18)}}的其他基金
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2021
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2020
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2019
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2018
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Effective and Efficient Smart Meter Data Analytics
有效且高效的智能电表数据分析
- 批准号:
536292-2018 - 财政年份:2018
- 资助金额:
$ 4.08万 - 项目类别:
Engage Grants Program
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2017
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2016
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2015
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2014
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2013
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
相似国自然基金
Scalable Learning and Optimization: High-dimensional Models and Online Decision-Making Strategies for Big Data Analysis
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:合作创新研究团队
Data-driven Recommendation System Construction of an Online Medical Platform Based on the Fusion of Information
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国青年学者研究基金项目
Development of a Linear Stochastic Model for Wind Field Reconstruction from Limited Measurement Data
- 批准号:
- 批准年份:2020
- 资助金额:40 万元
- 项目类别:
基于Linked Open Data的Web服务语义互操作关键技术
- 批准号:61373035
- 批准年份:2013
- 资助金额:77.0 万元
- 项目类别:面上项目
Molecular Interaction Reconstruction of Rheumatoid Arthritis Therapies Using Clinical Data
- 批准号:31070748
- 批准年份:2010
- 资助金额:34.0 万元
- 项目类别:面上项目
高维数据的函数型数据(functional data)分析方法
- 批准号:11001084
- 批准年份:2010
- 资助金额:16.0 万元
- 项目类别:青年科学基金项目
染色体复制负调控因子datA在细胞周期中的作用
- 批准号:31060015
- 批准年份:2010
- 资助金额:25.0 万元
- 项目类别:地区科学基金项目
Computational Methods for Analyzing Toponome Data
- 批准号:60601030
- 批准年份:2006
- 资助金额:17.0 万元
- 项目类别:青年科学基金项目
相似海外基金
Efficient algorithms and succinct data structures for acceleration of telescoping and related problems
用于加速伸缩及相关问题的高效算法和简洁数据结构
- 批准号:
RGPIN-2021-03147 - 财政年份:2022
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Efficient algorithms and succinct data structures for acceleration of telescoping and related problems
用于加速伸缩及相关问题的高效算法和简洁数据结构
- 批准号:
RGPIN-2021-03147 - 财政年份:2021
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2021
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures
简洁的数据结构
- 批准号:
551960-2020 - 财政年份:2020
- 资助金额:
$ 4.08万 - 项目类别:
University Undergraduate Student Research Awards
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2020
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2019
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
RGPIN-2018-05581 - 财政年份:2018
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2017
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual
Theory and Practice of Succinct Data Structures
简洁数据结构理论与实践
- 批准号:
16H02781 - 财政年份:2016
- 资助金额:
$ 4.08万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
Succinct Data Structures with Applications to Large Data Sets
简洁的数据结构及其在大数据集上的应用
- 批准号:
418613-2012 - 财政年份:2016
- 资助金额:
$ 4.08万 - 项目类别:
Discovery Grants Program - Individual