Compressing Unordered Data: Theory, Algorithms, and Applications

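Basic information

  • Award number:
    0729069
  • Principal investigator:
  • Amount:
    --
  • Host institution:
  • Host institution country:
    United States
  • Award type:
    Standard Grant
  • Fiscal year:
    2007
  • Funding country:
    United States
  • Period:
    2007-09-15 to 2011-08-31
  • Status:
    Completed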

Project abstract

Systems for information storage, gathering, or communication should be designed with careful attention to how the information will eventually be used. Often data are used without regard to their ordering; e.g., most databases are accessed by searching, and order is irrelevant to statistics such as means and medians. The observation that order can be irrelevant is powerful because ignoring order sometimes improves compression dramatically. Remarkably, in some situations the reduction in bits can approach 100%. This project seeks to develop theory, algorithms, and applications for communication when ordering is fully or partially irrelevant. The theoretical aspect includes establishing performance bounds when very little is known about the information source prior to encoding and when ordering is partially maintained. The algorithmic focus is on computationally efficient algorithms with limited buffering requirements. Lowering the required communication rates in networked data gathering could enable cheaper, smaller, and lower-power devices and thus hasten the deployment of large-scale and battery-operated sensing systems.

The genesis of this project is the following order reduction: communicating any nontrivial sequence of n (ordered) symbols requires a number of bits that is linear in n; however, disregarding order lowers the rate to O(log n) when the source alphabet is finite. Results for universal coding over countable alphabets and rate-distortion problems also show large differences between the ordered and unordered communication problems. The project aims to establish fundamental bounds on compressing unordered data (discrete- and continuous-valued sources, with or without full distributional knowledge); develop compression techniques (scalar and vector quantizers, indexing, refinement); and apply data set (as opposed to sequence) compression in practice. Extending the results to partial preservation of order could have important consequences for conventional compression.
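
The gap between linear and logarithmic cost can be illustrated with a simple counting argument. The following is a minimal sketch, not the project's method: it assumes a known finite alphabet of size k and a fixed-length index over all possible inputs, and the function names (`bits_ordered`, `bits_unordered`, `histogram`) are hypothetical. An ordered sequence of n symbols is one of k^n possibilities, whereas an unordered collection is fully described by its histogram, one of C(n + k - 1, k - 1) multisets.

```python
from math import comb, log2


def bits_ordered(n: int, k: int) -> float:
    """Bits for a fixed-length index over all k**n ordered sequences."""
    return n * log2(k)


def bits_unordered(n: int, k: int) -> float:
    """Bits for a fixed-length index over all multisets of size n.

    There are C(n + k - 1, k - 1) such multisets (stars and bars).
    """
    return log2(comb(n + k - 1, k - 1))


def histogram(symbols, k: int) -> list:
    """Order-free description of the data: one count per alphabet symbol."""
    counts = [0] * k
    for s in symbols:
        counts[s] += 1
    return counts


if __name__ == "__main__":
    k = 2  # illustrative binary alphabet (an assumption, not from the abstract)
    for n in (10, 1_000, 100_000):
        ordered, unordered = bits_ordered(n, k), bits_unordered(n, k)
        saving = 1 - unordered / ordered
        print(f"n={n:>7}: ordered={ordered:.0f} bits, "
              f"unordered={unordered:.1f} bits, saving={saving:.1%}")
```

For fixed k the multiset count grows polynomially in n, so its logarithm grows like (k - 1) log n, while the ordered cost grows like n log k. The relative saving therefore tends toward 100% as n increases, which is consistent with the figure quoted in the abstract.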

Project outcomes

Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)

Other publications by Vivek Goyal

A Review- on Different Types of Displays
  • DOI:
    10.14257/ijmue.2016.11.8.33
  • Year published:
    2016
  • Journal:
  • Impact factor:
    0
  • Authors:
    Shubham Shama;Udit Jindal;Mehul Goyal;Sahil Sharma;Vivek Goyal
  • Corresponding author:
    Vivek Goyal
Real Time Contingency Analysis for Power Grids
  • DOI:
    10.1007/978-3-642-23397-5_31
  • Year published:
    2011
  • Journal:
  • Impact factor:
    0
  • Authors:
    A. Mittal;J. Hazra;Nikhil Jain;Vivek Goyal;D. Seetharam;Yogish Sabharwal
  • Corresponding author:
    Yogish Sabharwal
IncSYS
  • DOI:
    10.1109/mpe.2022.3194425
  • Year published:
    2022
  • Journal:
  • Impact factor:
    2.8
  • Authors:
    A. Mittal;J. Hazra;Nikhil Jain;Vivek Goyal;D. Seetharam;Yogish Sabharwal
  • Corresponding author:
    Yogish Sabharwal

Other grants held by Vivek Goyal

CCSS: Signal Processing for Single-Photon Detectors
  • Award number:
    2039762
  • Fiscal year:
    2021
  • Award amount:
    --
  • Award type:
    Standard Grant
Collaborative Research: CIF: Medium: Occlusion and Directional Resolution in Computational Imaging
  • Award number:
    1955219
  • Fiscal year:
    2020
  • Award amount:
    --
  • Award type:
    Continuing Grant
CIF: Small: Sequential and Compound Estimation for Computational Imaging Systems
  • Award number:
    1815896
  • Fiscal year:
    2018
  • Award amount:
    --
  • Award type:
    Standard Grant
CIF: Small: Quantization for Acquisition and Computation Networks
  • Award number:
    1441917
  • Fiscal year:
    2014
  • Award amount:
    --
  • Award type:
    Standard Grant
CIF: Small: Low-Light 3D Imaging: From Fundamental Limits to Practical Systems
  • Award number:
    1422034
  • Fiscal year:
    2014
  • Award amount:
    --
  • Award type:
    Standard Grant
CIF: Medium: Space-from-Time Imaging: Fundamental Limits, Algorithms, and Preliminary Demonstrations
  • Award number:
    1161413
  • Fiscal year:
    2012
  • Award amount:
    --
  • Award type:
    Continuing Grant
ICES: Small: Decision Making with Bounded Categorization
  • Award number:
    1101147
  • Fiscal year:
    2011
  • Award amount:
    --
  • Award type:
    Standard Grant
CIF: Small: Quantization for Acquisition and Computation Networks
  • Award number:
    1115159
  • Fiscal year:
    2011
  • Award amount:
    --
  • Award type:
    Standard Grant
CAREER: Acquisition, Approximation, and Compression - An Integrated Study
  • Award number:
    0643836
  • Fiscal year:
    2007
  • Award amount:
    --
  • Award type:
    Continuing Grant

Similar overseas grants

Study on the controlling mechanism of fibrin gel formation by the unordered region of fibrinogen and N-linked saccharide chain
  • Award number:
    24550245
  • Fiscal year:
    2012
  • Award amount:
    --
  • Award type:
    Grant-in-Aid for Scientific Research (C)
A Study on Top-K algorithm for Large Unordered Tree Databases
  • Award number:
    24650042
  • Fiscal year:
    2012
  • Award amount:
    --
  • Award type:
    Grant-in-Aid for Challenging Exploratory Research