CIF:Small:Collaborative Research: Compressed databases for similarity queries: fundamental limits and algorithms
CIF:Small:协作研究:用于相似性查询的压缩数据库:基本限制和算法
基本信息
- 批准号:1321174
- 负责人:
- 金额:$ 25万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2013
- 资助国家:美国
- 起止时间:2013-07-01 至 2016-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Information theory has had a profound impact on the fields of data transmission and compression. In contrast, it has yielded comparably few insights into problems such as knowledge extraction from and efficient search of massive datasets. While current information-theoretic tools and techniques can be applied to these problems to some extent, the paradigms for which these tools were developed will be being carefully reexamined in this project. Models that accurately capture the fundamental challenges faced by efficient search in modern massive database systems will be developed and analyzed. The asymptotic fundamental limits, which characterize the tradeoffs between accuracy, compression rate and search efficiency, will be investigated, along with development of practical algorithms that approach the ultimate benchmarks. One concrete problem being pursued is that of compression for efficient query and search. In this setting, the goal is, given a compressed representation, to answer search queries about the data that was compressed. This is in stark contrast to traditional compression, where the data need be merely reconstructible from the compressed form. The approach taken is tailored to distributed database design, but is also relevant to compression schemes that allow search within the compressed domain. The fundamental quantities studied play a similar role to that of the channel capacity and entropy/rate-distortion in channel and source coding, respectively. On one hand, they yield an understanding of the fundamental limits on the performance that any system for similarity queries based on compressed representations can hope to attain. On the other, the insights obtained from the theory are guiding the construction of schemes that approach these limits in practice. We will investigate how existing practical approaches (such as various hashing and clustering techniques) perform with respect to the information theoretic limits, and the extent to which approaches that have proved to be practical in source and channel coding can be used as building blocks to develop new efficient search algorithms that significantly improve on the current state of the art.
信息理论对数据传输和压缩领域产生了深远的影响。相比之下,它对诸如从知识提取和对大规模数据集的有效搜索等问题等问题产生了相当多的见解。尽管当前的信息理论工具和技术可以在某种程度上应用于这些问题,但在该项目中将仔细地重新检查这些工具的范例。将开发和分析在现代大规模数据库系统中有效搜索所面临的基本挑战的模型。将研究渐近基本限制,这些限制将研究精度,压缩率和搜索效率之间的权衡,以及开发实现最终基准的实用算法的开发。要解决的一个具体问题是用于有效查询和搜索的压缩问题。在这种情况下,目标是给定压缩表示形式,以回答有关被压缩数据的搜索查询。这与传统的压缩形成鲜明对比,在这种压缩中,数据仅需从压缩形式重新结构。采用的方法是针对分布式数据库设计量身定制的,但也与允许在压缩域内进行搜索的压缩方案有关。研究的基本数量分别与通道和源源编码中的通道容量和熵/速率延伸的作用相似。一方面,他们对基于压缩表示的任何相似性查询系统的性能的基本限制产生了理解。另一方面,从理论中获得的见解指导了在实践中达到这些限制的方案的构建。我们将研究现有的实用方法(例如各种哈希和聚类技术)如何在信息理论限制方面执行如何执行,以及在源和渠道编码中被证明是实用的方法的程度,可以用作开发新的有效搜索算法的构建障碍,从而在现有艺术的现状中显着改善。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Tsachy Weissman其他文献
Communication-Efficient Federated Learning through Importance Sampling
通过重要性采样实现高效沟通的联邦学习
- DOI:
10.48550/arxiv.2306.12625 - 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Berivan Isik;Francesco Pase;Deniz Gündüz;Oluwasanmi Koyejo;Tsachy Weissman;Michele Zorzi - 通讯作者:
Michele Zorzi
Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
彩票改编:减轻法学硕士的破坏性干扰
- DOI:
- 发表时间:
2024 - 期刊:
- 影响因子:0
- 作者:
Ashwinee Panda;Berivan Isik;Xiangyu Qi;Sanmi Koyejo;Tsachy Weissman;Prateek Mittal - 通讯作者:
Prateek Mittal
Tsachy Weissman的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Tsachy Weissman', 18)}}的其他基金
Collaborative Research: CIF: Medium: An Information-Theoretic Foundation for Adaptive Bidding in First-Price Auctions
合作研究:CIF:媒介:一价拍卖中自适应出价的信息理论基础
- 批准号:
2106467 - 财政年份:2021
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
CIF: Small: Collaborative Research: Inference of Information Measures on Large Alphabets: Fundamental Limits, Fast Algorithms, and Applications
CIF:小型:协作研究:大字母表上信息测量的推断:基本限制、快速算法和应用
- 批准号:
1528159 - 财政年份:2015
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
EAGER: Action in Information Processing
EAGER:信息处理中的行动
- 批准号:
1049413 - 财政年份:2010
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: The Role of Feedback in Two-Way Communication Networks
协作研究:反馈在双向通信网络中的作用
- 批准号:
0729119 - 财政年份:2007
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
CAREER: Toward a Unified Approach to Universality in Information Processing
职业:走向信息处理通用性的统一方法
- 批准号:
0546535 - 财政年份:2006
- 资助金额:
$ 25万 - 项目类别:
Continuing Grant
相似国自然基金
基于超宽频技术的小微型无人系统集群协作关键技术研究与应用
- 批准号:
- 批准年份:2020
- 资助金额:57 万元
- 项目类别:面上项目
异构云小蜂窝网络中基于协作预编码的干扰协调技术研究
- 批准号:61661005
- 批准年份:2016
- 资助金额:30.0 万元
- 项目类别:地区科学基金项目
密集小基站系统中的新型接入理论与技术研究
- 批准号:61301143
- 批准年份:2013
- 资助金额:24.0 万元
- 项目类别:青年科学基金项目
ScFVCD3-9R负载Bcl-6靶向小干扰RNA治疗EAMG的试验研究
- 批准号:81072465
- 批准年份:2010
- 资助金额:31.0 万元
- 项目类别:面上项目
基于小世界网络的传感器网络研究
- 批准号:60472059
- 批准年份:2004
- 资助金额:21.0 万元
- 项目类别:面上项目
相似海外基金
Collaborative Research: CIF: Small: Mathematical and Algorithmic Foundations of Multi-Task Learning
协作研究:CIF:小型:多任务学习的数学和算法基础
- 批准号:
2343599 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research: CIF: Small: Mathematical and Algorithmic Foundations of Multi-Task Learning
协作研究:CIF:小型:多任务学习的数学和算法基础
- 批准号:
2343600 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research:CIF:Small:Acoustic-Optic Vision - Combining Ultrasonic Sonars with Visible Sensors for Robust Machine Perception
合作研究:CIF:Small:声光视觉 - 将超声波声纳与可见传感器相结合,实现强大的机器感知
- 批准号:
2326905 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research:CIF:Small:Fisher-Inspired Approach to Quickest Change Detection for Score-Based Models
合作研究:CIF:Small:Fisher 启发的基于评分模型的最快变化检测方法
- 批准号:
2334898 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant
Collaborative Research:CIF:Small:Fisher-Inspired Approach to Quickest Change Detection for Score-Based Models
合作研究:CIF:Small:Fisher 启发的基于评分模型的最快变化检测方法
- 批准号:
2334897 - 财政年份:2024
- 资助金额:
$ 25万 - 项目类别:
Standard Grant