权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

CIF: Small: Collaborative Research: Inference of Information Measures on Large Alphabets: Fundamental Limits, Fast Algorithms, and Applications

CIF：小型：协作研究：大字母表上信息测量的推断：基本限制、快速算法和应用

基本信息

批准号：
1528159
负责人：
Tsachy Weissman
金额：
$ 25万
依托单位：
Stanford University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2015
资助国家：
美国
起止时间：
2015-09-01 至 2018-08-31
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=1528159&HistoricalAwards=false
关键词：
CIF Small Collaborative Research Inference

项目摘要

A key task in information theory is to characterize fundamental performance limits in compression, communication, and more general operational problems involving the storage, transmission and processing of information. Such characterizations are usually in terms of information measures, among the most fundamental of which are the Shannon entropy and the mutual information. In addition to their prominent operational roles in the traditional realms of information theory, information measures have found numerous applications in many statistical modeling and machine learning tasks. Various modern data-analytic applications deal with data sets naturally viewed as samples from a probability distribution over a large domain. Due to the typically large alphabet size and resource constraints, the practitioner contends with the difficulty of undersampling in applications ranging from corpus linguistics to neuroscience. One of the main goals of this project is the development of a general theory based on a new set of mathematical tools that will facilitate the construction and analysis of optimal estimation of information measures on large alphabets. The other major facet of this project is the incorporation of the new theoretical methodologies into machine learning algorithms, thereby significantly impacting current real-world learning practices. Successful completion of this project will result in enabling technologies and practical schemes - in applications ranging from analysis of neural response data to learning graphical models - that are provably much closer to attaining the fundamental performance limits than existing ones. The findings of this project will enrich existing big data-analytic curricula. A new course dedicated to high-dimensional statistical inference that addresses estimation for large-alphabet data in depth will be created and offered. Workshops on the themes and findings of this project will be organized and held at Stanford and UIUC. A comprehensive approximation-theoretic approach to estimating functionals of distributions on large alphabets will be developed via computationally efficient procedures based on best polynomial approximation, with provable essential optimality guarantees. Rooted in the high-dimensional statistics literature, our key observation is that while estimating the distribution itself requires the sample size to scale linearly with the alphabet size, it is possible to accurately estimate functionals of the distribution, such as entropy or mutual information, with sub-linear sample complexity. This requires going beyond the conventional wisdom by developing more sophisticated approaches than maximal likelihood (?plug-in?) estimation. The other major facet of this project is translating the new theoretical methodologies into highly scalable and efficient machine learning algorithms, thereby significantly impacting current real-world learning practices and significantly boosting the performance in several of the most prevalent machine learning applications, such as learning graphical models, that rely on mutual information estimation.

信息论的一个关键任务是描述压缩、通信和涉及信息存储、传输和处理的更一般操作问题中的基本性能极限。这种表征通常是根据信息度量，其中最基本的是香农熵和互信息。除了在传统信息论领域中的重要作用外，信息测度在许多统计建模和机器学习任务中也有许多应用。各种现代数据分析应用程序处理的数据集自然被视为一个大的域上的概率分布的样本。由于通常较大的字母表大小和资源限制，从业者在从语料库语言学到神经科学的应用中难以进行欠采样。这个项目的主要目标之一是发展一个通用的理论基础上的一套新的数学工具，这将有利于建设和分析的最佳估计信息措施的大字母。该项目的另一个主要方面是将新的理论方法纳入机器学习算法，从而对当前的现实世界学习实践产生重大影响。该项目的成功完成将导致使能技术和实用方案-从神经反应数据分析到学习图形模型的应用-可以证明比现有的更接近实现基本性能极限。该项目的研究结果将丰富现有的大数据分析课程。一个新的课程，致力于高维统计推断，解决估计大字母表数据的深度将创建和提供。将在斯坦福大学和UIUC组织和举办关于该项目主题和研究结果的讲习班。一个全面的近似理论的方法来估计大字母表上的分布泛函将开发通过计算效率高的程序的基础上最佳多项式近似，可证明的基本最优性保证。植根于高维统计文献中，我们的关键观察是，虽然估计分布本身需要样本大小与字母表大小线性缩放，但可以准确估计分布的泛函，例如熵或互信息，具有次线性样本复杂性。这需要超越传统的智慧，开发比最大似然更复杂的方法（？插件？）估计。该项目的另一个主要方面是将新的理论方法转化为高度可扩展和高效的机器学习算法，从而显着影响当前的现实世界的学习实践，并显着提高几个最流行的机器学习应用程序的性能，例如学习图形模型，依赖于互信息估计。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Tsachy Weissman其他文献

Communication-Efficient Federated Learning through Importance Sampling

通过重要性采样实现高效沟通的联邦学习

DOI：
10.48550/arxiv.2306.12625
发表时间：
2023
期刊：
ArXiv
影响因子：
0
作者：
Berivan Isik;Francesco Pase;Deniz Gündüz;Oluwasanmi Koyejo;Tsachy Weissman;Michele Zorzi
通讯作者：
Michele Zorzi