Information Theoretic Research on Big Data Compression and Analytics: Theory, Algorithms, and Applications
大数据压缩与分析的信息论研究:理论、算法与应用
基本信息
- 批准号:RGPIN-2016-03871
- 负责人:
- 金额:$ 3.28万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Discovery Grants Program - Individual
- 财政年份:2019
- 资助国家:加拿大
- 起止时间:2019-01-01 至 2020-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
With an explosion in data sets in our society, we are at the beginning of a big data revolution. Big data has a potential to accelerate the pace of discovery in science, engineering, and medicine, improve healthcare, finance, business, and our lives, and ultimately transform our society. ***To tap the opportunities afforded by the big data revolution, however, many challenges of big data have to be carefully addressed. For example, to fast access remotely, and reduce the on-going cost of maintaining, huge volumes of digital data, it is desirable to compress data as much as possible. Likewise, to glean knowledge and discoveries from huge volumes of digital data, tools and techniques based on context analytics have to be developed to organize, visualize, and manage huge volumes of digital data. Efficient data compression and accurate context analytics are of vital importance to turning big data to knowledge and discoveries to actions while achieving resource (bandwidth and power) efficiency.****In the past, data compression and data analytics have been largely investigated separately in different disciplines. Although data compression has been around for many years, its existing theories and techniques, especially source coding theory from information theory, have been largely developed for stationary and/or well-structured data (such as text, web pages, etc.). In the context of big data, however, most data types are nonstationary and unstructured/semi-structured; applying existing compression techniques directly to these data types often leads to unsatisfied compression performance. Therefore, it is imperative to develop novel compression theories and techniques by incorporating data analytics into data compression to handle the lack of structure and stationarity in an elegant way. In the opposite direction, it is also beneficial to investigate how to apply information theoretic ideas, particularly source coding theory and techniques, to data analytics to develop better solutions for data analytics such as cognitive distance, clustering, and organization.****Building on our early success, in this research, we will investigate and explore the interactions of data compression and analytics to advance knowledge in both fields. Our approach will be information theoretic. Three theme areas will be focused on: (1) interactions of data compression and analytics to improve lossy compression performance for nonstationary and unstructured data; (2) interactions of data compression and analytics to develop better cognitive distances and hence provide better tools for data clustering and organization; and (3) compression of and pattern discovery in large bipartite graphs. In the process of achieving our scientific objectives, we will maintain and enhance our leadership in the related areas, and train highly qualified personnel in information and communications technology for Canada. **
随着我们社会中数据集的爆炸性增长,我们正处于一场大数据革命的开端。大数据有可能加快科学、工程和医学领域的发现步伐,改善医疗保健、金融、商业和我们的生活,并最终改变我们的社会。*然而,要利用大数据革命带来的机遇,必须谨慎应对大数据的许多挑战。例如,为了快速远程访问,并降低维护海量数字数据的持续成本,需要尽可能地压缩数据。同样,为了从海量的数字数据中收集知识和发现,必须开发基于上下文分析的工具和技术来组织、可视化和管理海量的数字数据。高效的数据压缩和准确的上下文分析对于将大数据转化为知识,将发现转化为行动,同时实现资源(带宽和电力)效率至关重要。*过去,数据压缩和数据分析在很大程度上是在不同学科中分开研究的。虽然数据压缩已经存在多年,但其现有的理论和技术,特别是信息论中的信源编码理论,在很大程度上是针对静态和/或良好结构的数据(如文本、网页等)而发展起来的。然而,在大数据环境中,大多数数据类型是非静态和非结构化/半结构化的;将现有压缩技术直接应用于这些数据类型通常会导致令人不满意的压缩性能。因此,发展新的压缩理论和技术势在必行,将数据分析融入数据压缩中,以优雅的方式处理缺乏结构性和平稳性的问题。相反,研究如何将信息论思想,特别是源代码编码理论和技术应用到数据分析中,以开发更好的数据分析解决方案,如认知距离、聚类和组织,也是有益的。*在我们早期成功的基础上,在本研究中,我们将调查和探索数据压缩和分析的交互作用,以促进这两个领域的知识。我们的方法将是信息论。三个主题领域将集中于:(1)数据压缩和分析的相互作用,以改善非平稳和非结构化数据的有损压缩性能;(2)数据压缩和分析的相互作用,以发展更好的认知距离,从而为数据集群和组织提供更好的工具;以及(3)在大型二分图中的压缩和模式发现。在实现我们的科学目标的过程中,我们将保持和加强我们在相关领域的领导地位,并为加拿大培养高素质的信息和通信技术人才。**
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Yang, EnHui其他文献
Yang, EnHui的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Yang, EnHui', 18)}}的其他基金
Information Theory and Applications
信息论及其应用
- 批准号:
CRC-2016-00083 - 财政年份:2020
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information Theory and Applications
信息论及其应用
- 批准号:
CRC-2016-00083 - 财政年份:2019
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information Theory and Applications
信息论及其应用
- 批准号:
CRC-2016-00083 - 财政年份:2018
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information Theory and Applications
信息论及其应用
- 批准号:
CRC-2016-00083 - 财政年份:2017
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information Theory and Multimedia Compression
信息论和多媒体压缩
- 批准号:
1000218763-2009 - 财政年份:2016
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information-theoretic research of video coding: theory and algorithms
视频编码的信息论研究:理论与算法
- 批准号:
203035-2011 - 财政年份:2015
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual
Information Theory and Multimedia Compression
信息论和多媒体压缩
- 批准号:
1218763-2009 - 财政年份:2015
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information Theory and Multimedia Compression
信息论和多媒体压缩
- 批准号:
1000218763-2009 - 财政年份:2014
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
Information-theoretic research of video coding: theory and algorithms
视频编码的信息论研究:理论与算法
- 批准号:
203035-2011 - 财政年份:2014
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual
Information Theory and Multimedia Compression
信息论和多媒体压缩
- 批准号:
1000218763-2009 - 财政年份:2013
- 资助金额:
$ 3.28万 - 项目类别:
Canada Research Chairs
相似海外基金
Collaborative Research: Mathematical Foundation of Learning with Information-Theoretic Criteria from Non-Gaussian Data
协作研究:利用非高斯数据的信息理论标准学习的数学基础
- 批准号:
2110826 - 财政年份:2021
- 资助金额:
$ 3.28万 - 项目类别:
Standard Grant
Collaborative Research: CIF: Medium: An Information-Theoretic Foundation for Adaptive Bidding in First-Price Auctions
合作研究:CIF:媒介:一价拍卖中自适应出价的信息理论基础
- 批准号:
2106467 - 财政年份:2021
- 资助金额:
$ 3.28万 - 项目类别:
Standard Grant
Collaborative Research: Mathematical Foundation of Learning with Information-Theoretic Criteria from Non-Gaussian Data
协作研究:利用非高斯数据的信息理论标准学习的数学基础
- 批准号:
2111080 - 财政年份:2021
- 资助金额:
$ 3.28万 - 项目类别:
Continuing Grant
Collaborative Research: CIF: Medium: An Information-Theoretic Foundation for Adaptive Bidding in First-Price Auctions
合作研究:CIF:媒介:一价拍卖中自适应出价的信息理论基础
- 批准号:
2106508 - 财政年份:2021
- 资助金额:
$ 3.28万 - 项目类别:
Standard Grant
Information Theoretic Research on Big Data Compression and Analytics: Theory, Algorithms, and Applications
大数据压缩与分析的信息论研究:理论、算法与应用
- 批准号:
RGPIN-2016-03871 - 财政年份:2021
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual
Information Theoretic Research on Big Data Compression and Analytics: Theory, Algorithms, and Applications
大数据压缩与分析的信息论研究:理论、算法与应用
- 批准号:
RGPIN-2016-03871 - 财政年份:2020
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual
CIF: Medium: Collaborative Research: Information-theoretic Guarantees on Privacy in the Age of Learning
CIF:媒介:协作研究:学习时代隐私的信息理论保证
- 批准号:
1901243 - 财政年份:2019
- 资助金额:
$ 3.28万 - 项目类别:
Continuing Grant
CIF: Medium: Collaborative Research: Information-theoretic Guarantees on Privacy in the Age of Learning
CIF:媒介:协作研究:学习时代隐私的信息理论保证
- 批准号:
1900750 - 财政年份:2019
- 资助金额:
$ 3.28万 - 项目类别:
Continuing Grant
Information Theoretic Research on Big Data Compression and Analytics: Theory, Algorithms, and Applications
大数据压缩与分析的信息论研究:理论、算法与应用
- 批准号:
RGPIN-2016-03871 - 财政年份:2018
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual
Information Theoretic Research on Big Data Compression and Analytics: Theory, Algorithms, and Applications
大数据压缩与分析的信息论研究:理论、算法与应用
- 批准号:
RGPIN-2016-03871 - 财政年份:2017
- 资助金额:
$ 3.28万 - 项目类别:
Discovery Grants Program - Individual