New Techniques for Audio, Image, and Video Compression
音频、图像和视频压缩新技术
基本信息
- 批准号:9523767
- 负责人:
- 金额:$ 36.91万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:1996
- 资助国家:美国
- 起止时间:1996-03-01 至 2000-02-29
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
NCR-9523767 Pearlman, William A. Rensselaer Polytechnic Institute New Techniques for Audio, Image, and Video Compression The goal of this research is to investigate more effective techniques for digital audio, image, and video data compression. For audio, the intention is to employ more efficient entropy-constrained, conditional and unconditional, quantization in the coding of the critical subbands of the MPEG audio standard. The dependence of the subbands is next exploited using set partitioning in hierarchical trees (SPIHT) as done originally in embedded, wavelet, zerotree coding of images. The resulting embedded coding allows delivery of different qualities of service with a single compressed bit stream. A newly developed SPIHT procedure, which obtained probably the best-to-date monochrome, still image results (lossy or lossless) and with extremely fast encoding and decoding, is utilized for audio and also for color image and video coding. The extremely fast execution of the SPIHT coding procedure makes it especially suitable for video, where a requirement of real-time coding becomes an insurmountable obstacle for other procedures. Both lossy and lossless coding are investigated, as they fit naturally into the SPIHT framework. It is expected that the results will surpass those of existing standard systems, which do not produce naturally embedded code, and are far more complex operationally and hence slower in execution. Other investigations involve the use of information theoretic criteria to create optimal adaptive subband compositions and the use of trellis coded quantization (TCQ) in block discrete cosine transform (DCT) and lapped orthogonal transform (LOT) coding. The adaptive subband decompositions pay off in better compression for a given subband coding technique. TCQ coding brings an efficient, but complex technique operating on transform blocks, which preserve regional characteristics of the image. Finally, the SPIHT coding algorithm pr ovokes several theoretical questions which are addressed. Among them are the minimal information rates to determine octave ranges of random variables and convey subset partition numbers and element numbers, and the mutual information between significant bits of dependent random variables and between their respective residual bits. Satisfactory answers to these questions will lead to more efficient coding techniques with decreased complexity.
9523767 Pellman,William A.Rensselaer理工学院音频、图像和视频压缩的新技术本研究的目标是研究更有效的数字音频、图像和视频数据压缩技术。对于音频,其目的是在对MPEG音频标准的关键子带进行编码时使用更有效的受熵约束的、有条件的和无条件的量化。如最初在图像的嵌入、小波、零树编码中所做的那样,接下来使用分层树中的集合分割(SPIHT)来利用子带的相关性。由此产生的嵌入式编码允许用单个压缩比特流传递不同的服务质量。一种新开发的SPIHT过程被用于音频以及彩色图像和视频编码,该过程获得了可能是迄今为止最好的单色静止图像结果(有损或无损),并且具有极快的编码和解码。SPIHT编码过程的极快执行使其特别适用于视频,其中实时编码的要求成为其他过程不可逾越的障碍。对有损和无损编码都进行了研究,因为它们自然地适合SPIHT框架。预计结果将超过现有标准系统,后者不会产生自然嵌入的代码,并且在操作上要复杂得多,因此执行速度会更慢。其他研究包括使用信息论准则来创建最优自适应子带组成,以及在块离散余弦变换(DCT)和重叠正交变换(LOT)编码中使用网格编码量化(TCQ)。对于给定子带编码技术,自适应子带分解在更好的压缩中是值得的。TCQ编码提供了一种在变换块上操作的高效但复杂的技术,该技术保留了图像的区域特征。最后,对SPIHT编码算法提出的几个理论问题进行了讨论。其中包括确定随机变量的倍频程范围并传递子集分区数和元素数的最小信息率,以及相依随机变量的有效位之间以及它们各自的残差位之间的互信息。对这些问题的满意回答将导致更有效的编码技术和更低的复杂性。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
William Pearlman其他文献
William Pearlman的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('William Pearlman', 18)}}的其他基金
CISE Research Resources: Processing and Display of Volume Images and High Resolution Image Sequences
CISE 研究资源:体积图像和高分辨率图像序列的处理和显示
- 批准号:
0224433 - 财政年份:2002
- 资助金额:
$ 36.91万 - 项目类别:
Standard Grant
Industry/University Cooperative Research Center for Digital Video
数字视频产学合作研究中心
- 批准号:
9615143 - 财政年份:1996
- 资助金额:
$ 36.91万 - 项目类别:
Standard Grant
EFFICIENT IMAGE CODING OF PYRAMIDAL STRUCTURES FOR THE HUMAN VISUAL SYSTEM
人类视觉系统金字塔结构的高效图像编码
- 批准号:
9004758 - 财政年份:1990
- 资助金额:
$ 36.91万 - 项目类别:
Continuing Grant
U.S.-Brazil Cooperative Research in Source Coding and Digital Phase Modulation
美国-巴西在源编码和数字相位调制方面的合作研究
- 批准号:
8802240 - 财政年份:1988
- 资助金额:
$ 36.91万 - 项目类别:
Standard Grant
Adaptation and Channel Error Susceptibility of Optimal Source Codes for Speech and Images
语音和图像最优源代码的自适应和信道误差敏感性
- 批准号:
8610029 - 财政年份:1987
- 资助金额:
$ 36.91万 - 项目类别:
Continuing Grant
Adrenocortical Hormone Metabolism By Mammary Glands
乳腺的肾上腺皮质激素代谢
- 批准号:
8119941 - 财政年份:1982
- 资助金额:
$ 36.91万 - 项目类别:
Standard Grant
Image Coding and Filtering With Squared-Error and Visual-Error Distortion Measures
使用平方误差和视觉误差失真测量进行图像编码和滤波
- 批准号:
8117410 - 财政年份:1982
- 资助金额:
$ 36.91万 - 项目类别:
Continuing Grant
Image Coding With a New Visual Error Criterion
使用新的视觉误差标准进行图像编码
- 批准号:
8003544 - 财政年份:1980
- 资助金额:
$ 36.91万 - 项目类别:
Standard Grant
Adrenocortical Hormone Metabolism in Mammary Gland
乳腺中的肾上腺皮质激素代谢
- 批准号:
7800578 - 财政年份:1978
- 资助金额:
$ 36.91万 - 项目类别:
Continuing Grant
相似国自然基金
EstimatingLarge Demand Systems with MachineLearning Techniques
- 批准号:
- 批准年份:2024
- 资助金额:万元
- 项目类别:外国学者研究基金
相似海外基金
Machine Learning Techniques for Transforming Voice Audio
用于转换语音音频的机器学习技术
- 批准号:
563164-2021 - 财政年份:2021
- 资助金额:
$ 36.91万 - 项目类别:
University Undergraduate Student Research Awards
Enhancing Audio Transformation through the Integration of Machine Learning and Digital Signal Processing Techniques
通过机器学习和数字信号处理技术的集成增强音频转换
- 批准号:
2856271 - 财政年份:2021
- 资助金额:
$ 36.91万 - 项目类别:
Studentship
3D audio techniques for acoustic monitoring of rainforest biodiversity
用于雨林生物多样性声学监测的 3D 音频技术
- 批准号:
2162833 - 财政年份:2018
- 资助金额:
$ 36.91万 - 项目类别:
Studentship
Development of novel signal processing techniques for audio reproduction using loudspeaker arrays
开发用于使用扬声器阵列进行音频再现的新型信号处理技术
- 批准号:
2106106 - 财政年份:2018
- 资助金额:
$ 36.91万 - 项目类别:
Studentship
Novel audio watermarking techniques for tracing multimedia piracy
用于追踪多媒体盗版的新颖音频水印技术
- 批准号:
LP170100458 - 财政年份:2018
- 资助金额:
$ 36.91万 - 项目类别:
Linkage Projects
Power supply compensation techniques for mitigation of audio output distortion
用于减轻音频输出失真的电源补偿技术
- 批准号:
460778-2013 - 财政年份:2013
- 资助金额:
$ 36.91万 - 项目类别:
Experience Awards (previously Industrial Undergraduate Student Research Awards)
A Disaster Prevention Broadcasting System Based on An Effective Audio Watermarking Scheme Using Spread Spectrum Techniques
基于扩频技术的有效音频水印方案的防灾广播系统
- 批准号:
24510240 - 财政年份:2012
- 资助金额:
$ 36.91万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
A Study on QoE-based Quality Enhancement Techniques for Multi-View Video and Audio IP Transmission
基于QoE的多视点音视频IP传输质量增强技术研究
- 批准号:
23760332 - 财政年份:2011
- 资助金额:
$ 36.91万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
iConcipio aims to offer higher education institutions through a web-based system called My SkillPal a variety of self-learning techniques for students using audio-visual input to address the difficulties, both psychological such as anxiety or stress
iConcipio 旨在通过名为 My SkillPal 的网络系统为高等教育机构提供多种自学技术,让学生使用视听输入来解决焦虑或压力等心理困难
- 批准号:
710060 - 财政年份:2011
- 资助金额:
$ 36.91万 - 项目类别:
GRD Proof of Concept
Cloddfa (Quarry) generating liminal experience of place through the use of sound-led recording and editing techniques in audio/visual installation
Cloddfa(采石场)通过在音频/视频装置中使用声音主导的录音和编辑技术来产生场所的阈限体验
- 批准号:
AH/G015724/1 - 财政年份:2009
- 资助金额:
$ 36.91万 - 项目类别:
Research Grant