Speech enhancement algorithms for multimedia applications

多媒体应用的语音增强算法

基本信息

  • 批准号:
    283137-2009
  • 负责人:
  • 金额:
    $ 2.62万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2015
  • 资助国家:
    加拿大
  • 起止时间:
    2015-01-01 至 2016-12-31
  • 项目状态:
    已结题

项目摘要

Speech enhancement plays a fundamental role in many applications such as teleconferencing systems, speech recognizers, human-machine interfaces, hands-free communications, mobile phones, and hearing aids. Indeed, the noise is around us and everywhere. So when a signal of interest (usually speech) is picked up by a microphone, it is always contaminated by noise, reverberation, and other undesired signals. The objective is then to clean up the noisy signal with digital signal processing tools without damaging much the desired speech signal. The problem of speech enhancement is an old one and has been around for more than 40 years. This should not come as a surprise since, contrary to what some people may believe, it is a very difficult problem for mainly two reasons. First, the nature and characteristics of the noise signals can change dramatically in time and application to application. It is therefore laborious to find versatile algorithms that really work in different practical environments. Second, the performance measure can also be defined differently for each application. Two perceptual criteria are widely used to measure the performance: quality and intelligibility. While the former is subjective (it reflects individual preferences of listeners), the latter is objective (it gives the percentage of words that could be correctly identified by listeners). It is very hard to satisfy both at the same time. So this research proposal aims at developing the next-generation speech enhancement algorithms with one and, especially, multiple microphones (or microphone arrays) with the clear objective to reduce the noise and other interference signals as much as possible with little (or no) distortion of the desired speech signal. In other words, we will try to improve the quality as much as possible without affecting the intelligibility. We will also try to develop practical algorithms that can be easy to use and easy to tune in the real world and for most applications. To achieve this goal, we will rely on some strong ideas and results we have developed recently.
语音增强在诸如电话会议系统、语音识别器、人机接口、免提通信、移动的电话和助听器的许多应用中起着基本作用。事实上,噪音就在我们周围,到处都是。因此,当麦克风拾取感兴趣的信号(通常是语音)时,它总是受到噪声,混响和其他不需要的信号的污染。然后,目标是用数字信号处理工具清除噪声信号,而不会对期望的语音信号造成太大的损害。 语音增强是一个古老的问题,已经存在了40多年。这不应令人感到意外,因为与某些人可能认为的相反,这是一个非常困难的问题,主要有两个原因。首先,噪声信号的性质和特征可以随时间和应用而显著变化。因此,要找到真正适用于不同实际环境的通用算法是很困难的。其次,性能度量也可以针对每个应用程序进行不同的定义。两个感知标准被广泛用于衡量性能:质量和可懂度。前者是主观的(它反映了听众的个人偏好),后者是客观的(它给出了听众可以正确识别的单词的百分比)。很难同时满足两者。 因此,本研究的目的是开发下一代语音增强算法与一个,特别是,多个麦克风(或麦克风阵列)的明确目标,以减少噪声和其他干扰信号尽可能少(或没有)失真的期望的语音信号。换句话说,我们将在不影响清晰度的情况下尽可能提高质量。我们还将尝试开发实用的算法,这些算法可以在真实的世界和大多数应用中易于使用和调整。为了实现这一目标,我们将依靠我们最近开发的一些强有力的想法和成果。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Benesty, Jacob其他文献

A Robust Variable Forgetting Factor Recursive Least-Squares Algorithm for System Identification
  • DOI:
    10.1109/lsp.2008.2001559
  • 发表时间:
    2008-01-01
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Paleologu, Constantin;Benesty, Jacob;Ciochina, Silviu
  • 通讯作者:
    Ciochina, Silviu
Study of the General Kalman Filter for Echo Cancellation
Design of Planar Differential Microphone Arrays With Fractional Orders
Steered Beamforming Approaches for Acoustic Source Localization
A Widely Linear Distortionless Filter for Single-Channel Noise Reduction
  • DOI:
    10.1109/lsp.2010.2043152
  • 发表时间:
    2010-05-01
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Benesty, Jacob;Chen, Jingdong;Huang, Yiteng (Arden)
  • 通讯作者:
    Huang, Yiteng (Arden)

Benesty, Jacob的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Benesty, Jacob', 18)}}的其他基金

Microphone Arrays for Immersive Voice Communications
用于沉浸式语音通信的麦克风阵列
  • 批准号:
    RGPIN-2018-05223
  • 财政年份:
    2022
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Microphone Arrays for Immersive Voice Communications
用于沉浸式语音通信的麦克风阵列
  • 批准号:
    RGPIN-2018-05223
  • 财政年份:
    2021
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Microphone Arrays for Immersive Voice Communications
用于沉浸式语音通信的麦克风阵列
  • 批准号:
    RGPIN-2018-05223
  • 财政年份:
    2020
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Microphone Arrays for Immersive Voice Communications
用于沉浸式语音通信的麦克风阵列
  • 批准号:
    RGPIN-2018-05223
  • 财政年份:
    2019
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Microphone Arrays for Immersive Voice Communications
用于沉浸式语音通信的麦克风阵列
  • 批准号:
    RGPIN-2018-05223
  • 财政年份:
    2018
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Speech enhancement algorithms for multimedia applications
多媒体应用的语音增强算法
  • 批准号:
    283137-2009
  • 财政年份:
    2012
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Speech enhancement algorithms for multimedia applications
多媒体应用的语音增强算法
  • 批准号:
    283137-2009
  • 财政年份:
    2011
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Speech enhancement algorithms for multimedia applications
多媒体应用的语音增强算法
  • 批准号:
    283137-2009
  • 财政年份:
    2010
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Speech enhancement algorithms for multimedia applications
多媒体应用的语音增强算法
  • 批准号:
    283137-2009
  • 财政年份:
    2009
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Advanced audio signal processing for multimedia communication systems
用于多媒体通信系统的高级音频信号处理
  • 批准号:
    283137-2004
  • 财政年份:
    2008
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual

相似国自然基金

泛文化的自我促进:基于中国人的行为和认知神经的新证据
  • 批准号:
    31070919
  • 批准年份:
    2010
  • 资助金额:
    32.0 万元
  • 项目类别:
    面上项目
纳米涂层表面上池沸腾防垢和强化传热的机理研究
  • 批准号:
    20876106
  • 批准年份:
    2008
  • 资助金额:
    35.0 万元
  • 项目类别:
    面上项目

相似海外基金

Leveraging Natural Language Processing for Reverberant Speech Enhancement in Cochlear Implants
利用自然语言处理增强人工耳蜗的混响语音
  • 批准号:
    10755798
  • 财政年份:
    2023
  • 资助金额:
    $ 2.62万
  • 项目类别:
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPIN-2020-05077
  • 财政年份:
    2022
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPAS-2020-00112
  • 财政年份:
    2022
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPAS-2020-00112
  • 财政年份:
    2021
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPIN-2020-05077
  • 财政年份:
    2021
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPIN-2020-05077
  • 财政年份:
    2020
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Individual
Design and implementation of spiking neural network based speech enhancement algorithms
基于尖峰神经网络的语音增强算法的设计与实现
  • 批准号:
    RGPAS-2020-00112
  • 财政年份:
    2020
  • 资助金额:
    $ 2.62万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Wearable silent speech technology to enhance impaired oral communication
可穿戴式无声语音技术可增强受损的口语交流
  • 批准号:
    10218134
  • 财政年份:
    2019
  • 资助金额:
    $ 2.62万
  • 项目类别:
Wearable silent speech technology to enhance impaired oral communication
可穿戴式无声语音技术可增强受损的口语交流
  • 批准号:
    10456297
  • 财政年份:
    2019
  • 资助金额:
    $ 2.62万
  • 项目类别:
Wearable silent speech technology to enhance impaired oral communication
可穿戴式无声语音技术可增强受损的口语交流
  • 批准号:
    10669628
  • 财政年份:
    2019
  • 资助金额:
    $ 2.62万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了