Microphone array processing techniques for the enhancement, dereverberation and separation of speech
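Basic information
- Grant number: 477494-2014
- Principal investigator:
- Amount: $58,300
- Host institution:
- Host institution country: Canada
- Program: Collaborative Research and Development Grants
- Fiscal year: 2015
- Funding country: Canada
- Period: 2015-01-01 to 2016-12-31
- Status: Completed
- Source:
- Keywords:
Project abstract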
Consumer products for voice and multimedia communications over wireless and internet access, e.g. cell phones, handheld tablets and VoIP phones, make extensive use of digital speech processing (DSP) for the transmission, storage and playback of information. Speech enhancement, as a core technology and front-end to speech coding or automatic speech recognition, plays a key role in these devices. Indeed, as the use of these devices becomes widespread, the voice signals received by their microphones are exposed to increasingly adverse acoustic disturbances, including reverberation and non-stationary noise, which degrade the quality and intelligibility of the desired speech. Hence, there is a strong demand for improved enhancement techniques that can suppress these acoustic disturbances and produce a cleaner speech signal for transmission. In this project, we will investigate new DSP techniques for the enhancement of reverberant speech and its separation from a non-stationary acoustic background. Our work aims to achieve the following objectives: (1) extension of multi-channel short-time spectral amplitude (STSA) speech estimators to reverberant environments; (2) development of joint dereverberation and noise reduction techniques based on subband decomposition; (3) investigation of non-negative matrix factorization (NMF) algorithms for speech separation; and (4) system integration and validation with a microphone array. The proposed research is an extension of a successfully completed NSERC CRD project sponsored by the industrial partner Microsemi. In spite of the growing popularity of the company's broadband voice processing platforms, the reduction of non-stationary noise, the dereverberation of speech and its separation from acoustic backgrounds remain challenging problems that hinder further technological progress. The proposed research aims to push back these limitations and provide Microsemi with innovative, cost-effective solutions to these challenges.
In addition to technology transfer, the project will contribute to advancing the engineering discipline of speech and audio processing and to the training of highly qualified personnel in this area at McGill and Concordia Universities.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other publications by Champagne, Benoit
Multi-State Second-Order Nonlinear Optical Switches Incorporating One to Three Benzazolo-Oxazolidine Units: A Quantum Chemistry Investigation.
- DOI: 10.3390/molecules27092770
- Publication date: 2022-04-26
- Journal:
- Impact factor: 4.6
- Authors: Beaujean, Pierre; Sanguinet, Lionel; Rodriguez, Vincent; Castet, Frederic; Champagne, Benoit
- Corresponding author: Champagne, Benoit
Signature of multiradical character in second hyperpolarizabilities of rectangular graphene nanoflakes
- DOI: 10.1016/j.cplett.2010.03.013
- Publication date: 2010-04-09
- Journal:
- Impact factor: 2.8
- Authors: Nagai, Hiroshi; Nakano, Masayoshi; Champagne, Benoit
- Corresponding author: Champagne, Benoit
TDDFT investigation of the optical properties of cyanine dyes
- DOI: 10.1016/j.cplett.2006.05.009
- Publication date: 2006-07-03
- Journal:
- Impact factor: 2.8
- Authors: Champagne, Benoit; Guillaume, Maxime; Zutterman, Freddy
- Corresponding author: Zutterman, Freddy
Theoretical study on the spin state and open-shell character dependences of the second hyperpolarizability in hydrogen chain models
- DOI: 10.1103/physreva.94.042515
- Publication date: 2016-10-28
- Journal:
- Impact factor: 2.9
- Authors: Matsui, Hiroshi; Nakano, Masayoshi; Champagne, Benoit
- Corresponding author: Champagne, Benoit
X Polarizabilities and hyperpolarizabilities
- DOI: 10.1039/9781849730884-00043
- Publication date: 2010-01-01
- Journal:
- Impact factor: 0
- Authors: Champagne, Benoit
- Corresponding author: Champagne, Benoit
Other grants by Champagne, Benoit
Array Signal Processing Techniques for Terahertz Communications and Sensing
- Grant number: RGPIN-2022-03678
- Fiscal year: 2022
- Amount: $58,300
- Program: Discovery Grants Program - Individual
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: RGPIN-2017-04223
- Fiscal year: 2021
- Amount: $58,300
- Program: Discovery Grants Program - Individual
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: DGDND-2017-00019
- Fiscal year: 2020
- Amount: $58,300
- Program: DND/NSERC Discovery Grant Supplement
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: RGPIN-2017-04223
- Fiscal year: 2020
- Amount: $58,300
- Program: Discovery Grants Program - Individual
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: DGDND-2017-00019
- Fiscal year: 2019
- Amount: $58,300
- Program: DND/NSERC Discovery Grant Supplement
Deep Learning Technologies for Acoustic Echo Cancellation in Dynamic Environments
- Grant number: 543348-2019
- Fiscal year: 2019
- Amount: $58,300
- Program: Engage Grants Program
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: RGPIN-2017-04223
- Fiscal year: 2019
- Amount: $58,300
- Program: Discovery Grants Program - Individual
Deep neural network-based speech enhancement for robust speech recognition in smart home device
- Grant number: 515072-2017
- Fiscal year: 2019
- Amount: $58,300
- Program: Collaborative Research and Development Grants
Deep neural network-based speech enhancement for robust speech recognition in smart home device
- Grant number: 515072-2017
- Fiscal year: 2018
- Amount: $58,300
- Program: Collaborative Research and Development Grants
Signal Processing Techniques for 5G Wireless and mm-Wave Communications
- Grant number: DGDND-2017-00019
- Fiscal year: 2018
- Amount: $58,300
- Program: DND/NSERC Discovery Grant Supplement
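Similar grants from the National Natural Science Foundation of China
Construction of an "Array on One Particle" sensing system based on multi-bandgap photonic crystal microspheres
- Grant number: 21902147
- Approval year: 2019
- Amount: CNY 270,000
- Program: Young Scientists Fund
Research on DOA estimation methods based on super-resolution on the sphere
- Grant number: 61601402
- Approval year: 2016
- Amount: CNY 210,000
- Program: Young Scientists Fund
Study of protein expression signatures for early warning of lymph node metastasis in gastric cancer, guided by protein pathway array technology
- Grant number: 81372295
- Approval year: 2013
- Amount: CNY 620,000
- Program: General Program
Study of the mechanism by which the BH3-only protein mimetic small-molecule compound S1 regulates the endoplasmic reticulum stress-autophagy pathway
- Grant number: 81141099
- Approval year: 2011
- Amount: CNY 100,000
- Program: Special Fund Program
Differential expression of small non-coding RNAs associated with the epidermal growth factor receptor gene in lung cancer in non-smokers
- Grant number: 81071914
- Approval year: 2010
- Amount: CNY 360,000
- Program: General Program
Molecular regulatory mechanisms of VEGF and PEDF expression in retinal Müller cells cultured in high glucose
- Grant number: 81070736
- Approval year: 2010
- Amount: CNY 320,000
- Program: General Program
Experimental study, guided by EnSite array, of the mechanisms of chronic atrial fibrillation unresponsive to the stepwise approach and of ablation line design
- Grant number: 81070152
- Approval year: 2010
- Amount: CNY 100,000
- Program: General Program
Molecular network regulatory mechanisms of the effect of desflurane preconditioning on hypoxia/reoxygenation injury in endothelial cells
- Grant number: 30972838
- Approval year: 2009
- Amount: CNY 310,000
- Program: General Program
Genomic DNA alterations predicting liver metastasis after radical resection of rectal cancer
- Grant number: 30950013
- Approval year: 2009
- Amount: CNY 300,000
- Program: Special Fund Program
Cooperative broadcasting based on virtual multiple antennas
- Grant number: 60972076
- Approval year: 2009
- Amount: CNY 300,000
- Program: General Program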
Similar overseas grants
Sparse Sensor Array Design and Processing
- Grant number: 2236023
- Fiscal year: 2023
- Amount: $58,300
- Program: Standard Grant
The Genetics of Personalized Functional MRI Networks
- Grant number: 10650032
- Fiscal year: 2023
- Amount: $58,300
- Program:
Wearable Array for Ultrasound Stimulation on the Retina
- Grant number: 10766622
- Fiscal year: 2023
- Amount: $58,300
- Program:
Noncontact Measurement of Multiple Sites of Multiple People Using Array Radar Signal Processing Based on Mathematical Model of Body Displacement
- Grant number: 23H01420
- Fiscal year: 2023
- Amount: $58,300
- Program: Grant-in-Aid for Scientific Research (B)
Array Signal Processing Techniques for Terahertz Communications and Sensing
- Grant number: RGPIN-2022-03678
- Fiscal year: 2022
- Amount: $58,300
- Program: Discovery Grants Program - Individual
Broadband sensor array measurements, signal processing, detection and classification of rare events
- Grant number: RGPIN-2019-04902
- Fiscal year: 2022
- Amount: $58,300
- Program: Discovery Grants Program - Individual
A multi-modal wireless oscillator array for high-resolution mapping of neurovascular coupling
- Grant number: 10516470
- Fiscal year: 2022
- Amount: $58,300
- Program:
Mechanosensing and Mechanotransduction in the Endothelial Nucleus
- Grant number: 10536215
- Fiscal year: 2022
- Amount: $58,300
- Program: