Multi-Speaker Separation in Reverberant Room Using Velocity-Based Microphone
使用基于力度的麦克风在混响室中进行多扬声器分离
基本信息
- 批准号:531229-2018
- 负责人:
- 金额:$ 1.82万
- 依托单位:
- 依托单位国家:加拿大
- 项目类别:Engage Grants Program
- 财政年份:2018
- 资助国家:加拿大
- 起止时间:2018-01-01 至 2019-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Audio signal separation has found many applications in electrical and computer engineering fields, such as**automatic speech recognition, voice communication, hearing aids etc. These applications have driven the need**of directional microphone solutions that can pick up only the sound from the desired speaker while suppressing**the voices from other speakers as well as the background noise. Conventional solutions using array of**pressure-based microphones combined with digital signal processing (DSP) techniques do not work well in**space-constrained applications where the captured sound contains mixed voices and ambient noises. Soundskrit**is developing a revolutionary fiber-based directional microphone technology that can directly measure the**particle velocity of an incoming sound wave. Currently, the Soundskrit R&D team lacks signal processing**expertise that is required to develop sound separation software based on the real data acquired by the new**fiber-based microphone in a multi-speaker and reverberant environment.**In this project, we intend to develop DSP algorithms for the Soundskrit's prototype to separate multiple moving**speakers in a reverberant room. The project has two main objectives: (1) localizing active sources in a**reverberant room using sound intensity, and (2) beamforming toward the desired speaker to attenuate unwanted**sounds and noise. The proposed work is divided into three phases. Phase 1 is to build a baseline system to**study the performance of the existing solutions via computer simulation. Phase 2 is to develop DSP algorithms**for the Soundskrit prototype to achieve the objectives in a simulated environment and compare it with the**solutions in Phase 1. In Phase 3, real data will be collected using Soundskrit's prototype in several reverberant**rooms, and the algorithm will be tested and evaluated in real-time. The research results achieved in this project**will be transferred to Soundskrit through a detailed technical report including all software codes and hardware**design.
音频信号分离在电子和计算机工程领域有许多应用,例如自动语音识别,语音通信,助听器等。这些应用推动了对定向麦克风解决方案的需求,该解决方案可以只拾取来自所需扬声器的声音,同时抑制来自其他扬声器的声音以及背景噪声。传统的解决方案使用阵列的压力为基础的麦克风与数字信号处理(DSP)技术相结合,并不适用于 ** 空间受限的应用,其中捕获的声音包含混合的语音和环境噪声。Soundskrit** 正在开发一种革命性的基于光纤的定向麦克风技术,可以直接测量传入声波的粒子速度。目前,Soundskrit研发团队缺乏信号处理 ** 专业知识,无法根据新型 ** 光纤麦克风在多扬声器和混响环境中获取的真实的数据开发声音分离软件。**在这个项目中,我们打算为Soundskrit的原型开发DSP算法,以在混响室中分离多个移动扬声器。该项目有两个主要目标:(1)使用声音强度在混响室中定位活动源,以及(2)朝向所需扬声器进行波束成形以衰减不需要的声音和噪声。拟议的工作分为三个阶段。第一阶段是建立一个基线系统,通过计算机模拟研究现有解决方案的性能。第2阶段是为Soundskrit原型开发DSP算法 **,以在模拟环境中实现目标,并将其与第1阶段的 ** 解决方案进行比较。在第三阶段,将使用Soundskrit的原型在几个混响室中收集真实的数据,并对算法进行实时测试和评估。本项目 ** 取得的研究成果将通过一份详细的技术报告(包括所有软件代码和硬件 ** 设计)转移给Soundskrit。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Zhu, WeiPing其他文献
Zhu, WeiPing的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Zhu, WeiPing', 18)}}的其他基金
Advanced Signal Processing Enabled Massive MIMO With NOMA
先进的信号处理通过 NOMA 实现大规模 MIMO
- 批准号:
RGPIN-2020-06815 - 财政年份:2022
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Advanced Signal Processing Enabled Massive MIMO With NOMA
先进的信号处理通过 NOMA 实现大规模 MIMO
- 批准号:
RGPIN-2020-06815 - 财政年份:2021
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Advanced Signal Processing Enabled Massive MIMO With NOMA
先进的信号处理通过 NOMA 实现大规模 MIMO
- 批准号:
RGPIN-2020-06815 - 财政年份:2020
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Robust and Energy Efficient Signal Processing for Massive MIMO Communication
用于大规模 MIMO 通信的稳健且节能的信号处理
- 批准号:
RGPIN-2015-04550 - 财政年份:2019
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Robust and Energy Efficient Signal Processing for Massive MIMO Communication
用于大规模 MIMO 通信的稳健且节能的信号处理
- 批准号:
RGPIN-2015-04550 - 财政年份:2018
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Robust and Energy Efficient Signal Processing for Massive MIMO Communication
用于大规模 MIMO 通信的稳健且节能的信号处理
- 批准号:
RGPIN-2015-04550 - 财政年份:2017
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Research Equipment for Intelligent Signal Processing on Millimeter Wave MIMO Communication in 5G Networks
5G网络毫米波MIMO通信智能信号处理研究设备
- 批准号:
RTI-2018-00983 - 财政年份:2017
- 资助金额:
$ 1.82万 - 项目类别:
Research Tools and Instruments
Robust and Energy Efficient Signal Processing for Massive MIMO Communication
用于大规模 MIMO 通信的稳健且节能的信号处理
- 批准号:
RGPIN-2015-04550 - 财政年份:2016
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Robust and Energy Efficient Signal Processing for Massive MIMO Communication
用于大规模 MIMO 通信的稳健且节能的信号处理
- 批准号:
RGPIN-2015-04550 - 财政年份:2015
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
Advanced signal processing for MIMO cooperative communication
用于 MIMO 协作通信的高级信号处理
- 批准号:
227602-2010 - 财政年份:2014
- 资助金额:
$ 1.82万 - 项目类别:
Discovery Grants Program - Individual
相似海外基金
危機言語コミュニティにおけるNew Speakerの育成
在濒危语言社区培养新的使用者
- 批准号:
24K00069 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Grant-in-Aid for Scientific Research (B)
From Native-Speaker Norms to Global Englishes (GE): Integrating GE Pedagogy in English Teacher Education in Japan
从母语人士规范到全球英语 (GE):将 GE 教育学融入日本英语教师教育
- 批准号:
24K16136 - 财政年份:2024
- 资助金额:
$ 1.82万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Development of Speech Synthesis System for Controlling Speaker Identity through Text Prompts and Visual Interfaces
通过文本提示和可视化界面控制说话人身份的语音合成系统的开发
- 批准号:
23K20017 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Understanding the role of pitch cues in multi-speaker environments
了解音高提示在多扬声器环境中的作用
- 批准号:
2886867 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Studentship
A Study on Utterance Style-dependent Speaker Verification
依赖于话语风格的说话人验证研究
- 批准号:
23K11165 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Neurocognitive and behavioral constituents of nonverbal speaker-listener attunement during science communication
科学传播过程中非语言说者-听者协调的神经认知和行为成分
- 批准号:
2302608 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Standard Grant
Articulatory and prosodic sensorimotor adaptation in speaker-listener interactions
说话者与听众互动中的发音和韵律感觉运动适应
- 批准号:
10675968 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Multi-Organ Transplant Speaker Series: Women in Transplant Day
多器官移植演讲者系列:女性移植日
- 批准号:
480804 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Miscellaneous Programs
HomePal: Developing a Smart Speaker-Based System for In-Home Loneliness Assessment for Older Adults
HomePal:开发基于智能扬声器的系统,用于老年人的家庭孤独评估
- 批准号:
10725229 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
In-between speech: the speaker-specificity of non-speech vocalisations
中间语音:非语音发声的说话人特异性
- 批准号:
2890545 - 财政年份:2023
- 资助金额:
$ 1.82万 - 项目类别:
Studentship