Collaborative Research: Separating Speech from Speech Noise to Improve Intelligibility

合作研究:将语音与语音噪声分离以提高清晰度

基本信息

项目摘要

Separating signals that have been mixed together is an archetypal engineering probelm. The past decade has seen the emergence of a several approaches applicable to separating sound mixtures -- for example, a restaurant scenario in which a desired target voice must be extracted from the background babble of other patrons. However, the most appropriate goal, and hence the way to measure performance, is not always clear. In this project, the goal is established as improving intelligibility i.e. processing sound mixtures so a human listener can better understand what can be said. This requires a collaboration between computer science/electrical engineering -- to provide the separation algorithms -- and auditory scientists/psychologists -- to guide the results towards perceptually-relevant improvements, and to evaluate the results in listener tests.The particular techniques to be developed and combined include blind source separation (such as independent component analysis), computational auditory scene analysis (simulations of what is understood about human perceptual processing), and model-driven approaches derived from the machine-learning techniques of speech recognition. One specific area of interest is the synthesis of `minimally-informative noise', acoustic tokens that effectively communicate both what can be inferred and what remains unknown about the target signal, and which can leverage the powerful perceptual inference of human listeners.This project will lead to implementations of acoustic signal separation that deliver the greatest benefit to human listeners, potentially including both normal-hearing and hearing-impaired individuals. This has a broad range of applications from processing archival recordings through to improved real-time communications technologies, as well as the potential to help automatic speech recognition systems.
分离混合在一起的信号是一个典型的工程问题。在过去的十年中,出现了几种适用于分离混合声音的方法——例如,在一个餐厅场景中,必须从其他顾客的背景语声中提取出所需的目标声音。然而,最合适的目标以及衡量性能的方法并不总是明确的。在这个项目中,目标是提高可理解性,即处理声音混合,以便人类听众可以更好地理解所说的内容。这需要计算机科学/电子工程(提供分离算法)和听觉科学家/心理学家(指导结果朝着感知相关的改进方向发展)之间的合作,并在听众测试中评估结果。需要开发和结合的特定技术包括盲源分离(如独立分量分析),计算听觉场景分析(模拟人类感知处理的理解),以及源自语音识别机器学习技术的模型驱动方法。我们感兴趣的一个特定领域是“最小信息噪声”的合成,这是一种声学标记,可以有效地传达目标信号的可推断和未知内容,并且可以利用人类听众的强大感知推断。这个项目将导致实现声信号分离,为人类听众带来最大的好处,可能包括正常听力和听力受损的人。这具有广泛的应用范围,从处理档案记录到改进的实时通信技术,以及帮助自动语音识别系统的潜力。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Pierre Divenyi其他文献

The Dynamics of Speech Production and Perception, (編集本収録:Richard E. Turner, Marc A. Al-Hames, David R. R. Smith, Hideki Kawahara Toshio Irino, and Roy D. Patterson, Vowel normalisation: Time-domain processing of the internal dynamics of speech)
语音产生和感知的动态,(由 Richard E. Turner、Marc A. Al-Hames、David R. R. Smith、Hideki Kawahara Toshio Irino 和 Roy D. Patterson 编辑,元音归一化:语音内部动态的时域处理演讲)
  • DOI:
  • 发表时间:
    2006
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Pierre Divenyi;Steven Greenberg;and George Meyer (Eds.)
  • 通讯作者:
    and George Meyer (Eds.)

Pierre Divenyi的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Pierre Divenyi', 18)}}的其他基金

Participation of Students and Postdocs at Workshop on Brain Rhythms and Speech Perception/Production
学生和博士后参加脑节律和言语感知/产生研讨会
  • 批准号:
    0837972
  • 财政年份:
    2008
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Perspectives on Speech Separation -- A Workshop
语音分离的观点——研讨会
  • 批准号:
    0345301
  • 财政年份:
    2003
  • 资助金额:
    --
  • 项目类别:
    Standard Grant

相似国自然基金

Research on Quantum Field Theory without a Lagrangian Description
  • 批准号:
    24ZR1403900
  • 批准年份:
    2024
  • 资助金额:
    0.0 万元
  • 项目类别:
    省市级项目
Cell Research
  • 批准号:
    31224802
  • 批准年份:
    2012
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research
  • 批准号:
    31024804
  • 批准年份:
    2010
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Cell Research (细胞研究)
  • 批准号:
    30824808
  • 批准年份:
    2008
  • 资助金额:
    24.0 万元
  • 项目类别:
    专项基金项目
Research on the Rapid Growth Mechanism of KDP Crystal
  • 批准号:
    10774081
  • 批准年份:
    2007
  • 资助金额:
    45.0 万元
  • 项目类别:
    面上项目

相似海外基金

CAS: Collaborative Research: Separating Electronic and Geometric Effects in Compound Catalysts: Examining Unique Selectivities for Hydrogenolysis on Transition Metal Phosphides
CAS:合作研究:分离复合催化剂中的电子效应和几何效应:检验过渡金属磷化物氢解的独特选择性
  • 批准号:
    2409888
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Collaborative Research: Separating the Climate and Weather of River Channels: Characterizing Dynamics of Coarse-Grained River Channel Response to Perturbations Across Scales
合作研究:分离河道的气候和天气:表征粗粒度河道对跨尺度扰动响应的动态
  • 批准号:
    2220504
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Collaborative Research: Separating the Climate and Weather of River Channels: Characterizing Dynamics of Coarse-Grained River Channel Response to Perturbations Across Scales
合作研究:分离河道的气候和天气:表征粗粒度河道对跨尺度扰动响应的动态
  • 批准号:
    2220505
  • 财政年份:
    2022
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAS: Collaborative Research: Separating Electronic and Geometric Effects in Compound Catalysts: Examining Unique Selectivities for Hydrogenolysis on Transition Metal Phosphides
CAS:合作研究:分离复合催化剂中的电子效应和几何效应:检验过渡金属磷化物氢解的独特选择性
  • 批准号:
    1954426
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAS: Collaborative Research: Separating Electronic and Geometric Effects in Compound Catalysts: Examining Unique Selectivities for Hydrogenolysis on Transition Metal Phosphides
CAS:合作研究:分离复合催化剂中的电子效应和几何效应:检验过渡金属磷化物氢解的独特选择性
  • 批准号:
    1954111
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
CAS: Collaborative Research: Separating Electronic and Geometric Effects in Compound Catalysts: Examining Unique Selectivities for Hydrogenolysis on Transition Metal Phosphides
CAS:合作研究:分离复合催化剂中的电子效应和几何效应:检验过渡金属磷化物氢解的独特选择性
  • 批准号:
    1954611
  • 财政年份:
    2020
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
DMREF/Collaborative Research: DNA-based Sensing, Communicating, and Phase-Separating Materials
DMREF/合作研究:基于 DNA 的传感、通信和相分离材料
  • 批准号:
    1921881
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
DMREF/Collaborative Research: DNA-based Sensing, Communicating, and Phase-Separating Materials
DMREF/合作研究:基于 DNA 的传感、通信和相分离材料
  • 批准号:
    1921955
  • 财政年份:
    2019
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Collaborative research: Coastal inertial-band dynamics: separating forced and free responses in a natural laboratory
合作研究:沿海惯性带动力学:在自然实验室中分离受迫响应和自由响应
  • 批准号:
    1635560
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
Collaborative research: Coastal inertial-band dynamics: separating forced and free responses in a natural laboratory
合作研究:沿海惯性带动力学:在自然实验室中分离受迫响应和自由响应
  • 批准号:
    1635166
  • 财政年份:
    2016
  • 资助金额:
    --
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了