Interference in spoken communication: Evaluating the corrupting and disrupting effects of other voices

言语交流中的干扰:评估其他声音的破坏和破坏效果

基本信息

  • 批准号:
    ES/N014383/1
  • 负责人:
  • 金额:
    $ 42.04万
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Research Grant
  • 财政年份:
    2016
  • 资助国家:
    英国
  • 起止时间:
    2016 至 无数据
  • 项目状态:
    已结题

项目摘要

In everyday life, talking with other people is important not only for sharing knowledge and ideas, but also for maintaining a sense of belonging to a community. Most people take it for granted that they can converse with others with little or no effort. Successful communication involves understanding what is being said and being understood, but it is quite rare to hear the speech of a particular talker in isolation. Speech is typically heard in the presence of interfering sounds, which are often the voices of other talkers. The human auditory system, which is responsible for our sense of hearing, therefore faces the challenge of identifying which parts of the sounds reaching our ears have come from which talker.Solving this "auditory scene analysis" problem involves separating those sound elements arising from one source (e.g., the voice of the talker to whom you are attending) from those arising from other sources, so that the identity and meaning of the target source can be interpreted by higher-level processes in the brain. Over the course of evolution, humans have been exposed to a variety of complex listening environments, and so we are generally very successful at understanding the speech of one person in the presence of other talkers. This contrasts with attempts to develop listening machines, which often fail when confronted with adverse conditions, such as automatic transcription of a conversation in an open-plan office. Human listeners with hearing impairment often find these environments especially difficult, even when using the latest developments in hearing-aid or cochlear-implant design, and so can struggle to communicate effectively in such conditions.Much of the information necessary to understand speech (acoustic-phonetic information) is carried by the changes in frequency over time of a few broad peaks in the frequency spectrum of the speech signal, known as formants. The project aims to investigate how listeners presented with mixtures of target speech and interfering formants are able to group together the appropriate formants, and to reject others, such that the speech of the talker we want to listen to can be understood. Interfering sounds can have two kinds of effect - energetic masking, in which the neural response of the ear to the target is swamped by the response to the masker, and informational masking, in which the "auditory brain" fails to separate readily detectable parts of the target from the masker. The project will explore the informational masking component of interference - often the primary factor limiting speech intelligibility - using stimulus configurations that eliminate energetic masking. We will do so using perceptual experiments in which we measure how our ability to understand speech (e.g., the number of words reported correctly) changes under a variety of conditions.The project will examine how acoustic-phonetic information is combined across formants. It will also explore how a speech-like interferer affects intelligibility, distinguishing the circumstances in which the interferer takes up some of the available perceptual processing capacity from those in which specific properties of the interferer intrude into the perception of the target speech. Our approach is to use artificial speech-like stimuli with precisely controlled properties, to mix target speech with carefully designed interferers that offer alternative grouping possibilities, and to measure how manipulating the properties of these interferers affects listeners' abilities to recognise the target speech in the mixture. The results will improve our understanding of how human listeners separate speech from interfering sounds and the constraints on that separation, helping to refine computational models of listening. Such refinements will in turn provide ways of improving the performance of devices such as hearing aids and automatic speech recognisers when they operate in adverse listening conditions.
在日常生活中,与他人交谈不仅对分享知识和想法很重要,而且对保持社区归属感也很重要。大多数人想当然地认为他们可以毫不费力地与他人匡威。成功的沟通包括理解正在说的话并被理解,但很少单独听到某个说话者的讲话。语音通常是在存在干扰声音的情况下听到的,这些干扰声音通常是其他说话者的声音。因此,负责我们听觉的人类听觉系统面临着识别到达我们耳朵的声音的哪些部分来自哪个说话者的挑战。解决这个“听觉场景分析”问题涉及分离来自一个源的那些声音元素(例如,你所关注的谈话者的声音)与其他来源的声音区分开来,这样目标来源的身份和意义就可以被大脑中更高层次的过程解释。在进化的过程中,人类已经暴露在各种复杂的听觉环境中,因此我们通常非常成功地理解一个人在其他说话者在场的情况下的讲话。这与开发听力机器的尝试形成了鲜明对比,后者在遇到不利条件时经常失败,例如在开放式办公室中自动转录对话。有听力障碍的人类听众通常会发现这些环境特别困难,即使使用助听器或耳蜗植入设计的最新发展,因此在这种条件下也很难有效地进行交流。(声学语音信息)由语音信号频谱中的几个宽峰随时间的频率变化携带,称为共振峰。该项目旨在研究听众如何面对目标语音和干扰共振峰的混合物,能够将适当的共振峰组合在一起,并拒绝其他共振峰,这样我们想要听的说话者的语音就可以被理解。干扰声音可以产生两种效果:能量掩蔽(energetic masking),即耳朵对目标的神经反应被掩蔽者的反应淹没;信息掩蔽(information masking),即“听觉大脑”无法将目标中容易检测到的部分与掩蔽者区分开来。该项目将探讨干扰的信息掩蔽成分-通常是限制语音清晰度的主要因素-使用消除能量掩蔽的刺激配置。我们将使用感知实验来做到这一点,在这些实验中,我们测量我们理解语音的能力(例如,正确报告的单词数量)在各种条件下发生变化。该项目将研究声学语音信息如何在共振峰之间组合。它还将探讨如何一个语音样的发音影响可理解性,区分的情况下,其中的发音占用了一些可用的感知处理能力的发音的特定属性侵入到目标语音的感知。我们的方法是使用人工语音样的刺激与精确控制的属性,混合目标语音与精心设计,提供替代分组的可能性,并测量如何操纵这些objecrers的属性影响听众的能力,以识别目标语音的混合物。这些结果将提高我们对人类听众如何将语音与干扰声音分离以及这种分离的限制的理解,有助于完善听力的计算模型。这种改进反过来将提供改善助听器和自动语音识别器等设备在不利听力条件下工作时的性能的方法。

项目成果

期刊论文数量(6)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Dichotic integration of acoustic-phonetic information: Competition from extraneous formants increases the effect of second-formant attenuation on intelligibility.
声学-语音信息的二分整合:来自无关共振峰的竞争增加了第二共振峰衰减对清晰度的影响。
Informational masking of speech by time-varying competitors: Effects of frequency region and number of interfering formants.
时变竞争对手对语音的信息掩蔽:频率区域和干扰共振峰数量的影响。
Informational masking of speech by acoustically similar intelligible and unintelligible interferers
声学上相似的可理解和不可理解的干扰源对语音的信息掩蔽
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Brian Roberts其他文献

DNA synthesis patterns in the giant foot-pad nuclei of Sarcophaga bullata (Sarcophagidae, Diptera)
  • DOI:
    10.1007/bf00331806
  • 发表时间:
    1974-06-01
  • 期刊:
  • 影响因子:
    2.300
  • 作者:
    Brian Roberts;Joan M. Whitten;Lawrence I. Gilbert
  • 通讯作者:
    Lawrence I. Gilbert
Separate mechanisms govern the selection of spectral components for perceptual fusion and for the computation of global pitch.
单独的机制控制用于感知融合和全局音调计算的频谱分量的选择。
単一アクロス震源を用いたAquistore CCSサイトにおける表面波位相速度の空間分布とモニタリング
使用单个跨源在 Aquistore CCS 站点进行表面波相速度的空间分布和监测
  • DOI:
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    池田 達紀;辻 健;中務 真志;伴 英明;加藤 文人;Kyle Worth;Don White;Brian Roberts
  • 通讯作者:
    Brian Roberts
Patterns of incorporation of tritiated thymidine by the dorsal polytene foot-pad nuclei of Sarcophaga bullata (Sarcophagidae: Diptera)
  • DOI:
    10.1007/bf00292835
  • 发表时间:
    1976-06-01
  • 期刊:
  • 影响因子:
    2.300
  • 作者:
    Brian Roberts;Joan M. Whitten;Lawrence I. Gilbert
  • 通讯作者:
    Lawrence I. Gilbert
Perceptual fusion and fragmentation of complex tones made inharmonic by applying different degrees of frequency shift and spectral stretch.
通过应用不同程度的频移和频谱拉伸,使复杂音调的感知融合和分裂变得不和谐。

Brian Roberts的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Brian Roberts', 18)}}的其他基金

RAPID: Securing the LUMCON natural history collection, a vital Gulf Coast resource
RAPID:保护 LUMCON 自然历史收藏,这是墨西哥湾沿岸的重要资源
  • 批准号:
    2203268
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Standard Grant
EAR-Climate: Collaborative Research: Methane Dynamics Across Microbe-to-Landscape Scales in Coastal Wetlands
EAR-气候:合作研究:沿海湿地从微生物到景观尺度的甲烷动力学
  • 批准号:
    2218581
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Continuing Grant
REU Site: Interdisciplinary Research Experiences in Louisiana's Changing Coastal Environments
REU 网站:路易斯安那州不断变化的沿海环境的跨学科研究经验
  • 批准号:
    2150358
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Continuing Grant
REU Site: Interdisciplinary Research Experiences in Changing Coastal Environments
REU 网站:不断变化的沿海环境的跨学科研究经验
  • 批准号:
    1757887
  • 财政年份:
    2018
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: A RAPID response to Hurricane Harvey's impacts on coastal carbon cycle, metabolic balance and ocean acidification
合作研究:快速应对飓风哈维对沿海碳循环、代谢平衡和海洋酸化的影响
  • 批准号:
    1760687
  • 财政年份:
    2017
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Standard Grant
Understanding speech in the presence of other speech: Perceptual mechanisms for auditory scene analysis in human listeners
在存在其他语音的情况下理解语音:人类听众听觉场景分析的感知机制
  • 批准号:
    ES/K004905/1
  • 财政年份:
    2013
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Research Grant
REU Site: Interdisciplinary Research Experiences in Changing Coastal Environments
REU 网站:不断变化的沿海环境的跨学科研究经验
  • 批准号:
    1063036
  • 财政年份:
    2011
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Continuing Grant
Collaborative Research: RAPID: The 2011 Atchafalaya River Flood and a possible altered system state for the Atchafalaya River Delta Estuary
合作研究:RAPID:2011 年阿查法拉亚河洪水和阿查法拉亚河三角洲河口可能发生的系统状态改变
  • 批准号:
    1141354
  • 财政年份:
    2011
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Standard Grant
RAPID: Effects of oiling and hydrologic remediation on baldcypress swamp elevation and ecosystem processes in the context of the BP Deepwater Horizon Oil Spill
RAPID:BP 深水地平线漏油事件背景下的注油和水文修复对秃柏沼泽海拔和生态系统过程的影响
  • 批准号:
    1049838
  • 财政年份:
    2010
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Standard Grant
The perceptual organization of speech: Contributions of general and speech-specific factors
言语的感知组织:一般因素和言语特定因素的贡献
  • 批准号:
    EP/F016484/1
  • 财政年份:
    2008
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Research Grant

相似海外基金

Spoken Language Technology for Online Communication Between Normal-Hearing and Hearing-Impaired People
听力正常者与听力障碍者在线交流的口语技术
  • 批准号:
    23H00995
  • 财政年份:
    2023
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Development of aging-sensitive spoken language measures in children, adolescents, and young adults with Down Syndrome
针对患有唐氏综合症的儿童、青少年和年轻人制定对年龄敏感的口语测量方法
  • 批准号:
    10644947
  • 财政年份:
    2023
  • 资助金额:
    $ 42.04万
  • 项目类别:
AG Bell Global Listening & Spoken Language Symposia
AG贝尔全球聆听
  • 批准号:
    10597693
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
User-adaptive spoken dialogue system based on communication accommodation theory
基于通信适应理论的用户自适应口语对话系统
  • 批准号:
    22K17958
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
AG Bell Global Listening & Spoken Language Symposia
AG贝尔全球聆听
  • 批准号:
    10468568
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
Adaptive processing of spoken language: flexible communication in health, disease and disorder
口语的适应性处理:健康、疾病和失调中的灵活沟通
  • 批准号:
    MC_UU_00030/6
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Intramural
Examining adaptive peer-mediated interventions for preschoolers with autism spectrum disorder and limited or no spoken language: A Sequential Multiple Assignment Randomized Trial
检查对患有自闭症谱系障碍且口语有限或没有口语的学龄前儿童的适应性同伴介导干预措施:序贯多重分配随机试验
  • 批准号:
    10491421
  • 财政年份:
    2022
  • 资助金额:
    $ 42.04万
  • 项目类别:
Infrastructural spoken language technology to support smooth communication with hearing-impaired people in education
基础口语技术支持教育中听力障碍人士的顺利沟通
  • 批准号:
    20H01716
  • 财政年份:
    2020
  • 资助金额:
    $ 42.04万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10307519
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
Spoken Language Processing as an Early Marker of Language Impairment in Bilingual Children
口语处理是双语儿童语言障碍的早期标志
  • 批准号:
    10542851
  • 财政年份:
    2019
  • 资助金额:
    $ 42.04万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了