Listener sensitivity to talker differences in phonetic properties of speech

听者对说话者语音语音特性差异的敏感度

基本信息

  • 批准号:
    7486321
  • 负责人:
  • 金额:
    $ 1.75万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-09-01 至 2009-02-15
  • 项目状态:
    已结题

项目摘要

DESCRIPTION (provided by applicant): The goal of the proposed research is to extend our knowledge of the early stages of word recognition in which listeners extract individual segments from the speech signal. Long-standing accounts of speech perception, which emphasized the abstract nature of linguistic representations, have recently been challenged by findings that indicate that talker-specific, acoustic-phonetic information is retained in memory and can facilitate word recognition. These findings raise the possibility that detailed acoustic-phonetic information is used to customize the mapping between signal and segmental representation on a talker- specific basis. In support of this alternative account, there is now evidence that listeners can track acoustic- phonetic properties for a particular talker. One such property is voice-onset-time (VOT), a temporal property of speech that marks the voicing contrast in stop consonants. Listeners can learn a talker's characteristic VOTs in the context of one word-initial voiceless stop and, moreover, can transfer this information to a novel word that begins with the same stop. A fundamental question that remains unanswered concerns the level of representation at which talker-specific, acoustic-phonetic information is tracked. The specific aim of the proposed research is to address this question by determining whether listeners track talker-specific VOT with respect to a phonetic feature or with respect to a given phonetic segment. During a training phase, listeners will learn how two talkers produce /p/ or /k/. Speech synthesis techniques will be used to manipulate the VOTs of the two talkers so that one talker has shorter VOTs and the other talker has longer VOTs. During a test phase, a two-alternative forced-choice task will be used to examine transfer to words that begin with the same voiceless stop as used during training and to words that begin with a voiceless stop at a different place of articulation. If listeners track talker-specific VOT with respect to a phonetic feature, then information learned about voiceless stop consonants in the context of /p/ should transfer to /k/ and information learned in the context of /k/ should transfer to /p/. However, if listeners track talker-specific VOT with respect to a given phonetic segment, then transfer should be limited only to words that begin with the same voiceless stop as used during training. This research will contribute to the theoretical understanding of talker specificity in speech perception as well as support the advancement of devices that recognize normal and disordered speech. One current limitation of such devices is the failure to rapidly adapt to talker differences in speech production. Examining how humans process this type of phonetic variation will provide critical information to incorporate into machine recognition of spoken language.
描述(由申请人提供):所提出的研究的目标是扩展我们对单词识别早期阶段的知识,在该阶段中,听众从语音信号中提取各个片段。长期以来的言语感知,强调语言表征的抽象性,最近受到了挑战的研究结果表明,说话者特定的,声学语音信息保留在记忆中,可以促进单词识别。这些发现提出了一种可能性,即详细的声学语音信息被用来定制信号和分段表示之间的映射在特定于说话者的基础上。为了支持这种替代性的解释,现在有证据表明,听众可以跟踪特定说话者的声学语音特性。一个这样的属性是语音起始时间(VOT),这是语音的时间属性,标志着塞音辅音中的浊音对比。听者可以在一个单词开头的无休止符的上下文中学习说话者的特征VOTs,并且可以将此信息转移到以相同休止符开头的新单词。一个基本的问题,仍然没有答案的关注的代表性水平,在特定的谈话者,声学语音信息被跟踪。建议的研究的具体目的是解决这个问题,通过确定是否听众跟踪说话者特定的VOT相对于一个语音特征或相对于一个给定的语音段。在训练阶段,听众将学习两个说话者如何产生/p/或/k/。语音合成技术将用于操纵两个说话者的VOT,使得一个说话者具有较短的VOT,而另一个说话者具有较长的VOT。在测试阶段,将使用两种选择的强迫选择任务来检查迁移到与训练期间使用的相同的无休止塞音开始开始的单词以及在不同的发音位置以无休止塞音开始开始的单词。如果听者跟踪说话者特定的VOT的语音特征,那么在/p/的上下文中学习到的关于无辅音塞音的信息应该转移到/k/,在/k/的上下文中学习到的信息应该转移到/p/。然而,如果收听者相对于给定的语音段跟踪说话者特定的VOT,则转移应仅限于以与训练期间使用的相同的无休止塞音开始开始的单词。本研究将有助于从理论上理解说话者在言语感知中的特异性,并支持识别正常和紊乱言语的设备的进步。这种设备的一个当前限制是不能快速适应讲话者在语音产生中的差异。研究人类如何处理这种类型的语音变化将提供关键信息,以纳入口语的机器识别。

项目成果

期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Individual talker differences in voice-onset-time: contextual influences.
  • DOI:
    10.1121/1.3106131
  • 发表时间:
    2009-06
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Rachel M. Theodore;Joanne L. Miller;David DeSteno
  • 通讯作者:
    Rachel M. Theodore;Joanne L. Miller;David DeSteno
Characteristics of listener sensitivity to talker-specific phonetic detail.
听者对说话者特定语音细节的敏感性特征。
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Rachel Marie Theodore其他文献

Rachel Marie Theodore的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Rachel Marie Theodore', 18)}}的其他基金

Determinants of phonetic category structure in language impairment
语言障碍中语音类别结构的决定因素
  • 批准号:
    9306458
  • 财政年份:
    2017
  • 资助金额:
    $ 1.75万
  • 项目类别:
Listener sensitivity to talker differences in phonetic properties of speech
听者对说话者语音语音特性差异的敏感度
  • 批准号:
    7408765
  • 财政年份:
    2007
  • 资助金额:
    $ 1.75万
  • 项目类别:

相似海外基金

Nonlinear Acoustics for the conditioning monitoring of Aerospace structures (NACMAS)
用于航空航天结构调节监测的非线性声学 (NACMAS)
  • 批准号:
    10078324
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    BEIS-Funded Programmes
ORCC: Marine predator and prey response to climate change: Synthesis of Acoustics, Physiology, Prey, and Habitat In a Rapidly changing Environment (SAPPHIRE)
ORCC:海洋捕食者和猎物对气候变化的反应:快速变化环境中声学、生理学、猎物和栖息地的综合(蓝宝石)
  • 批准号:
    2308300
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Continuing Grant
University of Salford (The) and KP Acoustics Group Limited KTP 22_23 R1
索尔福德大学 (The) 和 KP Acoustics Group Limited KTP 22_23 R1
  • 批准号:
    10033989
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Knowledge Transfer Partnership
User-controllable and Physics-informed Neural Acoustics Fields for Multichannel Audio Rendering and Analysis in Mixed Reality Application
用于混合现实应用中多通道音频渲染和分析的用户可控且基于物理的神经声学场
  • 批准号:
    23K16913
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Combined radiation acoustics and ultrasound imaging for real-time guidance in radiotherapy
结合辐射声学和超声成像,用于放射治疗的实时指导
  • 批准号:
    10582051
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
Comprehensive assessment of speech physiology and acoustics in Parkinson's disease progression
帕金森病进展中言语生理学和声学的综合评估
  • 批准号:
    10602958
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
The acoustics of climate change - long-term observations in the arctic oceans
气候变化的声学——北冰洋的长期观测
  • 批准号:
    2889921
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Studentship
Collaborative Research: Estimating Articulatory Constriction Place and Timing from Speech Acoustics
合作研究:从语音声学估计发音收缩位置和时间
  • 批准号:
    2343847
  • 财政年份:
    2023
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Standard Grant
Flow Physics and Vortex-Induced Acoustics in Bio-Inspired Collective Locomotion
仿生集体运动中的流动物理学和涡激声学
  • 批准号:
    DGECR-2022-00019
  • 财政年份:
    2022
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Discovery Launch Supplement
Collaborative Research: Estimating Articulatory Constriction Place and Timing from Speech Acoustics
合作研究:从语音声学估计发音收缩位置和时间
  • 批准号:
    2141275
  • 财政年份:
    2022
  • 资助金额:
    $ 1.75万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了