权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Listener sensitivity to talker differences in phonetic properties of speech

听者对说话者语音语音特性差异的敏感度

基本信息

批准号：
7486321
负责人：
Rachel Marie Theodore
金额：
$ 1.75万
依托单位：
NORTHEASTERN UNIVERSITY
依托单位国家：
美国
项目类别：
财政年份：
2007
资助国家：
美国
起止时间：
2007-09-01 至 2009-02-15
项目状态：
已结题

项目摘要

DESCRIPTION (provided by applicant): The goal of the proposed research is to extend our knowledge of the early stages of word recognition in which listeners extract individual segments from the speech signal. Long-standing accounts of speech perception, which emphasized the abstract nature of linguistic representations, have recently been challenged by findings that indicate that talker-specific, acoustic-phonetic information is retained in memory and can facilitate word recognition. These findings raise the possibility that detailed acoustic-phonetic information is used to customize the mapping between signal and segmental representation on a talker- specific basis. In support of this alternative account, there is now evidence that listeners can track acoustic- phonetic properties for a particular talker. One such property is voice-onset-time (VOT), a temporal property of speech that marks the voicing contrast in stop consonants. Listeners can learn a talker's characteristic VOTs in the context of one word-initial voiceless stop and, moreover, can transfer this information to a novel word that begins with the same stop. A fundamental question that remains unanswered concerns the level of representation at which talker-specific, acoustic-phonetic information is tracked. The specific aim of the proposed research is to address this question by determining whether listeners track talker-specific VOT with respect to a phonetic feature or with respect to a given phonetic segment. During a training phase, listeners will learn how two talkers produce /p/ or /k/. Speech synthesis techniques will be used to manipulate the VOTs of the two talkers so that one talker has shorter VOTs and the other talker has longer VOTs. During a test phase, a two-alternative forced-choice task will be used to examine transfer to words that begin with the same voiceless stop as used during training and to words that begin with a voiceless stop at a different place of articulation. If listeners track talker-specific VOT with respect to a phonetic feature, then information learned about voiceless stop consonants in the context of /p/ should transfer to /k/ and information learned in the context of /k/ should transfer to /p/. However, if listeners track talker-specific VOT with respect to a given phonetic segment, then transfer should be limited only to words that begin with the same voiceless stop as used during training. This research will contribute to the theoretical understanding of talker specificity in speech perception as well as support the advancement of devices that recognize normal and disordered speech. One current limitation of such devices is the failure to rapidly adapt to talker differences in speech production. Examining how humans process this type of phonetic variation will provide critical information to incorporate into machine recognition of spoken language.

描述（由申请人提供）：所提出的研究的目标是扩展我们对单词识别早期阶段的知识，在该阶段中，听众从语音信号中提取各个片段。长期以来的言语感知，强调语言表征的抽象性，最近受到了挑战的研究结果表明，说话者特定的，声学语音信息保留在记忆中，可以促进单词识别。这些发现提出了一种可能性，即详细的声学语音信息被用来定制信号和分段表示之间的映射在特定于说话者的基础上。为了支持这种替代性的解释，现在有证据表明，听众可以跟踪特定说话者的声学语音特性。一个这样的属性是语音起始时间（VOT），这是语音的时间属性，标志着塞音辅音中的浊音对比。听者可以在一个单词开头的无休止符的上下文中学习说话者的特征VOTs，并且可以将此信息转移到以相同休止符开头的新单词。一个基本的问题，仍然没有答案的关注的代表性水平，在特定的谈话者，声学语音信息被跟踪。建议的研究的具体目的是解决这个问题，通过确定是否听众跟踪说话者特定的VOT相对于一个语音特征或相对于一个给定的语音段。在训练阶段，听众将学习两个说话者如何产生/p/或/k/。语音合成技术将用于操纵两个说话者的VOT，使得一个说话者具有较短的VOT，而另一个说话者具有较长的VOT。在测试阶段，将使用两种选择的强迫选择任务来检查迁移到与训练期间使用的相同的无休止塞音开始开始的单词以及在不同的发音位置以无休止塞音开始开始的单词。如果听者跟踪说话者特定的VOT的语音特征，那么在/p/的上下文中学习到的关于无辅音塞音的信息应该转移到/k/，在/k/的上下文中学习到的信息应该转移到/p/。然而，如果收听者相对于给定的语音段跟踪说话者特定的VOT，则转移应仅限于以与训练期间使用的相同的无休止塞音开始开始的单词。本研究将有助于从理论上理解说话者在言语感知中的特异性，并支持识别正常和紊乱言语的设备的进步。这种设备的一个当前限制是不能快速适应讲话者在语音产生中的差异。研究人类如何处理这种类型的语音变化将提供关键信息，以纳入口语的机器识别。