EAGER: Linguistic Event Extraction and Integration (LEXI): A New Approach to Speech Analysis

EAGER:语言事件提取和集成 (LEXI):语音分析的新方法

基本信息

项目摘要

This exploratory project develops a new system for speech signal analysis that can be used to improve automatic speech recognition (ASR) systems, and provide a testable model of human speech perception. The system is based on finding important events in the speech signal, i.e. 'acoustic edges' where the signal changes suddenly because the mouth closes or opens during the formation of a consonant (like /p/ or /s/), or a vowel (like /a/ or /u/). These abrupt changes, called Landmarks, are especially informative, because they (and the parts of the signal near them) are richly informative about the speaker's intended words and their sounds. Focusing on these events results in greater computational efficiency, by identifying the linguistically relevant information in the speech signal, rather than measuring every part of the signal. This focus on individual cues to speech sounds also means that the system can deal with non-typical speech produced by children, older people, speakers with foreign accents, or those with clinical speech disabilities. As a result, this system will bring the benefits of ASR to speakers who are not well served by current recognition systems, making it possible for more people to use cell phones, tablets and laptops. While existing systems work well for typical speakers by using statistical analysis of large samples of typical speech, they leave many people underserved. The Landmark-based system will also provide a tool for testing whether human speech recognition depends on finding the individual cues to the sounds of words, even when those cues are very different in different contexts, and so can lead to the development of a new model of human speech perception.The system works by extracting speech-related measurements from the signal, such as fundamental frequency, formant frequencies, spectral band energies and their derivatives, and interpreting these measures as acoustic cues for distinctive features. Innovative aspects of the system include the use of Landmarks, which are the most robust of the acoustic feature cues and are related to articulatory manner features. Once the landmark acoustic cues are found, other acoustic cues related to place and voicing features, and to prosodic structure, can also be found. The extraction of distinctive features and prosodic structure provides the first abstract linguistic units that can be extracted from the physical continuous signal, and this information is used to identify words, and to construct a representation of the entire utterance. To develop and evaluate the performance of this innovative system, speech databases consisting of isolated vowel-consonant-vowel sequences, read continuous speech, read radio-style speech, and spontaneous speech will be hand-labeled with Landmarks and other acoustic cues. Results of this basic speech research project will support the development of new approaches to ASR, will provide a testable computational model of human speech production, and will produce material suitable for development of a tutorial to train students in engineering, linguistics and cognitive science to label acoustic feature cues.
这个探索性项目开发了一个新的语音信号分析系统,可用于改进自动语音识别(ASR)系统,并提供一个可测试的人类语音感知模型。该系统基于发现语音信号中的重要事件,即:“声学边缘”,在发辅音(如/p/或/s/)或元音(如/a/或/u/)时,由于嘴巴闭合或张开,信号突然改变。这些突然的变化被称为“地标”,尤其具有信息量,因为它们(以及它们附近的信号部分)提供了关于说话者想要表达的单词及其发音的丰富信息。通过识别语音信号中的语言相关信息,而不是测量信号的每个部分,关注这些事件可以提高计算效率。这种对语音个体线索的关注也意味着该系统可以处理由儿童、老年人、外国口音的说话者或临床语言障碍患者发出的非典型语音。因此,该系统将把ASR的好处带给目前识别系统无法很好地服务的说话者,使更多人使用手机,平板电脑和笔记本电脑成为可能。虽然现有的系统通过对典型语音的大量样本进行统计分析,对典型的说话者很好地工作,但它们让许多人得不到充分的服务。基于landmark的系统还将提供一种工具,用于测试人类语音识别是否依赖于寻找单词声音的单个线索,即使这些线索在不同的环境中非常不同,因此可以导致人类语音感知新模型的发展。该系统的工作原理是从信号中提取与语音相关的测量值,如基频、共振峰频率、频谱带能量及其导数,并将这些测量值解释为独特特征的声学线索。该系统的创新方面包括使用地标,这是最强大的声学特征线索,与发音方式特征相关。一旦找到了标志性的声音线索,其他与地点和发声特征以及韵律结构有关的声音线索也可以找到。独特特征和韵律结构的提取提供了第一个可以从物理连续信号中提取的抽象语言单位,这些信息用于识别单词,并构建整个话语的表示。为了开发和评估这一创新系统的性能,由孤立的元音-辅音-元音序列、可读连续语音、可读广播式语音和自发语音组成的语音数据库将被手工标记为地标和其他声学线索。这个基础语音研究项目的结果将支持ASR新方法的发展,将提供一个可测试的人类语音产生的计算模型,并将产生适合开发教程的材料,以培训工程、语言学和认知科学的学生标记声学特征线索。

项目成果

期刊论文数量(18)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A framework for labeling speech with acoustic cues to linguistic distinctive features
用声音线索标记语音的框架,以表达语言的独特特征
Irregular pitch periods as a feature cue in the developing speech of English-learning children
不规则的音高周期是英语学习儿童言语发展的一个特征线索
Acoustic cues to distinctive features are modified in the speech of typically-developing versus atypically developing children
典型发育儿童与非典型发育儿童的言语中,显着特征的声音线索会发生变化
  • DOI:
    10.1121/1.4988535
  • 发表时间:
    2017
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Talkar, Tanya;Zuk, Jennifer;Guerrero, Maria X.;Choi, Jeung-Yoon;Shattuck-Hufnagel, Stefanie
  • 通讯作者:
    Shattuck-Hufnagel, Stefanie
Detecting glides and their place of articulation using speech-related measurements in a feature-cue-based model
在基于特征提示的模型中使用与语音相关的测量来检测滑音及其发音位置
Toward an analysis of Spanish glides in the acoustic landmark framework
在声学地标框架中分析西班牙滑翔
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Stefanie Shattuck-Hufnagel其他文献

A prosody tutorial for investigators of auditory sentence processing
  • DOI:
    10.1007/bf01708572
  • 发表时间:
    1996-03-01
  • 期刊:
  • 影响因子:
    1.600
  • 作者:
    Stefanie Shattuck-Hufnagel;Alice E. Turk
  • 通讯作者:
    Alice E. Turk

Stefanie Shattuck-Hufnagel的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Stefanie Shattuck-Hufnagel', 18)}}的其他基金

Collaborative Research: Exploring Variation in English Intonational Acoustic Phonetics from Grammatical Perspectives
合作研究:从语法角度探索英语语调声学语音的变异
  • 批准号:
    2042748
  • 财政年份:
    2021
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Collaborative research: An integrated model of phonetic analysis and lexical access based on individual acoustic cues to features
协作研究:基于个体声学特征特征的语音分析和词汇访问的集成模型
  • 批准号:
    1827598
  • 财政年份:
    2018
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Collaborative Research: CI-P: Reciprosody - A Repository for Prosodically Annotated Material
合作研究:CI-P:Reciprosody - 韵律注释材料存储库
  • 批准号:
    1205402
  • 财政年份:
    2012
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Collaborative Research: Integrating shape, scaling, and alignment in a global approach to F0 events in intonation systems
协作研究:将形状、缩放和对齐整合到语调系统中 F0 事件的全局方法中
  • 批准号:
    1023596
  • 财政年份:
    2010
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Collaborative Research: Global Measures of Tonal Alignment in a Level-based Theory of Intonational Phonology
合作研究:基于水平的语调音韵学理论中音调对齐的全局测量
  • 批准号:
    0842782
  • 财政年份:
    2009
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Collaborative Research: Prosodic Categories of American English in Form and Function
合作研究:美式英语的韵律类别的形式和功能
  • 批准号:
    0643054
  • 财政年份:
    2007
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Conference - From Sound to Sense: Fifty+ Years of Discoveries in Speech Communication
会议 - 从声音到感觉:语音交流的五十年发现
  • 批准号:
    0418205
  • 财政年份:
    2004
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant
Phonetic Modification of Function Words: Implications for Human and Automatic Speech Processing
功能词的语音修饰:对人类和自动语音处理的影响
  • 批准号:
    9820126
  • 财政年份:
    1999
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Standard Grant

相似海外基金

Language contrasts and their effect on memory in the context of witness narratives: A cross-linguistic study of English, German and Czech motion event
目击者叙述背景下的语言对比及其对记忆的影响:英语、德语和捷克语运动事件的跨语言研究
  • 批准号:
    2602273
  • 财政年份:
    2021
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Studentship
Effects of Event Construals in Lerners' Native Language: Form a cognitive linguistic view point
事件解释对学习者母语的影响:形成认知语言观点
  • 批准号:
    18K00772
  • 财政年份:
    2018
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The Interaction of Bayesian Pragmatics and Lexical Semantics in Linguistic Interpretation: Using Event-related Potentials to Investigate Probabilistic Predictions of Hearers
贝叶斯语用学和词汇语义学在语言解释中的相互作用:利用事件相关电位研究听者的概率预测
  • 批准号:
    367110651
  • 财政年份:
    2017
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Priority Programmes
Learner's English Proficiency and the Influence of Event Construals of their Native Language: From a Cognitive Linguistic Point of View
学习者的英语水平及其母语事件解释的影响:从认知语言学的角度来看
  • 批准号:
    16K02947
  • 财政年份:
    2016
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Event conceptualisation and linguistic realisation: The impact of semantic and lexical factors on sentence production
事件概念化与语言实现:语义和词汇因素对句子生成的影响
  • 批准号:
    274318723
  • 财政年份:
    2015
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Research Grants
A typological study on construal of event structures and its linguistic manifestations: with special reference to reflexive beneficiary-subject constructions
事件结构解释及其语言表现的类型学研究:特别参考反射性受益人主体结构
  • 批准号:
    15K02487
  • 财政年份:
    2015
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Conceptualisation and linguistic form when producing event sequences
生成事件序列时的概念化和语言形式
  • 批准号:
    5401115
  • 财政年份:
    2003
  • 资助金额:
    $ 21.44万
  • 项目类别:
    Research Grants
PRE-LINGUISTIC EVENT REPRESENTATION AND INDIVIDUATION
语言前事件的表征和个性化
  • 批准号:
    6528471
  • 财政年份:
    2002
  • 资助金额:
    $ 21.44万
  • 项目类别:
PRE-LINGUISTIC EVENT REPRESENTATION AND INDIVIDUATION
语言前事件的表征和个性化
  • 批准号:
    6391675
  • 财政年份:
    2001
  • 资助金额:
    $ 21.44万
  • 项目类别:
PRE-LINGUISTIC EVENT REPRESENTATION AND INDIVIDUATION
语言前事件的表征和个性化
  • 批准号:
    6062426
  • 财政年份:
    2000
  • 资助金额:
    $ 21.44万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了