STIMULATE: Modeling Structure in Speech above the Segment for Spontaneous Speech Recognition

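Basic Information

Award number:
9618926
Principal investigator:
Mari Ostendorf
Award amount:
about $458,100
Awardee institution:
Trustees of Boston University
Awardee country:
United States
Award type:
Continuing Grant
Fiscal year:
1997
Funding country:
United States
Period of performance:
1997-03-01 to 1999-09-29
Project status:
Completed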

Source:
https://www.nsf.gov/awardsearch/showAward?AWD_ID=9618926&HistoricalAwards=false
Keywords:
STIMULATE Modeling Structure Speech above

Project Abstract

Current speech recognition technology, while useful in constrained domains with cooperative speakers, still leads to unacceptably high error rates (30-50%) on unconstrained conversational or broadcast speech. An important difference between these tasks and high accuracy conditions is the larger variability in speaking style, even within data from a single speaker. Existing acoustic models do not account for the systematic factors behind this variability so must be "broader," leading to more confusability among words and hence high error rates. This work proposes to improve acoustic models by representing sources of variability at three time scales: the syllable, short regions within an utterance, and the speaker. At the syllable level, automatic clustering will capture syllable position and phonetic reduction effects. At the region level, a slowly varying hidden speaking mode will indicate systematic differences in pronunciations associated with reduced vs. clearly articulated speech. At the speaker level, hierarchical models of the correlation among speech sounds will improve adaptation of acoustic models from small amounts of data. Experiments will involve large vocabulary recognition of conversational speech using a multi-pass search strategy to handle the cost of the higher-order models proposed here. By representing systematic variability, the proposed work should significantly advance both the target task of unconstrained speech recognition and human-computer speech communication more generally.

Project Outcomes

Journal articles (0)

Monographs (0)

Research awards (0)

Conference papers (0)

Patents (0)

Other publications by Mari Ostendorf

Design of a speech recognition system based on acoustically derived segmental units

DOI:
10.1109/icassp.1996.541128
Year:
1996
Venue:
1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
Impact factor:
0
Authors:
M. Bacchiani; Mari Ostendorf; Y. Sagisaka; K. Paliwal
Corresponding author:
K. Paliwal

Automatic recognition of prosodic phrases

DOI:
Year:
1991
Venue:
IEEE International Conference on Acoustics, Speech, and Signal Processing
Impact factor:
0
Authors:
Colin W. Wightman; Mari Ostendorf
Corresponding author:
Mari Ostendorf

Representations for Question Answering from Documents with Tables and Text

DOI:
10.18653/v1/2021.eacl-main.253
Year:
2021
Venue:
Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER)
Impact factor:
0
Authors:
V. Zayats; Kristina Toutanova; Mari Ostendorf
Corresponding author:
Mari Ostendorf

The challenge of spoken language systems: research directions for the nineties

DOI:
10.1109/89.365385
Year:
1995
Venue:
IEEE Trans. Speech Audio Process.
Impact factor:
0
Authors:
R. Cole; L. Hirschman; L. Atlas; M. Beckman; A. Biermann; M. Bush; M. Clements; Jordan Cohen; Oscar Garcia; B. Hanson; H. Hermansky; S. Levinson; K. McKeown; N. Morgan; D. Novick; Mari Ostendorf; S. Oviatt; P. Price; H. Silverman; J. Spitz; A. Waibel; C. Weinstein; S. Zahorian; V. Zue
Corresponding author:
V. Zue