STIMULATE: Modeling Structure in Speech above the Segment for Spontaneous Speech Recognition
刺激:对自发语音识别片段上方的语音结构进行建模
基本信息
- 批准号:9618926
- 负责人:
- 金额:$ 45.81万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:1997
- 资助国家:美国
- 起止时间:1997-03-01 至 1999-09-29
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Current speech recognition technology, while useful in constrained domains with cooperative speakers, still leads to unacceptably high error rates (30-50%) on unconstrained conversational or broadcast speech. An important difference between these tasks and high accuracy conditions is the larger variability in speaking style, even within data from a single speaker. Existing acoustic models do not account for the systematic factors behind this variability so must be ``broader,'' leading to more confusability among words and hence high error rates. This work proposes to improve acoustic models by representing sources of variability at three time scales: the syllable, short regions within an utterance, and the speaker. At the syllable level, automatic clustering will capture syllable position and phonetic reduction effects. At the region level, a slowly varying hidden speaking mode will indicate systematic differences in pronunciations associated with reduced vs. clearly articulated speech. At the speaker level, hierarchical models of the correlation among speech sounds will improve adaptation of acoustic models from small amounts of data. Experiments will involve large vocabulary recognition of conversational speech using a multi-pass search strategy to handle the cost of the higher-order models proposed here. By representing systematic variability, the proposed work should significantly advance both the target task of unconstrained speech recognition and human- computer speech communication more generally.
电流 讲话 识别技术, 而 有用 在有限的领域与合作的发言者,仍然导致不可接受的 高 错误率(30-50%) 对 不受约束的对话或广播讲话。 这些任务和高准确度条件之间的一个重要区别是说话风格的变化更大,即使在来自单个说话者的数据中也是如此。 现有的声学模型没有考虑系统的 因素 在这种变化的背后 所以 必须 “更广泛”,导致单词之间更容易混淆,因此错误率更高。 这项工作提出了改进声学模型表示源的变化在三个时间尺度:音节,短区域内的话语,和扬声器。 在音节级别,自动聚类将捕获音节位置和语音缩减效果。 在区域级别,缓慢变化的隐藏说话模式将指示与减少的语音与清晰的语音相关联的发音的系统差异。在说话人层面,语音之间的相关性的层次模型将改善声学模型从少量数据的适应。 实验将涉及大词汇量的会话语音识别,使用多遍搜索策略来处理这里提出的高阶模型的成本。通过表示系统的可变性,所提出的工作应显着推进无约束语音识别和人机语音通信的目标任务更普遍。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mari Ostendorf其他文献
Design of a speech recognition system based on acoustically derived segmental units
基于声学分段单元的语音识别系统设计
- DOI:
10.1109/icassp.1996.541128 - 发表时间:
1996 - 期刊:
- 影响因子:0
- 作者:
M. Bacchiani;Mari Ostendorf;Y. Sagisaka;K. Paliwal - 通讯作者:
K. Paliwal
Automatic recognition of prosodic phrases
自动识别韵律短语
- DOI:
- 发表时间:
1991 - 期刊:
- 影响因子:0
- 作者:
Colin W. Wightman;Mari Ostendorf - 通讯作者:
Mari Ostendorf
Representations for Question Answering from Documents with Tables and Text
带有表格和文本的文档问答的表示
- DOI:
10.18653/v1/2021.eacl-main.253 - 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
V. Zayats;Kristina Toutanova;Mari Ostendorf - 通讯作者:
Mari Ostendorf
The challenge of spoken language systems: research directions for the nineties
口语系统的挑战:九十年代的研究方向
- DOI:
10.1109/89.365385 - 发表时间:
1995 - 期刊:
- 影响因子:0
- 作者:
R. Cole;L. Hirschman;L. Atlas;M. Beckman;A. Biermann;M. Bush;M. Clements;Jordan Cohen;Oscar Garcia;B. Hanson;H. Hermansky;S. Levinson;K. McKeown;N. Morgan;D. Novick;Mari Ostendorf;S. Oviatt;P. Price;H. Silverman;J. Spitz;A. Waibel;C. Weinstein;S. Zahorian;V. Zue - 通讯作者:
V. Zue
The stochastic segment model for continuous speech recognition
连续语音识别的随机分段模型
- DOI:
- 发表时间:
1991 - 期刊:
- 影响因子:0
- 作者:
Mari Ostendorf;V. Digalakis - 通讯作者:
V. Digalakis
Mari Ostendorf的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mari Ostendorf', 18)}}的其他基金
Collaborative Research: Improving Speech Technology for Better Learning Outcomes: The Case of AAE Child Speakers
合作研究:改进语音技术以获得更好的学习成果:AAE 儿童演讲者的案例
- 批准号:
2202049 - 财政年份:2022
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
RI: Small: Modeling Idiosyncrasies of Speech for Automatic Spoken Language Processing
RI:小:为自动口语处理建模语音特质
- 批准号:
1617176 - 财政年份:2016
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
RI: Small: Simplifying Text for Individual Reading Needs
RI:小:简化文本以满足个人阅读需求
- 批准号:
0916951 - 财政年份:2009
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
U.S.-Germany Dissertation Enhancement: Predicting Hidden Structure and Punctuation in Speech for Machine Translation
美德论文增强:预测机器翻译语音中的隐藏结构和标点符号
- 批准号:
0552492 - 财政年份:2006
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
A Computing Lab for Integrated Teaching of Systems Courses in Electrical Engineering
电气工程系统课程集成教学计算实验室
- 批准号:
0511635 - 财政年份:2005
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
ITR: Applying Translation Technology to Language Modeling
ITR:将翻译技术应用于语言建模
- 批准号:
0326276 - 财政年份:2003
- 资助金额:
$ 45.81万 - 项目类别:
Continuing Grant
Speech Generation for Human-Computer Interaction
人机交互的语音生成
- 批准号:
9996440 - 财政年份:1999
- 资助金额:
$ 45.81万 - 项目类别:
Continuing Grant
STIMULATE: Modeling Structure in Speech above the Segment for Spontaneous Speech Recognition
刺激:对自发语音识别片段上方的语音结构进行建模
- 批准号:
9996450 - 财政年份:1999
- 资助金额:
$ 45.81万 - 项目类别:
Continuing Grant
Workshop for Discussing Research Priorities and Evaluation Strategies in Speech Synthesis
讨论语音合成研究重点和评估策略的研讨会
- 批准号:
9872796 - 财政年份:1998
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
Speech Generation for Human-Computer Interaction
人机交互的语音生成
- 批准号:
9528990 - 财政年份:1996
- 资助金额:
$ 45.81万 - 项目类别:
Continuing Grant
相似国自然基金
Galaxy Analytical Modeling
Evolution (GAME) and cosmological
hydrodynamic simulations.
- 批准号:
- 批准年份:2025
- 资助金额:10.0 万元
- 项目类别:省市级项目
相似海外基金
Plume Structure and Mantle Layering Beneath the South Pacific: Modeling Teleseismic Waveforms from Traditional and Floating Sensors
南太平洋下方的羽流结构和地幔分层:利用传统和浮动传感器模拟远震波形
- 批准号:
2341811 - 财政年份:2024
- 资助金额:
$ 45.81万 - 项目类别:
Continuing Grant
Structure and dynamics of the subcontinental lithospheric mantle over the Central and Eastern North American continent, constrained by numerical modeling based on tomography models
基于层析成像模型的数值模拟约束北美大陆中部和东部次大陆岩石圈地幔的结构和动力学
- 批准号:
2240943 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
Postdoctoral Fellowship: MPS-Ascend: Coarse-Grained Modeling of Aggrecan- Mimetic Copolymers: Polymer Design and Architecture Effects on Structure and Phase Behavior
博士后奖学金:MPS-Ascend:聚集蛋白聚糖模拟共聚物的粗粒度建模:聚合物设计和结构对结构和相行为的影响
- 批准号:
2316666 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Fellowship Award
Modeling the industrial distribution structure of Edo and Osaka by creating an industrial map
通过创建产业地图来模拟江户和大阪的产业分布结构
- 批准号:
23K00939 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
LEAPS-MPS: Mathematical Modeling of Brain Structure in Neurodegenerative Diseases Exhibiting Prion-Like Spreading
LEAPS-MPS:表现出朊病毒样传播的神经退行性疾病中大脑结构的数学模型
- 批准号:
2316952 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
Patient specific computational modeling of fluid-structure interactions of cerebrospinal fluid for biomarkers in Alzheimer's disease
阿尔茨海默病生物标志物脑脊液流固相互作用的患者特定计算模型
- 批准号:
10644281 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
3D modeling of Jomon cord marker by the structure from motion and identification of the pottery produced at the same time
通过运动结构对绳文绳标记进行 3D 建模并同时识别陶器
- 批准号:
23K00946 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Grant-in-Aid for Scientific Research (C)
Regularized divergences and their gradient flows, generative modeling and structure-preserving learning.
正则化散度及其梯度流、生成建模和结构保持学习。
- 批准号:
2307115 - 财政年份:2023
- 资助金额:
$ 45.81万 - 项目类别:
Standard Grant
Dependence structure modeling: New directions and applications
依赖结构建模:新方向和应用
- 批准号:
RGPIN-2019-06041 - 财政年份:2022
- 资助金额:
$ 45.81万 - 项目类别:
Discovery Grants Program - Individual
A Novel Framework for Model Reduction and Data-Driven Modeling of Fluid-Structure System: Application to Flapping Dynamics
流固系统模型简化和数据驱动建模的新框架:在扑动动力学中的应用
- 批准号:
RGPIN-2019-05065 - 财政年份:2022
- 资助金额:
$ 45.81万 - 项目类别:
Discovery Grants Program - Individual