Efficient configuration of robust spoken language technology based human interfaces

基于人机界面的强大口语技术的高效配置

基本信息

  • 批准号:
    298968-2010
  • 负责人:
  • 金额:
    $ 1.68万
  • 依托单位:
  • 依托单位国家:
    加拿大
  • 项目类别:
    Discovery Grants Program - Individual
  • 财政年份:
    2012
  • 资助国家:
    加拿大
  • 起止时间:
    2012-01-01 至 2013-12-31
  • 项目状态:
    已结题

项目摘要

Developments in automatic speech recognition (ASR) algorithms and implementations have resulted in a remarkable number of applications and services built around this technology. ASR technology appears in systems that have been developed for human-machine interaction, speech dictation, speech based indexing of audio and video resources, and computer aided systems for increasing the efficiency of various other human language related activities. However, there are two major limitations associated with the technology as it exists today. First, it is known to be fragile with respect to variation in speaker populations, speaking styles, acoustic environments, and communications channels. Second, the technology is also known to be expensive where the cost is dominated by the large quantities of transcribed speech data needed for developing systems for new applications and new languages. This project addresses these short-comings from several directions. First, efficient model representations and training algorithms will be developed to make the process of configuring ASR systems for new tasks and new language less expensive. They will facilitate scenarios where potentially huge but inexpensive application independent speech corpora can be used with much smaller more expensive task specific speech corpora to configure ASR systems. Multilingual scenarios are envisioned where speech data from many languages can be used for training an ASR system for languages where little speech or text corpora exist. Second, speaker normalization and acoustic background compensation approaches motivated by models of human speech production and speech perception will be developed. This will lead to more efficient paradigms for eliminating the effects of previously unseen speaker or acoustic environments. Finally, these techniques will be applied to creating more robust spoken language technology based human interfaces. The major goal is to develop and evaluate enabling technology with a major motivation being to extend the capabilities of text based applications like, for example, online search engines, to spoken language interfaces.
自动语音识别(ASR)算法和实现的发展已经导致了大量围绕该技术构建的应用程序和服务。 ASR技术出现在已经开发用于人机交互、语音听写、基于语音的音频和视频资源的索引的系统中,以及用于提高各种其他人类语言相关活动的效率的计算机辅助系统中。 然而,有两个主要的限制与技术,因为它存在的今天。 首先,它是已知的是脆弱的,在扬声器人口,说话风格,声学环境和通信信道的变化。 第二,该技术也被认为是昂贵的,其中成本由开发用于新应用和新语言的系统所需的大量转录语音数据支配。 本项目从几个方面解决这些短期问题。首先,将开发有效的模型表示和训练算法,以降低为新任务和新语言配置ASR系统的过程的成本。 它们将促进这样的场景,即潜在的巨大但廉价的独立于应用程序的语音语料库可以与小得多的更昂贵的特定于任务的语音语料库一起使用,以配置ASR系统。 设想了多语言场景,其中来自许多语言的语音数据可以用于训练用于存在很少语音或文本语料库的语言的ASR系统。第二,说话人规范化和声学背景补偿的方法,激励人类的语音生产和语音感知模型将被开发。 这将导致更有效的范例,以消除以前看不见的扬声器或声学环境的影响。 最后,这些技术将被应用于创建更强大的基于口语技术的人机界面。 主要目标是开发和评估使能技术,其主要动机是将基于文本的应用程序的功能扩展到口语界面,例如在线搜索引擎。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Rose, Richard其他文献

Associations Between Different Dimensions of Religious Involvement and Self-Rated Health in Diverse European Populations
  • DOI:
    10.1037/a0018036
  • 发表时间:
    2010-03-01
  • 期刊:
  • 影响因子:
    4.2
  • 作者:
    Nicholson, Amanda;Rose, Richard;Bobak, Martin
  • 通讯作者:
    Bobak, Martin
Association between attendance at religious services and self-reported health in 22 European countries
  • DOI:
    10.1016/j.socscimed.2009.06.024
  • 发表时间:
    2009-08-01
  • 期刊:
  • 影响因子:
    5.4
  • 作者:
    Nicholson, Amanda;Rose, Richard;Bobak, Martin
  • 通讯作者:
    Bobak, Martin
Mental health and special educational needs: exploring a complex relationship
  • DOI:
    10.1111/j.1467-8578.2008.00409.x
  • 发表时间:
    2009-03-01
  • 期刊:
  • 影响因子:
    1.3
  • 作者:
    Rose, Richard;Howley, Marie;Jament, Johnson
  • 通讯作者:
    Jament, Johnson
An FPGA-Based Fully Synchronized Design of a Bilateral Filter for Real-Time Image Denoising
Insulin for Hospitalized Patients With Well-Controlled Type 2 Diabetes Mellitus: A Quality Improvement Initiative
  • DOI:
    10.1097/jhq.0000000000000342
  • 发表时间:
    2022-07-01
  • 期刊:
  • 影响因子:
    1.3
  • 作者:
    Goyal, Noopur;Rose, Richard;Yarbrough, Peter
  • 通讯作者:
    Yarbrough, Peter

Rose, Richard的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Rose, Richard', 18)}}的其他基金

Efficient configuration of robust spoken language technology based human interfaces
基于人机界面的强大口语技术的高效配置
  • 批准号:
    298968-2010
  • 财政年份:
    2014
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Efficient configuration of robust spoken language technology based human interfaces
基于人机界面的强大口语技术的高效配置
  • 批准号:
    298968-2010
  • 财政年份:
    2013
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Efficient configuration of robust spoken language technology based human interfaces
基于人机界面的强大口语技术的高效配置
  • 批准号:
    298968-2010
  • 财政年份:
    2011
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Tools for open-ended language proficiency assessment
开放式语言能力评估工具
  • 批准号:
    419086-2011
  • 财政年份:
    2011
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Engage Grants Program
Efficient configuration of robust spoken language technology based human interfaces
基于人机界面的强大口语技术的高效配置
  • 批准号:
    298968-2010
  • 财政年份:
    2010
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Algorithms and architectures for robust mobile speech recognition
强大的移动语音识别算法和架构
  • 批准号:
    298968-2004
  • 财政年份:
    2008
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Diagnosing and modeling speech and language variability
语音和语言变异性的诊断和建模
  • 批准号:
    307188-2004
  • 财政年份:
    2007
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Special Research Opportunity Program - Project
Algorithms and architectures for robust mobile speech recognition
强大的移动语音识别算法和架构
  • 批准号:
    298968-2004
  • 财政年份:
    2007
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual
Diagnosing and modeling speech and language variability
语音和语言变异性的诊断和建模
  • 批准号:
    307188-2004
  • 财政年份:
    2006
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Special Research Opportunity Program - Project
Algorithms and architectures for robust mobile speech recognition
强大的移动语音识别算法和架构
  • 批准号:
    298968-2004
  • 财政年份:
    2006
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Discovery Grants Program - Individual

相似国自然基金

N-体问题的中心构型及动力系统的分支理论
  • 批准号:
    10601071
  • 批准年份:
    2006
  • 资助金额:
    10.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Dual Series Gate Configuration, Materials Design, and Mechanistic Modeling for Drift-Stabilized, Highly Sensitive Organic Electrochemical Transistor Biosensors
用于漂移稳定、高灵敏度有机电化学晶体管生物传感器的双串联栅极配置、材料设计和机械建模
  • 批准号:
    2402407
  • 财政年份:
    2024
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Standard Grant
SBIR Phase I: Designing the Future: Generative Configuration Design
SBIR 第一阶段:设计未来:生成式配置设计
  • 批准号:
    2333122
  • 财政年份:
    2024
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Standard Grant
AI-enabled Automated Algorithm Selection and Configuration for Mathematical Optimization Problems
针对数学优化问题的人工智能自动算法选择和配置
  • 批准号:
    2313289
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Standard Grant
Spatial and Temporal Airflow Mechanism Regarding Transient Aerodynamics of a Supersonic Aircraft Configuration with a Cranked-Arrow Main Wing
曲柄箭头主翼超音速飞机瞬态空气动力学的时空气流机制
  • 批准号:
    23K04228
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Configuration-specific cofactors of Oct4
Oct4 的配置特定辅因子
  • 批准号:
    10713592
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
Activation of metal-metal bonds in stable metal cluster compounds with closed-shell configuration.
具有闭壳结构的稳定金属簇化合物中金属-金属键的活化。
  • 批准号:
    22KJ2607
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
IRIS Digital Asset: A new automated resource configuration module for the UK DIRAC instance
IRIS Digital Asset:英国 DIRAC 实例的新自动化资源配置模块
  • 批准号:
    ST/Y003047/1
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Research Grant
CRII: SHF: RUI: Leroid: Bug Oracle and Environment Configuration Automation for Android Bug Report Reproduction
CRII:SHF:RUI:Leroid:用于 Android Bug 报告复制的 Bug Oracle 和环境配置自动化
  • 批准号:
    2246186
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Standard Grant
Collaborative Research: EAGER: Automating CI Configuration Troubleshooting with Bayesian Group Testing
协作研究:EAGER:使用贝叶斯组测试自动化 CI 配置故障排除
  • 批准号:
    2333326
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Standard Grant
Development of rational evaluation method for prestressing in concrete structures with arbitrary configuration
任意构型混凝土结构预应力合理评价方法的建立
  • 批准号:
    23K03996
  • 财政年份:
    2023
  • 资助金额:
    $ 1.68万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了