CAREER: Landmark-Based Speech Recognition in Music and Speech Backgrounds
职业:音乐和语音背景中基于地标的语音识别
基本信息
- 批准号:0132900
- 负责人:
- 金额:$ 39.58万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2002
- 资助国家:美国
- 起止时间:2002-07-01 至 2008-06-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
This is a Faculty Early Career Development (CAREER) award. The research will develop speech recognition and auditory scene analysis models that are probability distributions whose parameters can be trained from data and whose internal structures are capable of abstracting the perceptual response patterns of human listeners. Two broad research questions will be explored: (1) Can probability models representing the pitch, envelope, and timing of an acoustic source be computed and integrated in a tractable manner? (2) What are the theoretical and empirical requirements for the partitioning, training, and recognition scoring of probability models for landmark-based acoustic features? Landmarks in speech are identifiable points in the flow of sound over time, such as consonant releases and closures, vowel centers, and glide extrema. The educational component of this project includes significant curriculum development at both the undergraduate and graduate levels, and a strong investment in the mentoring of undergraduate and graduate research trainees.This CAREER award recognizes and supports the early career-development activities of a teacher-scholar who is likely to become an academic leader of the twenty-first century. This is fundamental scientific research in acoustics and computer science, but it addresses the very practical problem that computers are still far worse at recognizing speech than human beings are. Speech recognition technology has already become an important industry, but it will become far more important in the future as mobile computing and computer-mediated communications make it necessary for millions of people to control machines verbally rather than by means of keyboards. The educational component of this work will train graduate students to be teachers and communicators, as well as researchers, thus preparing them to help build the base of personnel needed in this exciting, growing area.
这是一个教师早期职业发展(CAREER)奖。 该研究将开发语音识别和听觉场景分析模型,这些模型是概率分布,其参数可以从数据中训练,其内部结构能够抽象出人类听众的感知反应模式。 将探讨两个广泛的研究问题:(1)概率模型表示的音高,包络线,声源的时间可以计算和整合在一个易于处理的方式? (2)基于地标的声学特征的概率模型的划分、训练和识别评分的理论和经验要求是什么? 言语中的地标是声音随时间变化的流动中可识别的点,例如辅音释放和闭合、元音中心和滑音极值。 该项目的教育部分包括本科生和研究生两个层次的重要课程开发,以及对本科生和研究生研究实习生的指导的大力投资。该职业生涯奖认可并支持有可能成为21世纪学术领导者的教师学者的早期职业发展活动。 这是声学和计算机科学的基础科学研究,但它解决了一个非常实际的问题,即计算机在识别语音方面仍然比人类差得多。 语音识别技术已经成为一个重要的产业,但随着移动的计算和以计算机为媒介的通信使数百万人必须通过口头而不是通过键盘来控制机器,它在未来将变得更加重要。 这项工作的教育部分将培养研究生成为教师和传播者以及研究人员,从而使他们做好准备,帮助建立这个令人兴奋的不断增长的领域所需的人员基础。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Mark Hasegawa-Johnson其他文献
Mark Hasegawa-Johnson的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Mark Hasegawa-Johnson', 18)}}的其他基金
FAI: A New Paradigm for the Evaluation and Training of Inclusive Automatic Speech Recognition
FAI:包容性自动语音识别评估和训练的新范式
- 批准号:
2147350 - 财政年份:2022
- 资助金额:
$ 39.58万 - 项目类别:
Standard Grant
RI: Small: Collaborative Research: Automatic Creation of New Speech Sound Inventories
RI:小型:协作研究:自动创建新语音库存
- 批准号:
1910319 - 财政年份:2019
- 资助金额:
$ 39.58万 - 项目类别:
Standard Grant
EAGER: Matching Non-Native Transcribers to the Distinctive Features of the Language Transcribed
EAGER:将非母语转录者与转录语言的独特特征相匹配
- 批准号:
1550145 - 财政年份:2015
- 资助金额:
$ 39.58万 - 项目类别:
Standard Grant
FODAVA-Partner: Visualizing Audio for Anomaly Detection
FODAVA-合作伙伴:可视化音频以进行异常检测
- 批准号:
0807329 - 财政年份:2008
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant
RI Medium: Audio Diarization - Towards Comprehensive Description of Audio Events
RI Medium:音频二值化 - 全面描述音频事件
- 批准号:
0803219 - 财政年份:2008
- 资助金额:
$ 39.58万 - 项目类别:
Standard Grant
Audiovisual Distinctive-Feature-Based Recognition of Dysarthric Speech
基于视听特征的构音障碍语音识别
- 批准号:
0534106 - 财政年份:2005
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant
Prosodic, Intonational, and Voice Quality Correlates of Disfluency
韵律、语调和语音质量与不流畅的相关性
- 批准号:
0414117 - 财政年份:2004
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant
相似国自然基金
基于Landmark知识的规划方法研究
- 批准号:61103136
- 批准年份:2011
- 资助金额:22.0 万元
- 项目类别:青年科学基金项目
相似海外基金
The Effects of Landmark Uncertainty in VGI-based Maps: Approaches to Improve Wayfinding and Navigation Performance
基于 VGI 的地图中地标不确定性的影响:改善寻路和导航性能的方法
- 批准号:
314977345 - 财政年份:2016
- 资助金额:
$ 39.58万 - 项目类别:
Priority Programmes
Experiments and theory regarding the emergence of landmark based on point and open logic
基于点和开放逻辑的地标出现实验与理论
- 批准号:
25540059 - 财政年份:2013
- 资助金额:
$ 39.58万 - 项目类别:
Grant-in-Aid for Challenging Exploratory Research
A new era for modern main group chemistry: from landmark molecules towards the replacement of transition metal based technologies
现代主族化学的新时代:从标志性分子到过渡金属技术的替代
- 批准号:
DP120101300 - 财政年份:2012
- 资助金额:
$ 39.58万 - 项目类别:
Discovery Projects
Collaborative Research: Landmark-based Robust Speech Recognition using Prosody-guided Models of Speech Variability
协作研究:使用韵律引导的语音变异模型进行基于地标的鲁棒语音识别
- 批准号:
0703805 - 财政年份:2007
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant
Collaborative Research: Landmark-based Robust Speech Recognition Using Prosody-guided models of speech variability
协作研究:使用韵律引导的语音变异模型进行基于地标的鲁棒语音识别
- 批准号:
0703048 - 财政年份:2007
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant
RI-Collaborative Research: Landmark-based robust speech recognition using prosody-guided models of speech variability
RI 协作研究:使用韵律引导的语音变异模型进行基于地标的鲁棒语音识别
- 批准号:
0703624 - 财政年份:2007
- 资助金额:
$ 39.58万 - 项目类别:
Continuing Grant