RI: Medium: Collaborative Research: The Effect of Subglottal Resonances on Machine and Human Speaker Normalization

RI:媒介:合作研究:声门下共振对机器和人类说话者标准化的影响

基本信息

  • 批准号:
    0905381
  • 负责人:
  • 金额:
    $ 63.97万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Standard Grant
  • 财政年份:
    2009
  • 资助国家:
    美国
  • 起止时间:
    2009-09-01 至 2015-09-30
  • 项目状态:
    已结题

项目摘要

This award is funded under the American Recovery and Reinvestment Act of 2009 (Public Law 111-5).Despite large acoustic differences in the speech of various talkers, humans are generally able to understand each other quickly and easily. The mechanisms by which humans map such variability onto a set of phonemes has been the subject of research for more than 50 years. This "speaker normalization" problem has generally been thought of in terms of normalizing the formant frequencies of a particular speaker with a reference set of formants. In this project, a novel approach to speaker normalization is explored, in which not formants but subglottal resonances(SGRs) are normalized. SGRs have previously been shown to define a set of frequency bands within which formants may vary, yet retaining the same phonemic vowel quality. Normalizing SGRs (and associated frequency bands) therefore reduces formant variability in an effective way. In this project, effects of SGR normalization on automatic speech recognition(ASR) performance are evaluated for both adult and child speakers of English and Spanish. In parallel, effects on human speech perception in multi-talker conditions are explored. Results are expected to improve ASR performance and shed light on human speech production and perception. The project will result in speech databases (including direct recordings of SGR acoustics) and ASR tools, which are critically useful for research in speech production, perception, speaker identification, and speech processing algorithms for cochlear implants and multi-lingual ASR. The collaboration in Engineering, Linguistics, Speech & Hearing, and Psychology facilitates a multidisciplinary learning environment.Publications, results, databases, and tools will be disseminated to the research community.
该奖项是根据2009年美国复苏和再投资法案(公法111-5)资助的。尽管不同说话者的语音存在很大的声学差异,但人类通常能够快速轻松地相互理解。人类将这种变异性映射到一组音素上的机制已经是50多年来的研究主题。这个“说话者归一化”问题通常被认为是用共振峰的参考集合来归一化特定说话者的共振峰频率。 在这个项目中,一个新的方法来说话人归一化进行了探索,其中没有共振峰,但声门下共振(SGR)进行归一化。SGR以前已经被证明定义一组频带,在这些频带内共振峰可能会发生变化,但保留相同的音素元音质量。因此,归一化SGR(和相关联的频带)以有效的方式减少共振峰可变性。在这个项目中,SGR标准化对自动语音识别(ASR)性能的影响进行了评估,为成人和儿童的英语和西班牙语的扬声器。同时,探讨了在多说话者条件下对人类语音感知的影响。预计结果将提高ASR性能,并阐明人类的语音产生和感知。该项目将产生语音数据库(包括SGR声学的直接记录)和ASR工具,这些工具对语音产生,感知,说话人识别以及人工耳蜗和多语言ASR的语音处理算法的研究非常有用。在工程学、语言学、言语听力和心理学方面的合作促进了多学科的学习环境。出版物、成果、数据库和工具将传播给研究界。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Abeer Alwan其他文献

Modeling auditory perception to improve robust speech recognition
建立听觉感知模型以提高稳健的语音识别能力
Unraveling the associations between voice pitch and major depressive disorder: a multisite genetic study
揭示声音音调与重度抑郁症之间的关联:一项多站点遗传研究
  • DOI:
    10.1038/s41380-024-02877-y
  • 发表时间:
    2024-12-31
  • 期刊:
  • 影响因子:
    10.100
  • 作者:
    Yazheng Di;Elior Rahmani;Joel Mefford;Jinhan Wang;Vijay Ravi;Aditya Gorla;Abeer Alwan;Kenneth S. Kendler;Tingshao Zhu;Jonathan Flint
  • 通讯作者:
    Jonathan Flint
Optical Phonetics and Visual Percep Stress in Eng
英语中的光学语音和视觉感知压力
  • DOI:
  • 发表时间:
    2003
  • 期刊:
  • 影响因子:
    0
  • 作者:
    P. Keating;Marco Baroni;Sven Matty;E. T. Auer;Rebecca Scarborough;Abeer Alwan;E. Bernstein
  • 通讯作者:
    E. Bernstein
Towards Automatically Assessing Children’s Picture Description Tasks
自动评估儿童图片描述任务
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Hariram Veeramani;Natarajan Balaji Shankar;Alexander Johnson;Abeer Alwan
  • 通讯作者:
    Abeer Alwan
An Analysis of Large Language Models for African American English Speaking Children’s Oral Language Assessment
非裔美国英语儿童口语评估大语言模型分析
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Alexander Johnson;Christina Chance;Kaycee Stiemke;Hariram Veeramani;Natarajan Balaji Shankar;Abeer Alwan
  • 通讯作者:
    Abeer Alwan

Abeer Alwan的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Abeer Alwan', 18)}}的其他基金

Collaborative Research: Improving speech technology for better learning outcomes: the case of AAE child speakers
协作研究:改进语音技术以获得更好的学习成果:AAE 儿童扬声器的案例
  • 批准号:
    2202585
  • 财政年份:
    2022
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Small: From Ultrasound and MRI to articulatory and acoustic models of child speech development
合作研究:RI:小型:从超声和 MRI 到儿童言语发展的发音和声学模型
  • 批准号:
    2006979
  • 财政年份:
    2020
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Workshop for Undergraduate and MS Female Students in Speech Science and Technology
语音科学与技术本科生和女硕士讲习班
  • 批准号:
    1745166
  • 财政年份:
    2017
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
NRI: INT: COLLAB: Development, Deployment and Evaluation of Personalized Learning Companion Robots for Early Literacy and Language Learning
NRI:INT:COLLAB:用于早期识字和语言学习的个性化学习伴侣机器人的开发、部署和评估
  • 批准号:
    1734380
  • 财政年份:
    2017
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
RI: Medium: Collaborative Research: Variance and Invariance in Voice Quality: Implications for Machine and Human Speaker Identification
RI:媒介:协作研究:语音质量的方差和不变性:对机器和人类说话人识别的影响
  • 批准号:
    1704167
  • 财政年份:
    2017
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Continuing Grant
A Workshop for Junior Female Researchers in Speech Science and Technology
语音科学与技术青年女性研究员研讨会
  • 批准号:
    1637240
  • 财政年份:
    2016
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
The Role of Speech Science in Developing Robust Speech Technology Applications
语音科学在开发强大的语音技术应用中的作用
  • 批准号:
    1543522
  • 财政年份:
    2015
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
EAGER: Collaborative Research: Models of Child Speech
EAGER:合作研究:儿童言语模型
  • 批准号:
    1551113
  • 财政年份:
    2015
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
EAGER: Variance and Invariance in Voice Quality
EAGER:语音质量的方差和不变性
  • 批准号:
    1450992
  • 财政年份:
    2014
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
EAGER: Collaborative Research: Towards Modeling Human Speech Confusions in Noise
EAGER:协作研究:对噪声中的人类语音混乱进行建模
  • 批准号:
    1247809
  • 财政年份:
    2012
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant

相似海外基金

Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312841
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312842
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313151
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Continuing Grant
Collaborative Research: RI: Medium: Principles for Optimization, Generalization, and Transferability via Deep Neural Collapse
合作研究:RI:中:通过深度神经崩溃实现优化、泛化和可迁移性的原理
  • 批准号:
    2312840
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
  • 批准号:
    2312374
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: CompCog: RI: Medium: Understanding human planning through AI-assisted analysis of a massive chess dataset
合作研究:CompCog:RI:中:通过人工智能辅助分析海量国际象棋数据集了解人类规划
  • 批准号:
    2312373
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313149
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Continuing Grant
Collaborative Research: RI: Medium: Superhuman Imitation Learning from Heterogeneous Demonstrations
合作研究:RI:媒介:异质演示中的超人模仿学习
  • 批准号:
    2312955
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Informed, Fair, Efficient, and Incentive-Aware Group Decision Making
协作研究:RI:媒介:知情、公平、高效和具有激励意识的群体决策
  • 批准号:
    2313137
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Standard Grant
Collaborative Research: RI: Medium: Lie group representation learning for vision
协作研究:RI:中:视觉的李群表示学习
  • 批准号:
    2313150
  • 财政年份:
    2023
  • 资助金额:
    $ 63.97万
  • 项目类别:
    Continuing Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了