Automatic evaluation of speech quality

语音质量自动评估

基本信息

项目摘要

DESCRIPTION (provided by applicant): Tests of several different approaches to the automatic evaluation of the quality of speech segments are proposed. Previous systems for use in pronunciation training have typically employed either automatic speech-recognition (ASR) technology, or have used templates based on a limited number of utterances rated as excellent by L1 listeners (and sometimes also employing a second set of utterances containing a common pronunciation error). Here speech-processing technologies (HMM's and ANN's) will be developed specifically for use as evaluation systems (not recognition systems) to predict quality and locus-of-error judgments assigned by listeners. Termed the "evaluation-of-single-words" (ESW) approach, the special feature of these systems will derive from the training tokens employed in their development: multiple recordings of a single word made by groups of native and non-native talkers. Sixty talkers will be native speakers of Arabic, whose intelligibility in English ranges from poor to near-perfect, and 60 talkers will be native speakers of middle-American English. There will be twelve words divided between one, two, and three syllables. Ten productions of each word will be recorded by each talker, yielding 14,400 tokens. Each token will be rated by listening juries for pronunciation quality, and the tokens will also be categorized into perceptual clusters, using MDS and cluster-analysis techniques. At least two computer-based evaluation systems (HMM and ANN) will be trained for each individual word, with the goals of predicting overall pronunciation quality and identifying specific commonly occurring pronunciation errors. It is expected that these word-specific systems, each representing a discrete "evaluator" custom-built for an individual word, will approach the maximum accuracy that can be expected of this class of processors. If successful, the ESW approach may have a broad range of applications in pronunciation training, identification of a speaker's L1, foreign-language instruction, and other non-lexical applications. However, our specific goal is the development of systems that can provide informative feedback during automated pronunciation training. In ASR applications, the goal is to respond the same way to a word, no matter how it is pronounced. The goal of an ESW system is to respond differentially to pronunciation variants. This distinction between ASR and ESW is central to the development of successful evaluation systems as it dictates different modeling constraints.
描述(由申请人提供):提出了对语音段质量的自动评估的几种不同方法的测试。用于发音训练的先前系统通常采用自动语音识别(ASR)技术,或者使用基于被L1听者评为优秀的有限数量的发音的模板(并且有时还使用包含共同发音错误的第二组发音)。在这里,语音处理技术(HMM和ANN)将专门作为评估系统(而不是识别系统)来开发,以预测听者分配的质量和错误位置判断。这些系统的特殊特征被称为“单字评估”(ESW)方法,其特点来自于在其开发过程中采用的训练标志:由一组母语和非母语说话者对一个单词进行多次录音。60名演讲者将以阿拉伯语为母语,英语的可理解性从差到近乎完美,60名演讲者将以中美英语为母语。将有12个单词分为单音节、双音节和三音节。每个说话者将录制每个单词的十个作品,产生14,400个代币。每个令牌都将由听音评审团根据发音质量进行评级,并使用MDS和聚类分析技术将这些令牌归类为感知簇。将为每个单词培训至少两个基于计算机的评估系统(HMM和ANN),目的是预测总体发音质量并确定具体的常见发音错误。预计这些特定于单词的系统将接近这类处理器所能期望的最大精度,每个系统都代表一个为单个单词定制的离散“评估器”。如果成功,ESW方法可能会在发音训练、识别说话人的母语、外语教学和其他非词汇应用方面有广泛的应用。然而,我们的具体目标是开发能够在自动发音训练期间提供信息反馈的系统。在ASR应用程序中,目标是对一个单词做出相同的反应,无论它是如何发音的。ESW系统的目标是对发音变体做出不同的反应。ASR和ESW之间的这种区别是开发成功的评估系统的核心,因为它规定了不同的建模约束。

项目成果

期刊论文数量(1)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

CHARLES S WATSON其他文献

CHARLES S WATSON的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('CHARLES S WATSON', 18)}}的其他基金

Multi-site study of speech perception training for hearing-aid users
助听器使用者言语感知训练的多中心研究
  • 批准号:
    8466453
  • 财政年份:
    2010
  • 资助金额:
    $ 10万
  • 项目类别:
Multi-site study of speech perception training for hearing-aid users
助听器使用者言语感知训练的多中心研究
  • 批准号:
    8523825
  • 财政年份:
    2010
  • 资助金额:
    $ 10万
  • 项目类别:
Multi-site study of speech perception training for hearing-aid users
助听器使用者言语感知训练的多中心研究
  • 批准号:
    8704367
  • 财政年份:
    2010
  • 资助金额:
    $ 10万
  • 项目类别:
Multi-site study of speech perception training for hearing-aid users
助听器使用者言语感知训练的多中心研究
  • 批准号:
    8889053
  • 财政年份:
    2010
  • 资助金额:
    $ 10万
  • 项目类别:
Multi-site study of speech perception training for hearing-aid users
助听器使用者言语感知训练的多中心研究
  • 批准号:
    7984150
  • 财政年份:
    2010
  • 资助金额:
    $ 10万
  • 项目类别:
Telephone screening test for hearing using three-digit sequences in noise.
在噪音中使用三位数序列进行听力电话筛查测试。
  • 批准号:
    7856855
  • 财政年份:
    2009
  • 资助金额:
    $ 10万
  • 项目类别:
A national screening test for hearing, administered by telephone.
通过电话进行的全国听力筛查测试。
  • 批准号:
    8431990
  • 财政年份:
    2009
  • 资助金额:
    $ 10万
  • 项目类别:
A National Screening Test for Hearing Administered by Telephone
通过电话进行的全国听力筛查测试
  • 批准号:
    8607174
  • 财政年份:
    2009
  • 资助金额:
    $ 10万
  • 项目类别:
Telephone screening test for hearing using three-digit sequences in noise.
在噪音中使用三位数序列进行听力电话筛查测试。
  • 批准号:
    7670141
  • 财政年份:
    2009
  • 资助金额:
    $ 10万
  • 项目类别:
DISCRIMINATION AND IDENTIFICATION OF AUDITORY PATTERNS
听觉模式的辨别和识别
  • 批准号:
    2125265
  • 财政年份:
    1983
  • 资助金额:
    $ 10万
  • 项目类别:

相似国自然基金

基于MFSD2A调控血迷路屏障跨细胞囊泡转运机制的噪声性听力损失防治研究
  • 批准号:
    82371144
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
YTHDF1通过m6A修饰调控耳蜗毛细胞炎症反应在老年性聋中的作用机制研究
  • 批准号:
    82371140
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
TRIM21蛋白促进HIF1α的降解介导耳蜗血管纹缘细胞缺血再灌注致听力损伤的机制研究
  • 批准号:
    82371142
  • 批准年份:
    2023
  • 资助金额:
    49.00 万元
  • 项目类别:
    面上项目
基于WHO-HEARING理论框架的老年人听力障碍社区康复模式构建与优化策略研究
  • 批准号:
  • 批准年份:
    2022
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
常染色体隐性遗传感音神经性耳聋的分子致病机理研究
  • 批准号:
    30572015
  • 批准年份:
    2005
  • 资助金额:
    26.0 万元
  • 项目类别:
    面上项目

相似海外基金

REU Site: Research Experiences for Deaf and Hard of Hearing Students in Molecular Signaling - How Cells and Organisms Make Decisions
REU 网站:聋哑学生在分子信号传导方面的研究经验 - 细胞和生物体如何做出决策
  • 批准号:
    2349274
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Standard Grant
Sensory and bioengineering approaches to predict hearing abilities in fish
预测鱼类听力的感官和生物工程方法
  • 批准号:
    DE240100188
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Discovery Early Career Researcher Award
Understanding the neural basis of hearing function and dysfunction in vivo.
了解体内听力功能和功能障碍的神经基础。
  • 批准号:
    BB/Y000374/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Cochlear implants and spatial hearing: Enabling access to the next dimension of hearing (Cherish)
人工耳蜗和空间听力:实现听力的下一个维度(Cherish)
  • 批准号:
    EP/Y031946/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Uncovering the Functional Effects of Neurotrophins in the Auditory Brainstem
揭示神经营养素对听觉脑干的功能影响
  • 批准号:
    10823506
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
An electroencephalography study of the neural correlates of visual habituation in infants with hearing loss
听力损失婴儿视觉习惯神经相关性的脑电图研究
  • 批准号:
    ES/X001946/1
  • 财政年份:
    2024
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Auditory Cortex Plasticity Following Deafness
耳聋后的听觉皮层可塑性
  • 批准号:
    478943
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Operating Grants
Measuring the cognitive and neural underpinnings of listening effort
测量听力努力的认知和神经基础
  • 批准号:
    495552
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Miscellaneous Programs
Predicting language under difficult conditions: Effects of cognitive load, noise, and hearing impairment
在困难条件下预测语言:认知负荷、噪音和听力障碍的影响
  • 批准号:
    ES/X001148/1
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Research Grant
Social participation and changes in hearing ability
社会参与与听力变化
  • 批准号:
    2863399
  • 财政年份:
    2023
  • 资助金额:
    $ 10万
  • 项目类别:
    Studentship
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了