HCC: High-Quality Compression, Enhancement, and Personalization of Text-to-Speech Voices

HCC:文本转语音的高质量压缩、增强和个性化

基本信息

  • 批准号:
    0713617
  • 负责人:
  • 金额:
    $ 40万
  • 依托单位:
  • 依托单位国家:
    美国
  • 项目类别:
    Continuing Grant
  • 财政年份:
    2007
  • 资助国家:
    美国
  • 起止时间:
    2007-09-01 至 2011-08-31
  • 项目状态:
    已结题

项目摘要

The vast variability of the human speech signal remains a central challenge for Text-to-Speech (TTS) systems. The objective of this research is to develop TTS technologies that focus on elimination of concatenation errors, and accurate speech modifications in the areas of coarticulation, degree of articulation, prosodic effects, and speaker characteristics. The investigators are exploring an asynchronous interpolation model (AIM), which promises to provide for high-quality and flexible TTS. The core idea of AIM is to represent a short region of speech as a composition of several types of features called streams.Each stream is computed by asynchronous interpolation of basis vectors.Each basis vector is associated with a particular phoneme, allophone, or more specialized unit. Thus, the speech region is described by the varying degrees of influence of several types of preceding and following acoustic features. Using AIM, the investigators are also developing methods to optimally compress the acoustic inventories of TTS systems, given a size or a quality constraint, and to adapt the system to a new voice, given a few training samples. The system being researched forms a hybrid between traditional concatenative and formant-based synthesis, having advantages of both, resulting in a high-quality, optimized TTS system with voice adaptation capabilities. TTS has generally recognized societal benefits for universal access, education, and information access by voice. Our research will make it possible, for example, to build personalized TTS systems for individuals with speech disorders who can only intermittently produce normal speech sounds.
人类语音信号的巨大变异性仍然是文本到语音(TTS)系统的核心挑战。这项研究的目标是开发TTS技术,专注于消除拼接错误,并在协同发音、发音程度、韵律效果和说话人特征方面进行准确的语音修改。研究人员正在探索一种异步内插模型(AIM),该模型有望提供高质量和灵活的TTS。AIM的核心思想是将语音的短区域表示为几种称为流的特征的组合,每个流通过基矢量的异步内插来计算,每个基矢量与特定的音素、音素或更特殊的单元相关联。因此,通过几种类型的前后声学特征的不同程度的影响来描述语音区域。使用AIM,研究人员还在开发方法,在给定大小或质量限制的情况下优化压缩TTS系统的声学库存,并在给定几个训练样本的情况下使系统适应新的声音。正在研究的系统是传统级联合成和基于共振峰的合成的混合体,兼具两者的优势,从而产生具有语音适配能力的高质量、优化的TTS系统。TTS普遍认识到通过语音实现普遍获取、教育和信息获取的社会效益。例如,我们的研究将使我们有可能为那些只能间歇性地发出正常语音的言语障碍患者建立个性化的TTS系统。

项目成果

期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Alexander Kain其他文献

Alexander Kain的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Alexander Kain', 18)}}的其他基金

RI: Medium: Collaborative Research: Semi-Supervised Discriminative Training of Language Models
RI:媒介:协作研究:语言模型的半监督判别训练
  • 批准号:
    0964102
  • 财政年份:
    2010
  • 资助金额:
    $ 40万
  • 项目类别:
    Continuing Grant
Collaborative Research: CDI-Type I: Computational Models for the Automatic Recognition of Non-Human Primate Social Behaviors
合作研究:CDI-Type I:自动识别非人类灵长类动物社会行为的计算模型
  • 批准号:
    1027834
  • 财政年份:
    2010
  • 资助金额:
    $ 40万
  • 项目类别:
    Standard Grant
HCC: Medium: Synthesis and Perception of Speaker Identity
HCC:媒介:说话者身份的综合和感知
  • 批准号:
    0964468
  • 财政年份:
    2010
  • 资助金额:
    $ 40万
  • 项目类别:
    Standard Grant
RI: Small: Modeling Coarticulation for Automatic Speech Recognition
RI:小型:自动语音识别的协同发音建模
  • 批准号:
    0915754
  • 财政年份:
    2009
  • 资助金额:
    $ 40万
  • 项目类别:
    Continuing Grant
STTR Phase I: Small Footprint Speech Synthesis
STTR 第一阶段:小规模语音合成
  • 批准号:
    0441125
  • 财政年份:
    2005
  • 资助金额:
    $ 40万
  • 项目类别:
    Standard Grant

相似海外基金

SEQUOIA: Sustainability-driven high quality video compression and delivery
SEQUOIA:可持续发展驱动的高质量视频压缩和传输
  • 批准号:
    96984
  • 财政年份:
    2021
  • 资助金额:
    $ 40万
  • 项目类别:
    Collaborative R&D
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    356076-2013
  • 财政年份:
    2017
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Individual
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    446182-2013
  • 财政年份:
    2017
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Optimizing Video Quality Using Machine-Learning-Controlled Adaptive Resolution, Video Compression
使用机器学习控制的自适应分辨率、视频压缩来优化视频质量
  • 批准号:
    510255-2017
  • 财政年份:
    2017
  • 资助金额:
    $ 40万
  • 项目类别:
    Engage Grants Program
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    446182-2013
  • 财政年份:
    2016
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Accelerator Supplements
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    356076-2013
  • 财政年份:
    2016
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Individual
Metastatic Epidural Spinal Cord Compression: Development of Clinical Prediction Rules to Assess Health-Related Quality of Life and Survival in Surgically Treated Patients
转移性硬膜外脊髓压迫:制定临床预测规则以评估接受手术治疗的患者的健康相关生活质量和生存率
  • 批准号:
    338929
  • 财政年份:
    2015
  • 资助金额:
    $ 40万
  • 项目类别:
    Fellowship Programs
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    356076-2013
  • 财政年份:
    2015
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Individual
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    356076-2013
  • 财政年份:
    2014
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Individual
Quality-of-visual-experience: perceptual assessment, compression and enhancement
视觉体验质量:感知评估、压缩和增强
  • 批准号:
    356076-2013
  • 财政年份:
    2013
  • 资助金额:
    $ 40万
  • 项目类别:
    Discovery Grants Program - Individual
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了