TRANSFORM: flexible voice synthesis through articulatory voice transformation
TRANSFORM:通过发音转换实现灵活的语音合成
基本信息
- 批准号:0414675
- 负责人:
- 金额:--
- 依托单位:
- 依托单位国家:美国
- 项目类别:Standard Grant
- 财政年份:2005
- 资助国家:美国
- 起止时间:2005-05-15 至 2009-04-30
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Many people have always wanted machines to talk to them, but most have strong preferences for particular voices. Current techniques in speech synthesis can build voices that sound very close to the original speaker, capturing the style, manner and articulation of the source voice. However such systems require many hours of carefully recorded speech and expert tuning to reach an acceptable level of quality. An exciting new alternative method for building synthetic voices is voice transformation. This method uses an existing recorded database and converts it to a target voice using as little as 10-20 sentences. This technique offers the potential to make speech synthesizers talk in whatever voice desired, with significantly less effort required than previous techniques.This project offers a new direction in voice transformation. Current transformation techniques concentrate on a spectral mapping of the voice, i.e., converting the properties of the speech signal. Instead we use the underlying positions of the vocal tract articulators (i.e., the position of the teeth, tongue, lips, velum), which give rise to the spectral output of the voice. Using new statistical modeling techniques we can successfully predict the positions of a speaker's articulators from the speech signal. Then in the virtual vocal tract domain map between speakers and regenerate the speech for the target voice.This work enables the easy construction of new synthetic voices allowing personalization of speech output. It increases our knowledge of the speech generation process and characterizes what make a voice personal.
许多人一直希望机器能与他们对话,但大多数人对特定的声音有强烈的偏好。当前的语音合成技术可以构建听起来非常接近原始说话人的声音,捕捉源声音的风格、方式和发音。然而,这种系统需要经过数小时的仔细录音和专家调整,才能达到可接受的质量水平。构建合成声音的另一种令人兴奋的新方法是声音变换。这种方法利用现有的记录数据库,只需10-20个句子就可以将其转换为目标语音。这项技术使语音合成器有可能以任何想要的声音说话,而所需的工作量比以前的技术要少得多。该项目为语音转换提供了一个新的方向。当前的变换技术集中于语音的频谱映射,即,转换语音信号的属性。相反,我们使用声道发音器的潜在位置(即牙齿、舌头、嘴唇、膜的位置),这会引起声音的光谱输出。使用新的统计建模技术,我们可以成功地从语音信号中预测说话人发音器的位置。然后在虚拟声道域中映射说话人之间的关系,并为目标语音重新生成语音。这项工作使得能够容易地构建新的合成语音,从而允许语音输出的个性化。它增加了我们对语音生成过程的了解,并使声音具有个人化的特征。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Alan Black其他文献
Introducing Supplemental Context for Word Sense Disambiguation
引入补充上下文进行词义消歧
- DOI:
10.1109/ictai.2016.0164 - 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Alan Black;Rosina O. Weber - 通讯作者:
Rosina O. Weber
Experience with day stay surgery
- DOI:
10.1016/s0022-3468(80)80396-4 - 发表时间:
1980-02-01 - 期刊:
- 影响因子:
- 作者:
Douglas Cohen;John Keneally;Alan Black;Sandra Gaffney;Andra Johnson - 通讯作者:
Andra Johnson
Tweet recall: examining real-time civic discourse on twitter
推文回忆:检查推特上的实时公民话语
- DOI:
10.1145/2389176.2389233 - 发表时间:
2012 - 期刊:
- 影响因子:0
- 作者:
C. Mascaro;Alan Black;S. Goggins - 通讯作者:
S. Goggins
Rethinking the smart closet as an opportunity to enhance the social currency of clothing
重新思考智能衣柜作为增强服装社交货币的机会
- DOI:
10.1145/2370216.2370245 - 发表时间:
2012 - 期刊:
- 影响因子:0
- 作者:
J. Rode;Rachel M. Magee;Melinda Sebastian;Alan Black;Rachel Yudell;Aly Gibran;Nora Mcdonald;J. Zimmerman - 通讯作者:
J. Zimmerman
Inhibition of microsomal aldrin epoxidation by diquat and several related bipyridvlium compounds
敌草快和几种相关联吡啶化合物对微粒体艾氏剂环氧化的抑制作用
- DOI:
- 发表时间:
1973 - 期刊:
- 影响因子:2.7
- 作者:
R. Krieger;Philip W. Lee;Alan Black;T. Fukuto - 通讯作者:
T. Fukuto
Alan Black的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Alan Black', 18)}}的其他基金
RI: Small: Modeling Lexical Borrowing to Bridge the "Linguistic Divide" in Natural Language Processing
RI:小:建模词汇借用以弥合自然语言处理中的“语言鸿沟”
- 批准号:
1526745 - 财政年份:2015
- 资助金额:
-- - 项目类别:
Standard Grant
ITR: Evaluation and Personalization of Synthetic Voices
ITR:合成语音的评估和个性化
- 批准号:
0219687 - 财政年份:2002
- 资助金额:
-- - 项目类别:
Continuing Grant
相似国自然基金
A study on prototype flexible multifunctional graphene foam-based sensing grid (柔性多功能石墨烯泡沫传感网格原型研究)
- 批准号:
- 批准年份:2020
- 资助金额:20 万元
- 项目类别:
相似海外基金
Flexible Control Authority With a Robotic Arm: Facilitating Seamless Transitions Between User and Robot Control in Multi-Action Manipulation Tasks.
机械臂的灵活控制权限:促进多动作操作任务中用户和机器人控制之间的无缝过渡。
- 批准号:
10637707 - 财政年份:2023
- 资助金额:
-- - 项目类别:
Lightweight flexible shoulder prosthesis with various operation input modalities composed mainly of voice and small safe intuitive feedback device
以语音和小型安全直观反馈装置为主的多种操作输入方式的轻型柔性肩假肢
- 批准号:
19K20741 - 财政年份:2019
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Early-Career Scientists
EasyVis: Flexible, immersive three-dimensional laparoscopic surgical visualization through multi-camera arrays
EasyVis:通过多摄像头阵列实现灵活、身临其境的三维腹腔镜手术可视化
- 批准号:
10442392 - 财政年份:2015
- 资助金额:
-- - 项目类别:
EasyVis: Flexible, immersive three-dimensional laparoscopic surgical visualization through multi-camera arrays
EasyVis:通过多摄像头阵列实现灵活、身临其境的三维腹腔镜手术可视化
- 批准号:
10614026 - 财政年份:2015
- 资助金额:
-- - 项目类别:
Investigating neural mechanisms for flexible, robust speech perception with fMRI
利用功能磁共振成像研究灵活、稳健的语音感知的神经机制
- 批准号:
8992857 - 财政年份:2015
- 资助金额:
-- - 项目类别:
EasyVis: Flexible, immersive three-dimensional laparoscopic surgical visualization through multi-camera arrays
EasyVis:通过多摄像头阵列实现灵活、身临其境的三维腹腔镜手术可视化
- 批准号:
10209924 - 财政年份:2015
- 资助金额:
-- - 项目类别:
Development of speech analysis-synthesis system enabling flexible control of voice quality caused by peculiarity of vocal cords
开发语音分析合成系统,可根据声带的特殊性灵活控制语音质量
- 批准号:
25870883 - 财政年份:2013
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Young Scientists (B)
Automatic voice building for flexible speech synthesis
自动语音构建,实现灵活的语音合成
- 批准号:
14380160 - 财政年份:2002
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Scientific Research (B)
Speech Synthesis Method for Flexible Voice Quality Control
用于灵活语音质量控制的语音合成方法
- 批准号:
08680386 - 财政年份:1996
- 资助金额:
-- - 项目类别:
Grant-in-Aid for Scientific Research (C)