权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

TRANSFORM: flexible voice synthesis through articulatory voice transformation

TRANSFORM：通过发音转换实现灵活的语音合成

基本信息

批准号：
0414675
负责人：
Alan Black
金额：
--
依托单位：
Carnegie-Mellon University
依托单位国家：
美国
项目类别：
Standard Grant
财政年份：
2005
资助国家：
美国
起止时间：
2005-05-15 至 2009-04-30
项目状态：
已结题

来源：
https://www.nsf.gov/awardsearch/showAward?AWD_ID=0414675&HistoricalAwards=false
关键词：
TRANSFORM flexible voice synthesis through

项目摘要

Many people have always wanted machines to talk to them, but most have strong preferences for particular voices. Current techniques in speech synthesis can build voices that sound very close to the original speaker, capturing the style, manner and articulation of the source voice. However such systems require many hours of carefully recorded speech and expert tuning to reach an acceptable level of quality. An exciting new alternative method for building synthetic voices is voice transformation. This method uses an existing recorded database and converts it to a target voice using as little as 10-20 sentences. This technique offers the potential to make speech synthesizers talk in whatever voice desired, with significantly less effort required than previous techniques.This project offers a new direction in voice transformation. Current transformation techniques concentrate on a spectral mapping of the voice, i.e., converting the properties of the speech signal. Instead we use the underlying positions of the vocal tract articulators (i.e., the position of the teeth, tongue, lips, velum), which give rise to the spectral output of the voice. Using new statistical modeling techniques we can successfully predict the positions of a speaker's articulators from the speech signal. Then in the virtual vocal tract domain map between speakers and regenerate the speech for the target voice.This work enables the easy construction of new synthetic voices allowing personalization of speech output. It increases our knowledge of the speech generation process and characterizes what make a voice personal.

许多人一直希望机器能与他们对话，但大多数人对特定的声音有强烈的偏好。当前的语音合成技术可以构建听起来非常接近原始说话人的声音，捕捉源声音的风格、方式和发音。然而，这种系统需要经过数小时的仔细录音和专家调整，才能达到可接受的质量水平。构建合成声音的另一种令人兴奋的新方法是声音变换。这种方法利用现有的记录数据库，只需10-20个句子就可以将其转换为目标语音。这项技术使语音合成器有可能以任何想要的声音说话，而所需的工作量比以前的技术要少得多。该项目为语音转换提供了一个新的方向。当前的变换技术集中于语音的频谱映射，即，转换语音信号的属性。相反，我们使用声道发音器的潜在位置(即牙齿、舌头、嘴唇、膜的位置)，这会引起声音的光谱输出。使用新的统计建模技术，我们可以成功地从语音信号中预测说话人发音器的位置。然后在虚拟声道域中映射说话人之间的关系，并为目标语音重新生成语音。这项工作使得能够容易地构建新的合成语音，从而允许语音输出的个性化。它增加了我们对语音生成过程的了解，并使声音具有个人化的特征。

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

Alan Black其他文献

Introducing Supplemental Context for Word Sense Disambiguation

引入补充上下文进行词义消歧

DOI：
10.1109/ictai.2016.0164
发表时间：
2016
期刊：
2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI)
影响因子：
0
作者：
Alan Black;Rosina O. Weber
通讯作者：
Rosina O. Weber

Experience with day stay surgery

DOI：
10.1016/s0022-3468(80)80396-4
发表时间：
1980-02-01
期刊：
Research article
影响因子：
作者：
Douglas Cohen;John Keneally;Alan Black;Sandra Gaffney;Andra Johnson
通讯作者：
Andra Johnson

Tweet recall: examining real-time civic discourse on twitter

推文回忆：检查推特上的实时公民话语

DOI：
10.1145/2389176.2389233
发表时间：
2012
期刊：
Proceedings of the 2012 ACM International Conference on Supporting Group Work
影响因子：
0
作者：
C. Mascaro;Alan Black;S. Goggins
通讯作者：
S. Goggins

Rethinking the smart closet as an opportunity to enhance the social currency of clothing

重新思考智能衣柜作为增强服装社交货币的机会

DOI：
10.1145/2370216.2370245
发表时间：
2012
期刊：
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
影响因子：
0
作者：
J. Rode;Rachel M. Magee;Melinda Sebastian;Alan Black;Rachel Yudell;Aly Gibran;Nora Mcdonald;J. Zimmerman
通讯作者：
J. Zimmerman