权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

Automatic voice building for flexible speech synthesis

自动语音构建，实现灵活的语音合成

基本信息

批准号：
14380160
负责人：
TOKUDA Keikhi
金额：
$ 5.95万
依托单位：
Nagoya Institute of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (B)
财政年份：
2002
资助国家：
日本
起止时间：
2002 至 2004
项目状态：
已结题

来源：
https://kaken.nii.ac.jp/en/grant/KAKENHI-PROJECT-14380160/
关键词：
speech synthesis voice quality emotional speech HMM-based speech synthesis labeling automatic voice bulding PLEd

项目摘要

The increasing availability of large speech databases makes it possible to construct speech synthesis systems, which are referred to as data-driven or corpus-based approach, by applying statistical learning algorithms. These systems, which can be automatically trained, not only generate natural and high quality synthetic speech but also can reproduce voice characteristics of the original speaker. However, to make the whole voice building process fully-automatic, we need to construct speech databases in an automatic way. In this research work, we investigate automatic voice building techniques for an HMM-based speech synthesis system which can synthesize speech with various voice qualities. First, we implemented an GUI-based labeling tool, called PLEd (Prosody and Linguistic Label Editor). Then, in order to construct an automatic voice building system, we have developed an automatic accent labeling technique. It has been shown that by using the developed system, we have successfully label accent information.

随着大型语音数据库的不断增加，人们可以通过应用统计学习算法来构建语音合成系统，这被称为数据驱动或基于语料库的方法。这些系统可以自动训练，不仅可以生成自然和高质量的合成语音，而且可以再现原始说话人的语音特征。然而，为了使整个语音构建过程完全自动化，我们需要以自动的方式构建语音数据库。在这项研究工作中，我们研究了自动语音建设技术的HMM为基础的语音合成系统，可以合成语音与各种语音质量。首先，我们实现了一个基于GUI的标签工具，称为PLEd（韵律和语言标签编辑器）。然后，为了构建一个自动语音构建系统，我们开发了一个自动口音标注技术。实验结果表明，利用该系统，我们成功地标注了口音信息。

项目成果

期刊论文数量（341）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

An HMM-based speech synthesis system applied to English

DOI：
10.1109/wss.2002.1224415
发表时间：
2002
期刊：
Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002.
影响因子：
0
作者：
Keiichi Tokuda;H. Zen;Alan W. Black
通讯作者：
Keiichi Tokuda;H. Zen;Alan W. Black

Minimum classification error interactive training for speaker identification

说话人识别的最小分类误差交互式训练

DOI：
发表时间：
2005
期刊：
2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005)
影响因子：
0
作者：
Yusuke Kida;Hiroyoshi Yamamoto;Chiyomi Miyajima;Keiichi Tokuda;Tadashi Kitamura
通讯作者：
Tadashi Kitamura

凝人化音声対話エージェント基本ソフトウェアの開発プロジェクト報告

人性化语音对话代理基础软件开发项目报告

DOI：
发表时间：
2003
期刊：
情報処理学会研究報告「音声言語情報処理」 vol.2003,no.049
影响因子：
0
作者：
嵯峨山茂樹;伊藤克亘;宇津呂武仁;甲斐充彦;小林隆夫;下平博;伝康晴;徳田恵一;中村哲;西本卓也;新田恒雄;広瀬啓吉;峯松信明;森島繁生;山下洋一;山田篤;李晃伸
通讯作者：
李晃伸

主観評価に基づくHMM感情音声合成

基于主观评价的HMM情感语音合成