权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

マルチモーダル音声認識・合成によるインターフェースの構築

使用多模态语音识别和合成构建界面

基本信息

批准号：
10780226
负责人：
徳田恵一
金额：
$ 1.47万
依托单位：
Nagoya Institute of Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Encouragement of Young Scientists (A)
财政年份：
1998
资助国家：
日本
起止时间：
1998 至 1999
项目状态：
已结题

项目摘要

人間が関わるコミュニケーションにおいては、視覚と聴覚情報の担う役割が非常に大きい。このため,人間に優しいヒューマンインターフェースを実現する上で,視覚・聴覚を融合したマルチモーダルインターフェースの開発が重要な課題となっている。このようなマルチモーダルインターフェースの一つとして,音声と唇動画像による「バイモーダル音声認識」と,任意の文字テキストから自然な音声と唇の動きを同時に生成する「バイモーダル音声合成」を,「バイモーダル音声入出力システム」として統一された枠組の中で実現することを目指し,以下のような研究を行った.・唇画像データベースの作成:音節または音素を単位とするHMM作成のために必要な唇画像を音声と同期して収録した。また,同期収録音声に基づいてラベル付けを行った。・唇動画像による音声認識唇のためのHMMの学習法について検討し,新たに提案した位置の正規化学習が効果的であることを示した.・唇動画像の生成については,輪郭モデルを用いるものと,画像ベースのものとを並行して,検討した.いずれにおいても,これまでに提案したHMMからのパラメータ生成アルゴリズムを用いることにより,良好な唇動画像を生成できることを確かめた.・以上の成果に基づいて,「入力音声に同期した唇動画像を生成するシステム」,「テキストから,音声と唇動画像を同時に生成するシステム」などを構築し,それらの有用性を示した.

The human relationship between the two countries is very important. This is an important issue in the development of human beings. "Sound recognition","Sound input force","," Sound input force "," input force "," output "," output,"output, Lip portrait creation: syllables, phonemes, HMM creation In addition, the sound recording system can also be used to record sound. Lip animation image, sound recognition, lip recognition, HMM learning method, new proposal, position normalization, learning results, etc. Lip animation image generation, wheel country animation image generation. In the middle of the film, the film is proposed to HMM, and the film is generated from the film. The above results are based on the following: "the generation of lip animation images in the same time of sound","the generation of lip animation images in the same time of sound","the construction of lip animation images in the same time of sound", and their usefulness.

项目成果

期刊论文数量（35）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

Oscar Vanegas: "Intensity/Iocation normalization for automatice lipreading" Proc. International Conference on Signal Processing. vol.2. 920-923 (1998)

Oscar Vanegas：“自动唇读的强度/位置标准化”Proc。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

K,Tokuda: "Speech parameter generation algorithms for HMM-based speech synthesis"Proceedings of International Conference on Acoustics,Speech,and Signal Processing. (採録決定済). (2000)

K，Tokuda：“基于 HMM 的语音合成的语音参数生成算法”国际声学、语音和信号处理会议论文集（已接受）。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

O.Vanegas: "Location normalization of HMM-based lip reading : Experiments for the M2VTS Database"IEEE International Conference on Image Processing. (1999)

O.Vanegas：“基于 HMM 唇读的位置标准化：M2VTS 数据库的实验”IEEE 国际图像处理会议。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

Masatsune Tamura: "Visual speech synthesis based on parameter generation from HMM:speech-driven and text-and-speech-driven approach" Proc. International Conference of Auditory-Visual Speech Proccssing. 219-224 (1998)

Masatsune Tamura：“基于 HMM 参数生成的视觉语音合成：语音驱动和文本和语音驱动方法”Proc。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

T.Yoshimura: "Simultaneous modeling of spectrum,pitch and duration in HMM-based speech synthesis"Proceedings of European Conference on Speech Communication and Technology. (1999)

T.Yoshimura：“基于 HMM 的语音合成中频谱、音调和持续时间的同步建模”欧洲语音通信与技术会议论文集。

DOI：
发表时间：
期刊：
影响因子：
0
作者：
通讯作者：

DOI：
{{ item.doi }}
发表时间：
{{ item.publish_year }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.author }}

数据更新时间：{{ patent.updateTime }}

徳田恵一其他文献

英語音声合成における韻律推定モデルと音響モデルの同時学習

英语语音合成中韵律估计模型和声学模型的同时学习

DOI：
发表时间：
2008
期刊：
影响因子：
0
作者：
大浦圭一郎;戸田智基;南角吉彦;徳田恵一;マイアハニエリ;坂井信輔;中村哲
通讯作者：
中村哲

分離型2次元格子HMMに基づく顔画像認識

基于可分离二维网格HMM的人脸图像识别

DOI：
发表时间：
2005
期刊：
2005年FIT講演論文集
影响因子：
0
作者：
布目哲也;南角吉彦;徳田恵一;北村正
通讯作者：
北村正

Blizzar Challenge 2007のための平均声に基づくHMM音声合成システムの評価

2007 年暴雪挑战赛基于平均语音的 HMM 语音合成系统评估

DOI：
发表时间：
2008
期刊：
影响因子：
0
作者：
能勢隆;山岸順一;全柄河;戸田智基;徳田恵一
通讯作者：
徳田恵一

Knowledge-based Discovery in Systems Biology using CF-Induction.

使用 CF-Induction 在系统生物学中进行基于知识的发现。

DOI：
发表时间：
2007
期刊：
New Trends in Applied Artificial Intelligence, Lecture Notes in Artificial Intelligence 4570
影响因子：
0
作者：
全柄河;南角吉彦;徳田恵一;Andrei Doncescu
通讯作者：
Andrei Doncescu