权益分类	功能权益	普通用户	{{item.name}}会员
{{category.name}}	{{benefitItem.name}}

統計的声質変換を用いた無喉頭音声の品質改善

使用统计语音转换提高非喉部语音的质量

基本信息

批准号：
11J08741
负责人：
土井啓成
金额：
$ 0.83万
依托单位：
Nara Institute of Science and Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for JSPS Fellows
财政年份：
2011
资助国家：
日本
起止时间：
2011 至 2012
项目状态：
已结题

项目摘要

本研究は,事故や病気等で喉頭を摘出した喉頭摘出者が発声する無喉頭音声の品質を,統計的声質変換を用いて改善することにより,喉頭摘出者のQOL (Quality of Life)改善を目的とした.喉頭摘出者は,呼気を利用した発声が行えないため,無喉頭音声と呼ばれる音声を用いて,音声コミュニケーションを行うが,その音質は非常に低く,また,話者の判別がつかない.そこで,本研究では,統計的声質変換を応用し,無喉頭音声の音質と話者性の改善を試みた.統計的声質変換とは,ある話者の声を別の話者の声に変換する技術であり,これを用いて,無喉頭音声を健常者の通常音声に変換することで,その音質を改善する.また,さらに1対多固有声変換を用いることで,話者性の回復も試みた.一対多固有声変換とは,変換先の音声の声質を任意に操作することができる技術である.これを用いることで,喉頭摘出者は,任意の声質で発声することが可能になり,それはすなわち,声質におけるアイデンティティーの確立につながる.本研究では,食道音声,電気音声,微弱電気音声の3つの無喉頭音声に対し,一対多固有声変換による品質改善法を導入し,それらを総合的に評価した.評価の結果,提案法が各種音声の音質を大きく改善し,話者性の回復も見込めることが分かった.また,提案法により生成された変換音声を,詳細に比較したことにより,提案法を用いた際の各無喉頭音声における利点等が明らかになった.この結果は,喉頭摘出者が用いる無喉頭音声を選ぶ際の指標の一つとなり得るものである.さらに,提案法の実用化に向けて,提案法のアプリケーションの作成を試みた.ただし,より使いやすいインターフェース作成のため,より多くの被験者が見込める歌声変換用アプリケーションとして作成し,また,そのあたりをまとめて学会等に発表した.その結果,多くの意見が得られ,使いやすいインターフェースができた.しかしながら,そのインターフェースと提案法をつなぎ合わせるところまでは至れなかった.

In this study, the patients with larynx extirpation, such as accident and illness, had no larynx sound, and the statistical data were used to improve the sound quality, while the larynx extractor QOL (Quality of Life) improved the objective. If you take out the throat, use the sound of the throat, sound, sound, The results of this study showed that the statistical data were used in the study, and the sexual improvement of the patients with larynx-free tone was tested. According to the statistics, the sound sound is not clear, and the sound is different from the sound of the other person. The sound of the normal voice without throat is usually low, and the sound is improved. If you want to use a number of inherent sound signals, you will have a sexual response. If you have more than one inherent sound, you should first use the sound, the sound, the random operation, the technical skills. If you want to make a sound, you can make a sound and make a sound. In this study, the sound of esophagus, sound of esophagus, sound of electronic sound, weak sound sound, sound sound without throat, sound sound with more than one inherent sound, sound improvement method, and sound quality improvement method were introduced into this study. As a result of the results, the proposed method proposed to improve the sound quality of all sound systems, and those who had sex responded to each other. In the proposed method, the sound is generated, the sound is better than the sound, and the sound is clear in the proposed method. The results show that the throat extractor uses the larynx-free tone to select the voice to indicate that the larynx is easy to get rid of. The proposal method will be applied to the market, and the proposal law will be used as an attempt to solve the problem. Please tell me how to make it, so that it can be made into a form, such as the one who has been killed, the one who has been killed, the song, the girl, the girl, the As a result, many opinions have been received, and the results have been greatly influenced by the results. I don't know if I want to propose a bill for the bill.

项目成果

期刊论文数量（0）

专著数量（0）

科研奖励数量（0）

会议论文数量（0）

专利数量（0）

多対多固有声変換に基づく歌声声質変換及び歌声合成を用いた学習データ生成

基于多对多特征语音转换的歌声质量转换和歌声合成的训练数据生成

DOI：
发表时间：
2012
期刊：
影响因子：
0
作者：
S. Harish;K. Ishikawa;E. Einarsson;S. Aikawa;T. Inoue;P. Zhao;M. Watanabe;S. Chiashi;J. Shiomi;S. Maruyama;土井啓成
通讯作者：
土井啓成

VocaListenerによる学習データ生成を利用した多対多固有声変換に基く歌声声質変換

基于使用 VocaListener 学习数据生成的多对多特征语音转换的歌声音质转换

DOI：
发表时间：
2012
期刊：
影响因子：
0
作者：
S.Chiashi;H.Okabe;T.Inoue;I.Shiomi;T.Sato;S.Kono;M.Terasawa;S.Maruyama;土井啓成
通讯作者：
土井啓成

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system

DOI：
发表时间：
2012-12
期刊：
Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference
影响因子：
0
作者：
Hironori Doi;T. Toda;Tomoyasu Nakano;Masataka Goto;Satoshi Nakamura
通讯作者：
Hironori Doi;T. Toda;Tomoyasu Nakano;Masataka Goto;Satoshi Nakamura

An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

DOI：
10.1109/icassp.2011.5947513
发表时间：
2011-05
期刊：
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
影响因子：
0
作者：
Hironori Doi;Keigo Nakamura;T. Toda;H. Saruwatari;K. Shikano
通讯作者：
Hironori Doi;Keigo Nakamura;T. Toda;H. Saruwatari;K. Shikano